Building Task-Oriented Dialogue Systems for Smart Home

Name

Chien-Hao Chou (周建豪)

Department

In-Service Master Program, Department of Computer Science and Information Engineering

Thesis Title

Building Task-Oriented Dialogue Systems for Smart Home
(為智慧家庭建構任務導向式對話系統)

Files

Full text available for browsing in the system from 2026-1-15 onward.

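Abstract (translated from the Chinese)

As the smart home market grows in economic scale, many proprietary closed
ecosystems have taken shape, for example Apple's HomeKit, Google's Nest, and
Amazon's Alexa. Such closed ecosystems not only restrict users' choices but also
force users to learn how each system is operated, making it hard to fully
experience the convenience a smart home should provide.
This thesis aims to lower the user's learning curve and break through the
closed-ecosystem problem by building a task-oriented dialogue system for smart
homes. Before a task-oriented dialogue system can be built, a dialogue corpus
must be collected. In contrast to traditional human-to-human or human-to-machine
corpus collection, this thesis draws on the Schema-Guided Dialogue (SGD) method
and builds the SmartHomeSGD dialogue simulator. An environment-aware
probabilistic method decides the dialogue actions of the user agent and the
assistant agent, so that the two agents simulate the interaction between a
person and the system more effectively.
For dialogue action design, we designed dedicated dialogue actions for the user
agent and the assistant agent. In addition, to connect the task-oriented
dialogue system to external smart home services (traditional devices, multimedia
players, and HVAC services), we added an EXECUTE dialogue action for the
assistant agent; through this action, HTTP requests are sent to the external
smart home services.
In traditional machine-to-machine corpus generation, machine-generated dialogue
outlines usually require a great deal of manual rewriting to improve the
diversity and naturalness of the corpus. To solve this problem, this thesis
designs dialogue rewriting prompts that guide a large language model through the
rewriting task, effectively lowering labor costs.
Finally, this thesis uses the pre-trained mT5 (Multilingual Text-to-Text
Transfer Transformer) model as its base and fine-tunes it on the SmartHomeSGD
corpus, successfully building a Chinese task-oriented dialogue system for smart
homes.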

Abstract (English)

As the smart home market has expanded, many proprietary closed ecosystems
have emerged, such as Apple's HomeKit, Google's Nest, and Amazon's Alexa.
These closed ecosystems not only limit users' choices but also force users to
learn how to operate each system separately, making it difficult to fully
experience the convenience that smart homes should offer.
This thesis aims to flatten the learning curve and break out of closed
ecosystems by constructing a task-oriented dialogue system for smart homes.
Building such a system first requires a dialogue corpus. Instead of the
traditional human-to-human or human-to-machine collection methods, this thesis
draws on the Schema-Guided Dialogue (SGD) approach and constructs the
SmartHomeSGD dialogue simulator. A context-aware probabilistic method decides
the dialogue actions of the user agent and the assistant agent, allowing the
two agents to simulate interactions between humans and the system more
effectively.
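
The probabilistic action decision can be pictured with a minimal Python sketch. It assumes a small transition table and a single context rule; the action names, probabilities, and the `all_slots_filled` flag are hypothetical and are not taken from the thesis.

```python
import random

# Hypothetical transition table: P(next user action | previous assistant action).
TRANSITIONS = {
    "REQUEST": {"INFORM": 0.7, "NEGATE": 0.1, "GOODBYE": 0.2},
    "CONFIRM": {"AFFIRM": 0.6, "NEGATE": 0.3, "GOODBYE": 0.1},
}


def context_aware_probs(prev_action, context):
    """Adjust the base probabilities with the current context, then renormalize."""
    probs = dict(TRANSITIONS[prev_action])
    # Assumed rule: once every slot is filled, the user has nothing left to INFORM.
    if context.get("all_slots_filled"):
        probs.pop("INFORM", None)
    total = sum(probs.values())
    return {action: p / total for action, p in probs.items()}


def sample_action(prev_action, context, rng):
    """Draw the next user action from the context-adjusted distribution."""
    probs = context_aware_probs(prev_action, context)
    actions = list(probs)
    weights = [probs[a] for a in actions]
    return rng.choices(actions, weights=weights, k=1)[0]


rng = random.Random(0)  # seeded so that simulated dialogues are reproducible
action = sample_action("REQUEST", {"all_slots_filled": True}, rng)
```

With the flag set, only NEGATE and GOODBYE can be drawn after a REQUEST; a simulator would repeat this step turn by turn for both agents.
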
For dialogue action design, we defined dedicated dialogue actions for the user
agent and for the assistant agent. In addition, to connect the task-oriented
dialogue system to external smart home services (such as traditional devices,
media players, and HVAC services), we added an EXECUTE dialogue action for the
assistant agent. This action sends HTTP requests to the external smart home
services.
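
As an illustration of how an EXECUTE action could map to an HTTP request, the sketch below builds (but does not send) a POST request with Python's standard library. The base URL, the `service/method` path layout, and the JSON payload are assumptions made for this example; the record does not describe the actual service API.

```python
import json
import urllib.request
from dataclasses import dataclass, field

BASE_URL = "http://smarthome.example/api"  # hypothetical endpoint


@dataclass
class ExecuteAction:
    """A hypothetical EXECUTE dialogue action emitted by the assistant agent."""
    service: str                 # e.g. "hvac"
    method: str                  # e.g. "set_temperature"
    slots: dict = field(default_factory=dict)


def build_request(action):
    """Translate an EXECUTE action into an HTTP POST request object."""
    url = f"{BASE_URL}/{action.service}/{action.method}"
    body = json.dumps(action.slots).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


req = build_request(ExecuteAction("hvac", "set_temperature", {"temperature": 26}))
```

Passing `req` to `urllib.request.urlopen(req, timeout=5)` would send it; the call is left out here because the endpoint is fictional.
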
In traditional machine-to-machine corpus generation, the machine-generated
dialogue outlines often require substantial manual rewriting to improve the
diversity and naturalness of the corpus. To address this, this thesis designs
dialogue rewriting prompts that guide large language models to perform the
rewriting, effectively reducing labor costs.
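
A rewriting prompt of this kind might be assembled as in the sketch below. The instruction wording, the outline fields (`speaker`, `action`, `text`), and the `n_variants` parameter are all hypothetical; the thesis's actual prompt is not reproduced in this record.

```python
def build_rewrite_prompt(outline, n_variants=1):
    """Turn a machine-generated dialogue outline into an LLM rewriting prompt."""
    lines = [
        "Rewrite the smart home dialogue outline below into natural, varied utterances.",
        "Keep the speaker order and every slot value unchanged.",
        f"Return {n_variants} rewritten version(s) of the whole dialogue.",
        "",
        "Dialogue outline:",
    ]
    # One numbered line per turn so the model can keep turns aligned.
    for i, turn in enumerate(outline, start=1):
        lines.append(f"{i}. [{turn['speaker']}] ({turn['action']}) {turn['text']}")
    return "\n".join(lines)


outline = [
    {"speaker": "USER", "action": "INFORM", "text": "set temperature = 26"},
    {"speaker": "ASSISTANT", "action": "EXECUTE", "text": "hvac.set_temperature(26)"},
]
prompt = build_rewrite_prompt(outline, n_variants=2)
```

The resulting string would be sent to a large language model, and its output would replace the template utterances in the corpus.
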
Finally, this thesis takes the pre-trained mT5 (Multilingual Text-to-Text
Transfer Transformer) model as its base and fine-tunes it on the SmartHomeSGD
corpus, successfully constructing a Chinese task-oriented dialogue system for
smart homes.
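
Because mT5 is a text-to-text model, each training example has to be serialized as a source string and a target string. The sketch below shows one possible serialization; the task prefix, the turn format, and the target label syntax are assumptions for illustration, not the format used in the thesis.

```python
def make_example(task, history, target):
    """Serialize one fine-tuning pair as (source text, target text)."""
    # Flatten the dialogue history into a single line of "speaker: utterance" turns.
    context = " ".join(f"{speaker}: {utterance}" for speaker, utterance in history)
    return f"{task}: {context}", target


# Hypothetical NLU example; the utterance means
# "Set the living room air conditioner to 26 degrees."
source, target = make_example(
    "nlu",
    [("user", "把客廳冷氣調到26度")],
    "INFORM(temperature=26)",
)
```

Pairs like these could then be tokenized and passed to a sequence-to-sequence trainer, with a different prefix for each module.
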

Keywords

★ Smart Home
★ Natural Language Processing
★ Task-Oriented Dialogue System
★ Corpus Construction
★ Schema-Guided Dialogue

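Table of Contents

Chinese Abstract
English Abstract
Table of Contents
List of Figures
List of Tables
1. Introduction
2. Related Work
2.1 MessageSGD
2.2 Schema-Guided LLM Prompting
3. SmartHomeSGD Simulator
3.1 Scenario
3.2 Schema
3.3 Agents and Dialogue Actions
3.4 Context-Aware Probabilities
3.5 Dialogue Action Transition Matrix
3.6 Database (DB)
3.7 Dialogue Simulator
3.8 Dialogue Rewriting
4. SmartHomeTOD
4.1 SmartHomeSGD Task-Oriented Dialogue System
4.2 Task-Oriented Dialogue System Modules
4.2.1 Natural Language Understanding (NLU)
4.2.2 Dialogue State Tracking (DST)
4.2.3 Dialogue Policy (DP)
4.2.4 Natural Language Generation (NLG)
5. Experiments and Analysis
5.1 Corpus Analysis
5.2 Analysis of Large Language Model Rewriting
5.3 Evaluation Metrics
5.3.1 Evaluation Metric Formulas
5.4 Experimental Results
5.4.1 Natural Language Understanding Module
5.4.2 Dialogue State Tracking Module
5.4.3 Dialogue Policy Module
5.4.4 Natural Language Generation Module
6. Conclusion
References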

Advisor

Chia-Hui Chang (張嘉惠)

Approval Date

2025-1-16
