Context-Aware Question-Answer Pairing and Dialogue Act Tagging from Instant Message Chatlog

以作者查詢圖書館館藏

、以作者查詢臺灣博碩士

、以作者查詢全國書目

、勘誤回報

、線上人數：24

、訪客IP：18.217.108.153

姓名

陶玫婉(Thamolwan Poopradubsil) 查詢紙本館藏

畢業系所

資訊工程學系

論文名稱

(Context-Aware Question-Answer Pairing and Dialogue Act Tagging from Instant Message Chatlog)

相關論文

★ 行程邀約郵件的辨識與不規則時間擷取之研究	★ NCUFree校園無線網路平台設計及應用服務開發
★ 網際網路半結構性資料擷取系統之設計與實作	★ 非簡單瀏覽路徑之探勘與應用
★ 遞增資料關聯式規則探勘之改進	★ 應用卡方獨立性檢定於關連式分類問題
★ 中文資料擷取系統之設計與研究	★ 非數值型資料視覺化與兼具主客觀的分群
★ 關聯性字組在文件摘要上的探討	★ 淨化網頁：網頁區塊化以及資料區域擷取
★ 問題答覆系統使用語句分類排序方式之設計與研究	★ 時序資料庫中緊密頻繁連續事件型樣之有效探勘
★ 星狀座標之軸排列於群聚視覺化之應用	★ 由瀏覽歷程自動產生網頁抓取程式之研究
★ 動態網頁之樣版與資料分析研究	★ 同性質網頁資料整合之自動化研究

檔案

[Endnote RIS 格式]

[Bibtex 格式]

[相關文章]

[文章引用]

[完整記錄]

[館藏目錄]

[檢視]

[下載]

本電子論文使用權限為同意立即開放。
已達開放權限電子全文僅授權使用者為學術研究之目的，進行個人非營利性質之檢索、閱讀、列印。
請遵守中華民國著作權法之相關規定，切勿任意重製、散佈、改作、轉貼、播送，以免觸法。

摘要(中)

在本論文中，我們研究了數據準備過程的兩個不同任務：問答配對準備 (Question-Answer Pair Preparation) 和對話行為標註 (Dialogue Act Tagging)。與其他作品不同，我們的數據來自即時通訊（Instant Messaging: IM）平台，參與者更常將長句拆分成短句，分散成多條消息中發送。因此，在準備問答對時，我們還考慮了一個稱為消息合併任務的任務，用以確定這些消息是否需要進行合併以進行回复預測任務。我們提出了一個 CONTEXT-AOA 模型，將上下文（先前的對話）作為除成對消息之外的附加輸入。

其次，在對話行為標註任務，當我們無法獲得更多標註數據時，我們探索了使用域外數據集來處理該任務的可能性。我們對這個任務進行了兩個實驗。第一個實驗是零樣本學習實驗，我們只使用域外數據集訓練模型並在我們的數據集上測試它們，另一個實驗是我們將一些數據集與外部數據一起包含在模型中域數據集並在剩餘數據上測試它們。我們還提出了一個 CONTEXT-BERT-CRF 模型，它利用了 BERT 微調的能力，同時仍然能夠保留對話中的所有話語並將它們全部提供給模型。

我們在問答對準備任務和對話行為標記任務上的實驗顯示，我們提出的模型在大多數實驗中都能夠勝過所有現有模型。為了演示這兩個任務的使用，我們也構建了基於檢索的聊天機器人。此聊天機器人不僅根據用戶的輸入從前述準備的問答對中選擇回應，同時也應用對話行為標註資訊來幫助選擇答案。

摘要(英)

In this thesis, we study two different tasks for data preparation process: Question-Answer Pair Preparation and Dialogue Act Tagging. Unlike other works, our data comes from instant messaging (IM) platform which has different characteristic as participants could split long sentences into short utterances and send them in multiple messages. Therefore, in the preparation for question-answer pairs, we also consider a task called message merging task which aims to determine whether those messages need to be merged or not before generating message pairs for reply-to prediction task. We propose a CONTEXT-AOA model to include the context (previous dialogue) as additional input apart from pairwise messages. For dialogue act tagging task, we explore the possibility of using out-of-domain dataset to deal with this task when we are unable to obtain more annotated data. We conduct two experiments on this task. The first experiment is a zero-shot learning experiment where we train the models using only out-of-domain datasets and test them on our dataset, and another experiment is where we include some of our dataset to the the models along with the out-of-domain datasets and test them on the remaining data. We also propose a CONTEXT-BERT-CRF model which utilizes the ability of BERT and still be able to include all of the utterances from the conversation to the model. Our experiments on both question-answer pair preparation task and dialogue act tagging task show that our proposed models are able to outperform all of the existing models in most of the experiments. To demonstrate the use of these two tasks, the retrieval-based IR-based chatbot has been built. The chatbot will select the response from Q&A pairs prepared in question-answer pair preparation task based on the input from user and return the it back to user. We also apply dialogue act tagging task to help with the answer selection.

關鍵字(中)

★ 對話系統
★ 即時通訊
★ 對話解開
★ 響應選擇
★ 文本分類
★ 對話行為標記
★ 信息檢索

關鍵字(英)

★ dialog system
★ instant messaging
★ conversation disentanglement
★ response selection
★ text classification
★ dialogue act tagging
★ information retrieval

論文目次

摘要.......................... i
Abstract .................... iii
Acknowledgements ............. v
Table of Contents ................ vii
List of Figures .......... ix
List of Tables............ xiii
Chapter 1 Introduction ....... 1
1.1 Background and Motivation . . . . . . 1
1.2 An Overview of Data Preparation Process . . 4
1.2.1 Question-Answer Pair Preparation . . . 5
1.2.2 Dialogue Act Tagging . . 6
1.3 Contribution . . . .8
Chapter 2 Literature Review ............ 11
2.1 Conversation Disentanglement . . 11
2.2 Single-turn vs. Multi-turn Response Selection . . 12
2.3 Message Pair Classification . . . . 13
2.4 Dialogue Act Annotation Schemes . . 14
2.5 Dialogue Act Tagging . . . .
2.6 Learning Sentence Representation . . . . 16
Chapter 3 Problem Definition and Dataset ....... 19
3.1 Question-Answer Pair Preparation . . . 19
3.1.1Training Data Preparation . . . . 20
3.2 Dialogue Act Tagging . . . . 23
3.2.1Training Data Preparation . . . 23
Chapter 4 Proposed Models ........... 27
4.1 Context-Aware Message Pair Classification Model . 27
4.1.1 Contextual Representation Layer . . .28
4.1.2 Attention-over-Attention (AOA) Layer . 28
4.1.3 Final Classification Layer . . 29
4.2 Context-Aware Dialogue Act Tagging Model . . 29
4.2.1 Contextual Representation Layer . . . 30
4.2.2 Context Enrichment Layer . . . 30
4.2.3 Final Classification Layer . . 31
Chapter 5 Experiments......... 33
5.1 Question-Answer Pair Preparation . . 33
5.1.1 Experimental Setup . . . . 33
5.1.2 Performance Comparison . . . 35
5.1.3 User-initiated vs System-initiated Conversations . 37
5.1.4 An Ablation Study of Context-Aware Message Pair Classification Model . . . . 38
5.2 Dialogue Act Tagging . . . . 39
5.2.1 Experiment on ISO-Standard SWDA Data . . . 39
5.2.2 Experiment on QNAP Data . . . . 42
5.2.3 The Impact of data size on Dialogue Act Tagging . . 44
Chapter 6 IR-based QNAP chatbot .......... 47
6.1 IR-based chatbot with Manually Reconstructed Message . . 49
6.2 IR-based chatbot with User Simulator . . 52
6.3 IR-based chatbot with Unseen Data . . 55
Chapter 7 Conclusion and Future work ....... 57
7.1 Conclusion . . . . 57
7.2 Future work . . 58
Bibliography ........... 61
Appendix A ...... 73
Appendix B ....... 75
Appendix C ....... 77
Appendix D ......... 79

參考文獻

[1] Yinpei Dai, Huihua Yu, Yixuan Jiang, Chengguang Tang, Yongbin Li, and Jian Sun.A survey on dialog management: Recent advances and challenges, 2020.
[2] Xiangyang Zhou, Daxiang Dong, Hua Wu, Shiqi Zhao, Dianhai Yu, Hao Tian, XuanLiu, and Rui Yan. Multi-view response selection for human-computer conversation.InProceedings of the 2016 Conference on Empirical Methods in Natural LanguageProcessing, pages 372–381, Austin, Texas, November 2016. Association for Com-putational Linguistics.
[3] Yu Wu, Wei Wu, Chen Xing, Can Xu, Zhoujun Li, and Ming Zhou. A sequentialmatching framework for multi-turn response selection in retrieval-based chatbots.Computational Linguistics, 45(1):163–197, March 2019.
[4] Harshit Kumar, Arvind Agarwal, Riddhiman Dasgupta, Sachindra Joshi, and ArunKumar. Dialogue act sequence labeling using hierarchical encoder with crf, 2017.
[5] Vipul Raheja and Joel Tetreault. Dialogue Act Classification with Context-AwareSelf-Attention. InProceedings of the 2019 Conference of the North American Chap-ter of the Association for Computational Linguistics: Human Language Technolo-gies, Volume 1 (Long and Short Papers), pages 3727–3733, Minneapolis, Minnesota,June 2019. Association for Computational Linguistics.
[6] Stefano Mezza, Alessandra Cervone, Evgeny Stepanov, Giuliano Tortoreto, andGiuseppe Riccardi. ISO-standard domain-independent dialogue act tagging for con-versational agents. InProceedings of the 27th International Conference on Compu-tational Linguistics, pages 3539–3551, Santa Fe, New Mexico, USA, August 2018.Association for Computational Linguistics.
[7] Yiming Cui, Zhipeng Chen, Si Wei, Shijin Wang, Ting Liu, and Guoping Hu.Attention-over-attention neural networks for reading comprehension. InProceed-ings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 593–602, Vancouver, Canada, July 2017. Associa-tion for Computational Linguistics.
[8] A. M. TURING. I.—COMPUTING MACHINERY AND INTELLIGENCE.Mind,LIX(236):433–460, 10 1950.
[9] Joseph Weizenbaum. Eliza—a computer program for the study of natural languagecommunication between man and machine.Commun. ACM, 9(1):36–45, January1966.
[10] Jason D. Williams and Steve Young. Partially observable markov decision processesfor spoken dialog systems.Comput. Speech Lang., 21(2):393–422, April 2007.
[11] Gellert Weisz, Pawel Budzianowski, Pei-Hao Su, and Milica Gasic. Sample ef-ficient deep reinforcement learning for dialogue systems with large action spaces.IEEE/ACM Trans. Audio, Speech and Lang. Proc., 26(11):2083–2097, November2018.
[12] Janarthanan Rajendran, Jatin Ganhotra, Satinder Singh, and Lazaros Polymenakos.Learning end-to-end goal-oriented dialog with multiple answers. InProceedings ofthe 2018 Conference on Empirical Methods in Natural Language Processing, pages3834–3843, Brussels, Belgium, October-November 2018. Association for Compu-tational Linguistics.
[13] Yun-Nung Chen.Unsupervised Learning and Modeling of Knowledge and Intentfor Spoken Dialogue Systems. PhD dissertation, Carnegie Mellon University, 2015.
[14] M. McTear.Conversational AI: Dialogue Systems, Conversational Agents, andChatbots, volume 13. Morgan & Claypool Publishers, 2020.
[15] Micha Elsner and Eugene Charniak. You talking to me? a corpus and algorithmfor conversation disentanglement. InProceedings of ACL-08: HLT, pages 834–842,Columbus, Ohio, June 2008. Association for Computational Linguistics.
[16] Ryan Lowe, Nissan Pow, Iulian Serban, and Joelle Pineau. The Ubuntu dialoguecorpus: A large dataset for research in unstructured multi-turn dialogue systems. InProceedings of the 16th Annual Meeting of the Special Interest Group on Discourseand Dialogue, pages 285–294, Prague, Czech Republic, September 2015. Associa-tion for Computational Linguistics.
[17] Micha Elsner and Eugene Charniak. Disentangling chat.Computational Linguistics,36(3):389–409, 2010.
[18] Shikib Mehri and Giuseppe Carenini. Chat disentanglement: Identifying semanticreply relationships with random forests and recurrent neural networks. InProceed-ings of the Eighth International Joint Conference on Natural Language Processing(Volume 1: Long Papers), pages 615–623, Taipei, Taiwan, November 2017. AsianFederation of Natural Language Processing.
[19] Jyun-Yu Jiang, Francine Chen, Yan-Ying Chen, and Wei Wang. Learning to disen-tangle interleaved conversational threads with a Siamese hierarchical network andsimilarity ranking. InProceedings of the 2018 Conference of the North AmericanChapter of the Association for Computational Linguistics: Human Language Tech-nologies, Volume 1 (Long Papers), pages 1812–1822, New Orleans, Louisiana, June2018. Association for Computational Linguistics.
[20] Dou Shen, Qiang Yang, Jian-Tao Sun, and Zheng Chen. Thread detection in dynamictext message streams. InProceedings of the 29th Annual International ACM SIGIRConference on Research and Development in Information Retrieval, SIGIR ’06, page35–42, New York, NY, USA, 2006. Association for Computing Machinery.
[21] Micha Elsner and Eugene Charniak. Disentangling chat with local coherence mod-els. InProceedings of the 49th Annual Meeting of the Association for ComputationalLinguistics: Human Language Technologies, pages 1179–1189, Portland, Oregon,USA, June 2011. Association for Computational Linguistics.
[22] Jonathan K. Kummerfeld, Sai R. Gouravajhala, Joseph J. Peper, Vignesh Athreya,Chulaka Gunasekara, Jatin Ganhotra, Siva Sankalp Patel, Lazaros C Polymenakos,and Walter Lasecki. A large-scale corpus for conversation disentanglement. InPro-ceedings of the 57th Annual Meeting of the Association for Computational Linguis-tics, pages 3846–3856, Florence, Italy, July 2019. Association for ComputationalLinguistics.
[23] Hao Wang, Zhengdong Lu, Hang Li, and Enhong Chen. A dataset for research onshort-text conversations. InProceedings of the 2013 Conference on Empirical Meth-ods in Natural Language Processing, pages 935–945, Seattle, Washington, USA,October 2013. Association for Computational Linguistics.
[24] Xiangyang Zhou, Lu Li, Daxiang Dong, Yi Liu, Ying Chen, Wayne Xin Zhao, Dian-hai Yu, and Hua Wu. Multi-turn response selection for chatbots with deep attentionmatching network. InProceedings of the 56th Annual Meeting of the Association forComputational Linguistics (Volume 1: Long Papers), pages 1118–1127, Melbourne,Australia, July 2018. Association for Computational Linguistics.
[25] Andreas Stolcke, Klaus Ries, Noah Coccaro, Elizabeth Shriberg, Rebecca Bates,Daniel Jurafsky, Paul Taylor, Rachel Martin, Carol Van Ess-Dykema, and MarieMeteer. Dialogue act modeling for automatic tagging and recognition of conversa-tional speech.Computational Linguistics, 26(3):339–374, 2000.
[26] John J. Godfrey, Edward C. Holliman, and Jane McDaniel. Switchboard: Telephonespeech corpus for research and development. InProceedings of the 1992 IEEEInternational Conference on Acoustics, Speech and Signal Processing - Volume 1,ICASSP’92, page 517–520, USA, 1992. IEEE Computer Society.
[27] Jan Alexandersson, Bianka Buschbeck-Wolf, Tsutomu Fujinami, Michael Kipp, Ste-fan Koch, Elisabeth Maier, Norbert Reithinger, Birte Schmitz, and Melanie Siegel.Dialogue acts in verbmobil-2 : Second edition.Verbmobil Report, 1998(226), 1998.
[28] Jean Carletta and Amy Isard. Hcrc dialogue structure coding manual. Technicalreport, Centre, University of Edinburgh, 1996.
[29] Charles T. Hemphill, John J. Godfrey, and George R. Doddington. The ATIS spokenlanguage systems pilot corpus. InSpeech and Natural Language: Proceedings of aWorkshop Held at Hidden Valley, Pennsylvania, June 24-27,1990, 1990.
[30] Mark G. Core and James F. Allen. Coding dialogs with the damsl annotation scheme.InWorking Notes of the AAAI Fall Symposium on Communicative Action in Humansand Machines, pages 28–35, Cambridge, MA, November 1997.
[31] H.C. Bunt.Dynamic Interpretation and Dialogue Theory, pages 139–188. JohnBenjamins, 2000.
[32] Harry Bunt. The dit++ taxonomy for functional dialogue markup. In Dirk Heylen,Catherine Pelachaud, Roberta Catizone, and David Traum, editors,AAMAS 2009Workshop, Towards a Standard Markup Language for Embodied Dialogue Acts,pages 13–24, 2009.
[33] J. Ang, Yang Liu, and E. Shriberg. Proceedings. (icassp ’05). ieee internationalconference on acoustics, speech, and signal processing, 2005. InAutomatic dialogact segmentation and classification in multiparty meetings, volume 1, pages I/1061–I/1064 Vol. 1, 2005.
[34] Dinoj Surendran and Gina anne Levow. Dialog act tagging with support vectormachines and hidden markov models. InProceedings of Interspeech/ICSLP, 2006.
[35] Ji Young Lee and Franck Dernoncourt. Sequential short-text classification with re-current and convolutional neural networks. InProceedings of the 2016 Conferenceof the North American Chapter of the Association for Computational Linguistics:Human Language Technologies, pages 515–520, San Diego, California, June 2016.Association for Computational Linguistics.
[36] Hamed Khanpour, Nishitha Guntakandla, and Rodney Nielsen. Dialogue act classi-fication in domain-independent conversations using a deep recurrent neural network.InProceedings of COLING 2016, the 26th International Conference on Computa-tional Linguistics: Technical Papers, pages 2012–2021, Osaka, Japan, December2016. The COLING 2016 Organizing Committee.
[37] Daniel Ortega and Ngoc Thang Vu. Neural-based context representation learningfor dialog act classification. InProceedings of the 18th Annual SIGdial Meetingon Discourse and Dialogue, pages 247–252, Saarbrücken, Germany, August 2017.Association for Computational Linguistics.
[38] Zheqian Chen, Rongqin Yang, Zhou Zhao, Deng Cai, and Xiaofei He. Dialogueact recognition via crf-attentive structured network. InThe 41st International ACMSIGIR Conference on Research & Development in Information Retrieval, SIGIR ’18,page 225–234, New York, NY, USA, 2018. Association for Computing Machinery.
[39] Yu Wu, Wei Wu, Ming Zhou, and Zhoujun Li. Sequential match network: A newarchitecture for multi-turn response selection in retrieval-based chatbots.CoRR,abs/1612.01627, 2016.
[40] Yanran Li, Hui Su, Xiaoyu Shen, Wenjie Li, Ziqiang Cao, and Shuzi Niu. Dai-lyDialog: A manually labelled multi-turn dialogue dataset. InProceedings of theEighth International Joint Conference on Natural Language Processing (Volume 1:Long Papers), pages 986–995, Taipei, Taiwan, November 2017. Asian Federation ofNatural Language Processing.
[41] Binxuan Huang, Yanglan Ou, and Kathleen M. Carley. Aspect level sentiment classi-fication with attention-over-attention neural networks.CoRR, abs/1804.06536, 2018.
[42] Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. BERT: Pre-training of deep bidirectional transformers for language understanding. InProceed-ings of the 2019 Conference of the North American Chapter of the Association forComputational Linguistics: Human Language Technologies, Volume 1 (Long andShort Papers), pages 4171–4186, Minneapolis, Minnesota, June 2019. Associationfor Computational Linguistics.
[43] Anne H. Anderson, Miles Bader, Ellen Gurman Bard, Elizabeth Boyle, GwynethDoherty, Simon Garrod, Stephen Isard, Jacqueline Kowtko, Jan McAllister, JimMiller, Catherine Sotillo, Henry S. Thompson, and Regina Weinert. The hcrc maptask corpus.Language and Speech, 34(4):351–366, 1991.
[44] Susanne Jekat, Alexandra Klein, Elisabeth Maier, Ilona Maleck, Marion Mast, andJ. Joachim Quantz. Dialogue acts in verbmobil.Verbmobil Report, 1998(65), 1995.
[45] Geoffrey Leech and M. Weisser.Generic speech act annotation for task-orienteddialogues., pages 441–446. Centre for Computer Corpus Research on LanguageTechnical Papers, Lancaster University, 2003.
[46] John D. Lafferty, Andrew McCallum, and Fernando C. N. Pereira. Conditional ran-dom fields: Probabilistic models for segmenting and labeling sequence data. InPro-ceedings of the Eighteenth International Conference on Machine Learning, ICML’01, page 282–289, San Francisco, CA, USA, 2001. Morgan Kaufmann PublishersInc.
[47] Arman Cohan, Iz Beltagy, Daniel King, Bhavana Dalvi, and Dan Weld. Pretrainedlanguage models for sequential sentence classification. InProceedings of the 2019Conference on Empirical Methods in Natural Language Processing and the 9th In-ternational Joint Conference on Natural Language Processing (EMNLP-IJCNLP),pages 3693–3699, Hong Kong, China, November 2019. Association for Computa-tional Linguistics.
[48] Nils Reimers and Iryna Gurevych. Sentence-BERT: Sentence embeddings usingSiamese BERT-networks. InProceedings of the 2019 Conference on EmpiricalMethods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pages 3982–3992, HongKong, China, November 2019. Association for Computational Linguistics.
[49] Dan Roth and Wen-tau Yih. A linear programming formulation for global inferencein natural language tasks. InProceedings of the Eighth Conference on Computa-tional Natural Language Learning (CoNLL-2004) at HLT-NAACL 2004, pages 1–8,Boston, Massachusetts, USA, May 6 - May 7 2004. Association for ComputationalLinguistics.
[50] Igor Malioutov and Regina Barzilay. Minimum cut model for spoken lecture seg-mentation. InProceedings of the 21st International Conference on ComputationalLinguistics and 44th Annual Meeting of the Association for Computational Linguis-tics, pages 25–32, Sydney, Australia, July 2006. Association for Computational Lin-guistics.
[51] Zhengdong Lu and Hang Li. A deep architecture for matching short texts. In C. J. C.Burges, L. Bottou, M. Welling, Z. Ghahramani, and K. Q. Weinberger, editors,Ad-vances in Neural Information Processing Systems, volume 26, pages 1367–1375.Curran Associates, Inc., 2013.
[52] Baotian Hu, Zhengdong Lu, Hang Li, and Qingcai Chen. Convolutional neuralnetwork architectures for matching natural language sentences. In Z. Ghahramani,M. Welling, C. Cortes, N. Lawrence, and K. Q. Weinberger, editors,Advances inNeural Information Processing Systems, volume 27, pages 2042–2050. Curran As-sociates, Inc., 2014.
[53] Yu Wu, Wei Wu, Chen Xing, Ming Zhou, and Zhoujun Li. Sequential matching net-work: A new architecture for multi-turn response selection in retrieval-based chat-bots. InProceedings of the 55th Annual Meeting of the Association for Computa-tional Linguistics (Volume 1: Long Papers), pages 496–505, Vancouver, Canada,July 2017. Association for Computational Linguistics.
[54] Svetlana Kiritchenko, Xiaodan Zhu, Colin Cherry, and Saif Mohammad. NRC-Canada-2014: Detecting aspects and sentiment in customer reviews. InProceedingsof the 8th International Workshop on Semantic Evaluation (SemEval 2014), pages437–442, Dublin, Ireland, August 2014. Association for Computational Linguistics.
[55] H. Liu, I. Chatterjee, M. Zhou, X. S. Lu, and A. Abusorrah. Aspect-based sentiment
analysis: A survey of deep learning methods.IEEE Transactions on ComputationalSocial Systems, 7(6):1358–1375, 2020.
[56] Yequan Wang, Minlie Huang, Xiaoyan Zhu, and Li Zhao. Attention-based LSTMfor aspect-level sentiment classification. InProceedings of the 2016 Conference onEmpirical Methods in Natural Language Processing, pages 606–615, Austin, Texas,November 2016. Association for Computational Linguistics.
[57] Wei Xue and Tao Li. Aspect based sentiment analysis with gated convolutionalnetworks. InProceedings of the 56th Annual Meeting of the Association for Compu-tational Linguistics (Volume 1: Long Papers), pages 2514–2523, Melbourne, Aus-tralia, July 2018. Association for Computational Linguistics.
[58] Ankur Parikh, Oscar Täckström, Dipanjan Das, and Jakob Uszkoreit. A decom-posable attention model for natural language inference. InProceedings of the 2016Conference on Empirical Methods in Natural Language Processing, pages 2249–2255, Austin, Texas, November 2016. Association for Computational Linguistics.
[59] Harry Bunt, Volha Petukhova, Andrei Malchanau, Alex Fang, and Kars Wijnhoven.The dialogbank: Dialogues with interoperable annotations.Lang. Resour. Eval.,53(2):213–249, June 2019.
[60] Chengyu FANG, Jing CAO, Harry BUNT, and Xiaoyue LIU. The annotation of theswitchboard corpus with the new iso standard for dialogue act analysis. InProceed-ings of the Eighth Joint ACL-ISO Workshop on Interoperable Semantic Annotation,October 2012. Eighth Joint ACL-ISO Workshop on Interoperable Semantic Anno-tation ; Conference date: 03-10-2012 Through 05-10-2012.
[61] Harry Bunt, Jan Alexandersson, Jean Carletta, Jae-Woong Choe, Alex ChengyuFang, Koiti Hasida, Kiyong Lee, Volha Petukhova, Andrei Popescu-Belis, LaurentRomary, Claudia Soria, and David Traum. Towards an ISO standard for dialogueact annotation. InProceedings of the Seventh International Conference on Lan-guage Resources and Evaluation (LREC’10), Valletta, Malta, May 2010. EuropeanLanguage Resources Association (ELRA).
[62] Jean Carletta, Simone Ashby, Sebastien Bourban, Mike Flynn, Mael Guillemot,Thomas Hain, Jaroslav Kadlec, Vasilis Karaiskos, Wessel Kraaij, Melissa Kronen-thal, Guillaume Lathoud, Mike Lincoln, Agnes Lisowska, Iain McCowan, Wil-fried Post, Dennis Reidsma, and Pierre Wellner. The ami meeting corpus: A pre-68
announcement. InProceedings of the Second International Conference on MachineLearning for Multimodal Interaction, MLMI’05, page 28–39, Berlin, Heidelberg,2005. Springer-Verlag.
[63] Elizabeth Shriberg, Raj Dhillon, Sonali Bhagat, Jeremy Ang, and Hannah Carvey.The ICSI meeting recorder dialog act (MRDA) corpus. InProceedings of the 5thSIGdial Workshop on Discourse and Dialogue at HLT-NAACL 2004, pages 97–100,Cambridge, Massachusetts, USA, April 30 - May 1 2004. Association for Computa-tional Linguistics.
[64] S. Quarteroni and G. Riccardi. Classifying dialog acts in human-human and human-machine spoken conversations. InINTERSPEECH, 2010.
[65] Nal Kalchbrenner and Phil Blunsom. Recurrent convolutional neural networks fordiscourse compositionality. InProceedings of the Workshop on Continuous VectorSpace Models and their Compositionality, pages 119–126, Sofia, Bulgaria, August2013. Association for Computational Linguistics.
[66] Yucan Zhou, Qinghua Hu, Jie Liu, and Yuan Jia. Combining heterogeneous deepneural networks with conditional random fields for chinese dialogue act recognition.Neurocomput., 168(C):408–417, November 2015.
[67] Yangfeng Ji, Gholamreza Haffari, and Jacob Eisenstein. A latent variable recurrentneural network for discourse-driven language models. InProceedings of the 2016Conference of the North American Chapter of the Association for ComputationalLinguistics: Human Language Technologies, pages 332–342, San Diego, California,June 2016. Association for Computational Linguistics.
[68] Wei Li and Yunfang Wu. Multi-level gated recurrent neural network for dialog actclassification. InProceedings of COLING 2016, the 26th International Conferenceon Computational Linguistics: Technical Papers, pages 1970–1979, Osaka, Japan,December 2016. The COLING 2016 Organizing Committee.
[69] Tomas Mikolov, Kai Chen, Greg S. Corrado, and Jeffrey Dean. Efficient estimationof word representations in vector space, 2013.
[70] Jeffrey Pennington, Richard Socher, and Christopher Manning. GloVe: Global vec-tors for word representation. InProceedings of the 2014 Conference on Empiri-cal Methods in Natural Language Processing (EMNLP), pages 1532–1543, Doha,Qatar, October 2014. Association for Computational Linguistics.69
[71] Matthew Peters, Mark Neumann, Mohit Iyyer, Matt Gardner, Christopher Clark,Kenton Lee, and Luke Zettlemoyer. Deep contextualized word representations. InProceedings of the 2018 Conference of the North American Chapter of the Asso-ciation for Computational Linguistics: Human Language Technologies, Volume 1(Long Papers), pages 2227–2237, New Orleans, Louisiana, June 2018. Associationfor Computational Linguistics.
[72] Alec Radford, Karthik Narasimhan, Tim Salimans, and Ilya Sutskever. Improvinglanguage understanding by generative pre-training. 2018.
[73] Chi Sun, Luyao Huang, and Xipeng Qiu. Utilizing BERT for aspect-based sentimentanalysis via constructing auxiliary sentence. InProceedings of the 2019 Conferenceof the North American Chapter of the Association for Computational Linguistics:Human Language Technologies, Volume 1 (Long and Short Papers), pages 380–385,Minneapolis, Minnesota, June 2019. Association for Computational Linguistics.
[74] Piotr Bojanowski, Edouard Grave, Armand Joulin, and Tomas Mikolov. Enrichingword vectors with subword information.arXiv preprint arXiv:1607.04606, 2016.
[75] Arthur Brack, Anett Hoppe, Pascal Buschermöhle, and Ralph Ewerth. Sequentialsentence classification in research papers using cross-domain multi-task learning,2021.
[76] L.R. Rabiner. A tutorial on hidden markov models and selected applications inspeech recognition.Proceedings of the IEEE, 77(2):257–286, 1989.
[77] G.D. Forney. The viterbi algorithm.Proceedings of the IEEE, 61(3):268–278, 1973.
[78] Yann N. Dauphin, Angela Fan, Michael Auli, and David Grangier. Language mod-eling with gated convolutional networks. InProceedings of the 34th InternationalConference on Machine Learning - Volume 70, ICML’17, page 933–941. JMLR.org,2017.
[79] Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. Bert: Pre-training of deep bidirectional transformers for language understanding, 2019.
[80] Yaroslav Ganin, Evgeniya Ustinova, Hana Ajakan, Pascal Germain, HugoLarochelle, François Laviolette, Mario Marchand, and Victor Lempitsky. Domain-adversarial training of neural networks, 2016.
[81] Pei-Hao Su, Paweł Budzianowski, Stefan Ultes, Milica Gaši ́c, and Steve Young.Sample-efficient actor-critic reinforcement learning with supervised data for dia-logue management. InProceedings of the 18th Annual SIGdial Meeting on Dis-course and Dialogue, pages 147–157, Saarbrücken, Germany, August 2017. Asso-ciation for Computational Linguistics.
[82] Jason D. Williams, Kavosh Asadi, and Geoffrey Zweig. Hybrid code networks:practical and efficient end-to-end dialog control with supervised and reinforcementlearning. InProceedings of the 55th Annual Meeting of the Association for Compu-tational Linguistics (Volume 1: Long Papers), pages 665–677, Vancouver, Canada,July 2017. Association for Computational Linguistics.
[83] Xiujun Li, Zachary C. Lipton, Bhuwan Dhingra, Lihong Li, Jianfeng Gao, and Yun-Nung Chen. A user simulator for task-completion dialogues, 2017.
[84] Weiyan Shi, Kun Qian, Xuewei Wang, and Zhou Yu. How to build user simula-tors to train RL-based dialog systems. InProceedings of the 2019 Conference onEmpirical Methods in Natural Language Processing and the 9th International JointConference on Natural Language Processing (EMNLP-IJCNLP), pages 1990–2000,Hong Kong, China, November 2019. Association for Computational Linguistics.
[85] Baolin Peng, Xiujun Li, Lihong Li, Jianfeng Gao, Asli Celikyilmaz, Sungjin Lee,and Kam-Fai Wong. Composite task-completion dialogue policy learning via hierar-chical deep reinforcement learning. InProceedings of the 2017 Conference on Em-pirical Methods in Natural Language Processing, pages 2231–2240, Copenhagen,Denmark, September 2017. Association for Computational Linguistics.
[86] Iñigo Casanueva, Paweł Budzianowski, Pei-Hao Su, Stefan Ultes, Lina M. Rojas-Barahona, Bo-Hsiang Tseng, and Milica Gaši ́c. Feudal reinforcement learning fordialogue management in large domains. InProceedings of the 2018 Conferenceof the North American Chapter of the Association for Computational Linguistics:Human Language Technologies, Volume 2 (Short Papers), pages 714–719, New Or-leans, Louisiana, June 2018. Association for Computational Linguistics.
[87] Shang-Yu Su, Xiujun Li, Jianfeng Gao, Jingjing Liu, and Yun-Nung Chen. Discrimi-native deep Dyna-Q: Robust planning for dialogue policy learning. InProceedings ofthe 2018 Conference on Empirical Methods in Natural Language Processing, pages71
3813–3823, Brussels, Belgium, October-November 2018. Association for Compu-tational Linguistics.
[88] David Abel, John Salvatier, Andreas Stuhlmüller, and Owain Evans. Agent-agnostichuman-in-the-loop reinforcement learning, 2017.
[89] Lu Chen, Xiang Zhou, Cheng Chang, Runzhe Yang, and Kai Yu. Agent-awaredropout DQN for safe and efficient on-line dialogue policy learning. InProceedingsof the 2017 Conference on Empirical Methods in Natural Language Processing,pages 2454–2464, Copenhagen, Denmark, September 2017. Association for Com-putational Linguistics.

指導教授

張嘉惠(Chia-Hui Chang)

審核日期

2021-7-26

推文