參考文獻 |
[1] Yinpei Dai, Huihua Yu, Yixuan Jiang, Chengguang Tang, Yongbin Li, and Jian Sun.A survey on dialog management: Recent advances and challenges, 2020.
[2] Xiangyang Zhou, Daxiang Dong, Hua Wu, Shiqi Zhao, Dianhai Yu, Hao Tian, XuanLiu, and Rui Yan. Multi-view response selection for human-computer conversation.InProceedings of the 2016 Conference on Empirical Methods in Natural LanguageProcessing, pages 372–381, Austin, Texas, November 2016. Association for Com-putational Linguistics.
[3] Yu Wu, Wei Wu, Chen Xing, Can Xu, Zhoujun Li, and Ming Zhou. A sequentialmatching framework for multi-turn response selection in retrieval-based chatbots.Computational Linguistics, 45(1):163–197, March 2019.
[4] Harshit Kumar, Arvind Agarwal, Riddhiman Dasgupta, Sachindra Joshi, and ArunKumar. Dialogue act sequence labeling using hierarchical encoder with crf, 2017.
[5] Vipul Raheja and Joel Tetreault. Dialogue Act Classification with Context-AwareSelf-Attention. InProceedings of the 2019 Conference of the North American Chap-ter of the Association for Computational Linguistics: Human Language Technolo-gies, Volume 1 (Long and Short Papers), pages 3727–3733, Minneapolis, Minnesota,June 2019. Association for Computational Linguistics.
[6] Stefano Mezza, Alessandra Cervone, Evgeny Stepanov, Giuliano Tortoreto, andGiuseppe Riccardi. ISO-standard domain-independent dialogue act tagging for con-versational agents. InProceedings of the 27th International Conference on Compu-tational Linguistics, pages 3539–3551, Santa Fe, New Mexico, USA, August 2018.Association for Computational Linguistics.
[7] Yiming Cui, Zhipeng Chen, Si Wei, Shijin Wang, Ting Liu, and Guoping Hu.Attention-over-attention neural networks for reading comprehension. InProceed-ings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 593–602, Vancouver, Canada, July 2017. Associa-tion for Computational Linguistics.
[8] A. M. TURING. I.—COMPUTING MACHINERY AND INTELLIGENCE.Mind,LIX(236):433–460, 10 1950.
[9] Joseph Weizenbaum. Eliza—a computer program for the study of natural languagecommunication between man and machine.Commun. ACM, 9(1):36–45, January1966.
[10] Jason D. Williams and Steve Young. Partially observable markov decision processesfor spoken dialog systems.Comput. Speech Lang., 21(2):393–422, April 2007.
[11] Gellert Weisz, Pawel Budzianowski, Pei-Hao Su, and Milica Gasic. Sample ef-ficient deep reinforcement learning for dialogue systems with large action spaces.IEEE/ACM Trans. Audio, Speech and Lang. Proc., 26(11):2083–2097, November2018.
[12] Janarthanan Rajendran, Jatin Ganhotra, Satinder Singh, and Lazaros Polymenakos.Learning end-to-end goal-oriented dialog with multiple answers. InProceedings ofthe 2018 Conference on Empirical Methods in Natural Language Processing, pages3834–3843, Brussels, Belgium, October-November 2018. Association for Compu-tational Linguistics.
[13] Yun-Nung Chen.Unsupervised Learning and Modeling of Knowledge and Intentfor Spoken Dialogue Systems. PhD dissertation, Carnegie Mellon University, 2015.
[14] M. McTear.Conversational AI: Dialogue Systems, Conversational Agents, andChatbots, volume 13. Morgan & Claypool Publishers, 2020.
[15] Micha Elsner and Eugene Charniak. You talking to me? a corpus and algorithmfor conversation disentanglement. InProceedings of ACL-08: HLT, pages 834–842,Columbus, Ohio, June 2008. Association for Computational Linguistics.
[16] Ryan Lowe, Nissan Pow, Iulian Serban, and Joelle Pineau. The Ubuntu dialoguecorpus: A large dataset for research in unstructured multi-turn dialogue systems. InProceedings of the 16th Annual Meeting of the Special Interest Group on Discourseand Dialogue, pages 285–294, Prague, Czech Republic, September 2015. Associa-tion for Computational Linguistics.
[17] Micha Elsner and Eugene Charniak. Disentangling chat.Computational Linguistics,36(3):389–409, 2010.
[18] Shikib Mehri and Giuseppe Carenini. Chat disentanglement: Identifying semanticreply relationships with random forests and recurrent neural networks. InProceed-ings of the Eighth International Joint Conference on Natural Language Processing(Volume 1: Long Papers), pages 615–623, Taipei, Taiwan, November 2017. AsianFederation of Natural Language Processing.
[19] Jyun-Yu Jiang, Francine Chen, Yan-Ying Chen, and Wei Wang. Learning to disen-tangle interleaved conversational threads with a Siamese hierarchical network andsimilarity ranking. InProceedings of the 2018 Conference of the North AmericanChapter of the Association for Computational Linguistics: Human Language Tech-nologies, Volume 1 (Long Papers), pages 1812–1822, New Orleans, Louisiana, June2018. Association for Computational Linguistics.
[20] Dou Shen, Qiang Yang, Jian-Tao Sun, and Zheng Chen. Thread detection in dynamictext message streams. InProceedings of the 29th Annual International ACM SIGIRConference on Research and Development in Information Retrieval, SIGIR ’06, page35–42, New York, NY, USA, 2006. Association for Computing Machinery.
[21] Micha Elsner and Eugene Charniak. Disentangling chat with local coherence mod-els. InProceedings of the 49th Annual Meeting of the Association for ComputationalLinguistics: Human Language Technologies, pages 1179–1189, Portland, Oregon,USA, June 2011. Association for Computational Linguistics.
[22] Jonathan K. Kummerfeld, Sai R. Gouravajhala, Joseph J. Peper, Vignesh Athreya,Chulaka Gunasekara, Jatin Ganhotra, Siva Sankalp Patel, Lazaros C Polymenakos,and Walter Lasecki. A large-scale corpus for conversation disentanglement. InPro-ceedings of the 57th Annual Meeting of the Association for Computational Linguis-tics, pages 3846–3856, Florence, Italy, July 2019. Association for ComputationalLinguistics.
[23] Hao Wang, Zhengdong Lu, Hang Li, and Enhong Chen. A dataset for research onshort-text conversations. InProceedings of the 2013 Conference on Empirical Meth-ods in Natural Language Processing, pages 935–945, Seattle, Washington, USA,October 2013. Association for Computational Linguistics.
[24] Xiangyang Zhou, Lu Li, Daxiang Dong, Yi Liu, Ying Chen, Wayne Xin Zhao, Dian-hai Yu, and Hua Wu. Multi-turn response selection for chatbots with deep attentionmatching network. InProceedings of the 56th Annual Meeting of the Association forComputational Linguistics (Volume 1: Long Papers), pages 1118–1127, Melbourne,Australia, July 2018. Association for Computational Linguistics.
[25] Andreas Stolcke, Klaus Ries, Noah Coccaro, Elizabeth Shriberg, Rebecca Bates,Daniel Jurafsky, Paul Taylor, Rachel Martin, Carol Van Ess-Dykema, and MarieMeteer. Dialogue act modeling for automatic tagging and recognition of conversa-tional speech.Computational Linguistics, 26(3):339–374, 2000.
[26] John J. Godfrey, Edward C. Holliman, and Jane McDaniel. Switchboard: Telephonespeech corpus for research and development. InProceedings of the 1992 IEEEInternational Conference on Acoustics, Speech and Signal Processing - Volume 1,ICASSP’92, page 517–520, USA, 1992. IEEE Computer Society.
[27] Jan Alexandersson, Bianka Buschbeck-Wolf, Tsutomu Fujinami, Michael Kipp, Ste-fan Koch, Elisabeth Maier, Norbert Reithinger, Birte Schmitz, and Melanie Siegel.Dialogue acts in verbmobil-2 : Second edition.Verbmobil Report, 1998(226), 1998.
[28] Jean Carletta and Amy Isard. Hcrc dialogue structure coding manual. Technicalreport, Centre, University of Edinburgh, 1996.
[29] Charles T. Hemphill, John J. Godfrey, and George R. Doddington. The ATIS spokenlanguage systems pilot corpus. InSpeech and Natural Language: Proceedings of aWorkshop Held at Hidden Valley, Pennsylvania, June 24-27,1990, 1990.
[30] Mark G. Core and James F. Allen. Coding dialogs with the damsl annotation scheme.InWorking Notes of the AAAI Fall Symposium on Communicative Action in Humansand Machines, pages 28–35, Cambridge, MA, November 1997.
[31] H.C. Bunt.Dynamic Interpretation and Dialogue Theory, pages 139–188. JohnBenjamins, 2000.
[32] Harry Bunt. The dit++ taxonomy for functional dialogue markup. In Dirk Heylen,Catherine Pelachaud, Roberta Catizone, and David Traum, editors,AAMAS 2009Workshop, Towards a Standard Markup Language for Embodied Dialogue Acts,pages 13–24, 2009.
[33] J. Ang, Yang Liu, and E. Shriberg. Proceedings. (icassp ’05). ieee internationalconference on acoustics, speech, and signal processing, 2005. InAutomatic dialogact segmentation and classification in multiparty meetings, volume 1, pages I/1061–I/1064 Vol. 1, 2005.
[34] Dinoj Surendran and Gina anne Levow. Dialog act tagging with support vectormachines and hidden markov models. InProceedings of Interspeech/ICSLP, 2006.
[35] Ji Young Lee and Franck Dernoncourt. Sequential short-text classification with re-current and convolutional neural networks. InProceedings of the 2016 Conferenceof the North American Chapter of the Association for Computational Linguistics:Human Language Technologies, pages 515–520, San Diego, California, June 2016.Association for Computational Linguistics.
[36] Hamed Khanpour, Nishitha Guntakandla, and Rodney Nielsen. Dialogue act classi-fication in domain-independent conversations using a deep recurrent neural network.InProceedings of COLING 2016, the 26th International Conference on Computa-tional Linguistics: Technical Papers, pages 2012–2021, Osaka, Japan, December2016. The COLING 2016 Organizing Committee.
[37] Daniel Ortega and Ngoc Thang Vu. Neural-based context representation learningfor dialog act classification. InProceedings of the 18th Annual SIGdial Meetingon Discourse and Dialogue, pages 247–252, Saarbrücken, Germany, August 2017.Association for Computational Linguistics.
[38] Zheqian Chen, Rongqin Yang, Zhou Zhao, Deng Cai, and Xiaofei He. Dialogueact recognition via crf-attentive structured network. InThe 41st International ACMSIGIR Conference on Research & Development in Information Retrieval, SIGIR ’18,page 225–234, New York, NY, USA, 2018. Association for Computing Machinery.
[39] Yu Wu, Wei Wu, Ming Zhou, and Zhoujun Li. Sequential match network: A newarchitecture for multi-turn response selection in retrieval-based chatbots.CoRR,abs/1612.01627, 2016.
[40] Yanran Li, Hui Su, Xiaoyu Shen, Wenjie Li, Ziqiang Cao, and Shuzi Niu. Dai-lyDialog: A manually labelled multi-turn dialogue dataset. InProceedings of theEighth International Joint Conference on Natural Language Processing (Volume 1:Long Papers), pages 986–995, Taipei, Taiwan, November 2017. Asian Federation ofNatural Language Processing.
[41] Binxuan Huang, Yanglan Ou, and Kathleen M. Carley. Aspect level sentiment classi-fication with attention-over-attention neural networks.CoRR, abs/1804.06536, 2018.
[42] Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. BERT: Pre-training of deep bidirectional transformers for language understanding. InProceed-ings of the 2019 Conference of the North American Chapter of the Association forComputational Linguistics: Human Language Technologies, Volume 1 (Long andShort Papers), pages 4171–4186, Minneapolis, Minnesota, June 2019. Associationfor Computational Linguistics.
[43] Anne H. Anderson, Miles Bader, Ellen Gurman Bard, Elizabeth Boyle, GwynethDoherty, Simon Garrod, Stephen Isard, Jacqueline Kowtko, Jan McAllister, JimMiller, Catherine Sotillo, Henry S. Thompson, and Regina Weinert. The hcrc maptask corpus.Language and Speech, 34(4):351–366, 1991.
[44] Susanne Jekat, Alexandra Klein, Elisabeth Maier, Ilona Maleck, Marion Mast, andJ. Joachim Quantz. Dialogue acts in verbmobil.Verbmobil Report, 1998(65), 1995.
[45] Geoffrey Leech and M. Weisser.Generic speech act annotation for task-orienteddialogues., pages 441–446. Centre for Computer Corpus Research on LanguageTechnical Papers, Lancaster University, 2003.
[46] John D. Lafferty, Andrew McCallum, and Fernando C. N. Pereira. Conditional ran-dom fields: Probabilistic models for segmenting and labeling sequence data. InPro-ceedings of the Eighteenth International Conference on Machine Learning, ICML’01, page 282–289, San Francisco, CA, USA, 2001. Morgan Kaufmann PublishersInc.
[47] Arman Cohan, Iz Beltagy, Daniel King, Bhavana Dalvi, and Dan Weld. Pretrainedlanguage models for sequential sentence classification. InProceedings of the 2019Conference on Empirical Methods in Natural Language Processing and the 9th In-ternational Joint Conference on Natural Language Processing (EMNLP-IJCNLP),pages 3693–3699, Hong Kong, China, November 2019. Association for Computa-tional Linguistics.
[48] Nils Reimers and Iryna Gurevych. Sentence-BERT: Sentence embeddings usingSiamese BERT-networks. InProceedings of the 2019 Conference on EmpiricalMethods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pages 3982–3992, HongKong, China, November 2019. Association for Computational Linguistics.
[49] Dan Roth and Wen-tau Yih. A linear programming formulation for global inferencein natural language tasks. InProceedings of the Eighth Conference on Computa-tional Natural Language Learning (CoNLL-2004) at HLT-NAACL 2004, pages 1–8,Boston, Massachusetts, USA, May 6 - May 7 2004. Association for ComputationalLinguistics.
[50] Igor Malioutov and Regina Barzilay. Minimum cut model for spoken lecture seg-mentation. InProceedings of the 21st International Conference on ComputationalLinguistics and 44th Annual Meeting of the Association for Computational Linguis-tics, pages 25–32, Sydney, Australia, July 2006. Association for Computational Lin-guistics.
[51] Zhengdong Lu and Hang Li. A deep architecture for matching short texts. In C. J. C.Burges, L. Bottou, M. Welling, Z. Ghahramani, and K. Q. Weinberger, editors,Ad-vances in Neural Information Processing Systems, volume 26, pages 1367–1375.Curran Associates, Inc., 2013.
[52] Baotian Hu, Zhengdong Lu, Hang Li, and Qingcai Chen. Convolutional neuralnetwork architectures for matching natural language sentences. In Z. Ghahramani,M. Welling, C. Cortes, N. Lawrence, and K. Q. Weinberger, editors,Advances inNeural Information Processing Systems, volume 27, pages 2042–2050. Curran As-sociates, Inc., 2014.
[53] Yu Wu, Wei Wu, Chen Xing, Ming Zhou, and Zhoujun Li. Sequential matching net-work: A new architecture for multi-turn response selection in retrieval-based chat-bots. InProceedings of the 55th Annual Meeting of the Association for Computa-tional Linguistics (Volume 1: Long Papers), pages 496–505, Vancouver, Canada,July 2017. Association for Computational Linguistics.
[54] Svetlana Kiritchenko, Xiaodan Zhu, Colin Cherry, and Saif Mohammad. NRC-Canada-2014: Detecting aspects and sentiment in customer reviews. InProceedingsof the 8th International Workshop on Semantic Evaluation (SemEval 2014), pages437–442, Dublin, Ireland, August 2014. Association for Computational Linguistics.
[55] H. Liu, I. Chatterjee, M. Zhou, X. S. Lu, and A. Abusorrah. Aspect-based sentiment
analysis: A survey of deep learning methods.IEEE Transactions on ComputationalSocial Systems, 7(6):1358–1375, 2020.
[56] Yequan Wang, Minlie Huang, Xiaoyan Zhu, and Li Zhao. Attention-based LSTMfor aspect-level sentiment classification. InProceedings of the 2016 Conference onEmpirical Methods in Natural Language Processing, pages 606–615, Austin, Texas,November 2016. Association for Computational Linguistics.
[57] Wei Xue and Tao Li. Aspect based sentiment analysis with gated convolutionalnetworks. InProceedings of the 56th Annual Meeting of the Association for Compu-tational Linguistics (Volume 1: Long Papers), pages 2514–2523, Melbourne, Aus-tralia, July 2018. Association for Computational Linguistics.
[58] Ankur Parikh, Oscar Täckström, Dipanjan Das, and Jakob Uszkoreit. A decom-posable attention model for natural language inference. InProceedings of the 2016Conference on Empirical Methods in Natural Language Processing, pages 2249–2255, Austin, Texas, November 2016. Association for Computational Linguistics.
[59] Harry Bunt, Volha Petukhova, Andrei Malchanau, Alex Fang, and Kars Wijnhoven.The dialogbank: Dialogues with interoperable annotations.Lang. Resour. Eval.,53(2):213–249, June 2019.
[60] Chengyu FANG, Jing CAO, Harry BUNT, and Xiaoyue LIU. The annotation of theswitchboard corpus with the new iso standard for dialogue act analysis. InProceed-ings of the Eighth Joint ACL-ISO Workshop on Interoperable Semantic Annotation,October 2012. Eighth Joint ACL-ISO Workshop on Interoperable Semantic Anno-tation ; Conference date: 03-10-2012 Through 05-10-2012.
[61] Harry Bunt, Jan Alexandersson, Jean Carletta, Jae-Woong Choe, Alex ChengyuFang, Koiti Hasida, Kiyong Lee, Volha Petukhova, Andrei Popescu-Belis, LaurentRomary, Claudia Soria, and David Traum. Towards an ISO standard for dialogueact annotation. InProceedings of the Seventh International Conference on Lan-guage Resources and Evaluation (LREC’10), Valletta, Malta, May 2010. EuropeanLanguage Resources Association (ELRA).
[62] Jean Carletta, Simone Ashby, Sebastien Bourban, Mike Flynn, Mael Guillemot,Thomas Hain, Jaroslav Kadlec, Vasilis Karaiskos, Wessel Kraaij, Melissa Kronen-thal, Guillaume Lathoud, Mike Lincoln, Agnes Lisowska, Iain McCowan, Wil-fried Post, Dennis Reidsma, and Pierre Wellner. The ami meeting corpus: A pre-68
announcement. InProceedings of the Second International Conference on MachineLearning for Multimodal Interaction, MLMI’05, page 28–39, Berlin, Heidelberg,2005. Springer-Verlag.
[63] Elizabeth Shriberg, Raj Dhillon, Sonali Bhagat, Jeremy Ang, and Hannah Carvey.The ICSI meeting recorder dialog act (MRDA) corpus. InProceedings of the 5thSIGdial Workshop on Discourse and Dialogue at HLT-NAACL 2004, pages 97–100,Cambridge, Massachusetts, USA, April 30 - May 1 2004. Association for Computa-tional Linguistics.
[64] S. Quarteroni and G. Riccardi. Classifying dialog acts in human-human and human-machine spoken conversations. InINTERSPEECH, 2010.
[65] Nal Kalchbrenner and Phil Blunsom. Recurrent convolutional neural networks fordiscourse compositionality. InProceedings of the Workshop on Continuous VectorSpace Models and their Compositionality, pages 119–126, Sofia, Bulgaria, August2013. Association for Computational Linguistics.
[66] Yucan Zhou, Qinghua Hu, Jie Liu, and Yuan Jia. Combining heterogeneous deepneural networks with conditional random fields for chinese dialogue act recognition.Neurocomput., 168(C):408–417, November 2015.
[67] Yangfeng Ji, Gholamreza Haffari, and Jacob Eisenstein. A latent variable recurrentneural network for discourse-driven language models. InProceedings of the 2016Conference of the North American Chapter of the Association for ComputationalLinguistics: Human Language Technologies, pages 332–342, San Diego, California,June 2016. Association for Computational Linguistics.
[68] Wei Li and Yunfang Wu. Multi-level gated recurrent neural network for dialog actclassification. InProceedings of COLING 2016, the 26th International Conferenceon Computational Linguistics: Technical Papers, pages 1970–1979, Osaka, Japan,December 2016. The COLING 2016 Organizing Committee.
[69] Tomas Mikolov, Kai Chen, Greg S. Corrado, and Jeffrey Dean. Efficient estimationof word representations in vector space, 2013.
[70] Jeffrey Pennington, Richard Socher, and Christopher Manning. GloVe: Global vec-tors for word representation. InProceedings of the 2014 Conference on Empiri-cal Methods in Natural Language Processing (EMNLP), pages 1532–1543, Doha,Qatar, October 2014. Association for Computational Linguistics.69
[71] Matthew Peters, Mark Neumann, Mohit Iyyer, Matt Gardner, Christopher Clark,Kenton Lee, and Luke Zettlemoyer. Deep contextualized word representations. InProceedings of the 2018 Conference of the North American Chapter of the Asso-ciation for Computational Linguistics: Human Language Technologies, Volume 1(Long Papers), pages 2227–2237, New Orleans, Louisiana, June 2018. Associationfor Computational Linguistics.
[72] Alec Radford, Karthik Narasimhan, Tim Salimans, and Ilya Sutskever. Improvinglanguage understanding by generative pre-training. 2018.
[73] Chi Sun, Luyao Huang, and Xipeng Qiu. Utilizing BERT for aspect-based sentimentanalysis via constructing auxiliary sentence. InProceedings of the 2019 Conferenceof the North American Chapter of the Association for Computational Linguistics:Human Language Technologies, Volume 1 (Long and Short Papers), pages 380–385,Minneapolis, Minnesota, June 2019. Association for Computational Linguistics.
[74] Piotr Bojanowski, Edouard Grave, Armand Joulin, and Tomas Mikolov. Enrichingword vectors with subword information.arXiv preprint arXiv:1607.04606, 2016.
[75] Arthur Brack, Anett Hoppe, Pascal Buschermöhle, and Ralph Ewerth. Sequentialsentence classification in research papers using cross-domain multi-task learning,2021.
[76] L.R. Rabiner. A tutorial on hidden markov models and selected applications inspeech recognition.Proceedings of the IEEE, 77(2):257–286, 1989.
[77] G.D. Forney. The viterbi algorithm.Proceedings of the IEEE, 61(3):268–278, 1973.
[78] Yann N. Dauphin, Angela Fan, Michael Auli, and David Grangier. Language mod-eling with gated convolutional networks. InProceedings of the 34th InternationalConference on Machine Learning - Volume 70, ICML’17, page 933–941. JMLR.org,2017.
[79] Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. Bert: Pre-training of deep bidirectional transformers for language understanding, 2019.
[80] Yaroslav Ganin, Evgeniya Ustinova, Hana Ajakan, Pascal Germain, HugoLarochelle, François Laviolette, Mario Marchand, and Victor Lempitsky. Domain-adversarial training of neural networks, 2016.
[81] Pei-Hao Su, Paweł Budzianowski, Stefan Ultes, Milica Gaši ́c, and Steve Young.Sample-efficient actor-critic reinforcement learning with supervised data for dia-logue management. InProceedings of the 18th Annual SIGdial Meeting on Dis-course and Dialogue, pages 147–157, Saarbrücken, Germany, August 2017. Asso-ciation for Computational Linguistics.
[82] Jason D. Williams, Kavosh Asadi, and Geoffrey Zweig. Hybrid code networks:practical and efficient end-to-end dialog control with supervised and reinforcementlearning. InProceedings of the 55th Annual Meeting of the Association for Compu-tational Linguistics (Volume 1: Long Papers), pages 665–677, Vancouver, Canada,July 2017. Association for Computational Linguistics.
[83] Xiujun Li, Zachary C. Lipton, Bhuwan Dhingra, Lihong Li, Jianfeng Gao, and Yun-Nung Chen. A user simulator for task-completion dialogues, 2017.
[84] Weiyan Shi, Kun Qian, Xuewei Wang, and Zhou Yu. How to build user simula-tors to train RL-based dialog systems. InProceedings of the 2019 Conference onEmpirical Methods in Natural Language Processing and the 9th International JointConference on Natural Language Processing (EMNLP-IJCNLP), pages 1990–2000,Hong Kong, China, November 2019. Association for Computational Linguistics.
[85] Baolin Peng, Xiujun Li, Lihong Li, Jianfeng Gao, Asli Celikyilmaz, Sungjin Lee,and Kam-Fai Wong. Composite task-completion dialogue policy learning via hierar-chical deep reinforcement learning. InProceedings of the 2017 Conference on Em-pirical Methods in Natural Language Processing, pages 2231–2240, Copenhagen,Denmark, September 2017. Association for Computational Linguistics.
[86] Iñigo Casanueva, Paweł Budzianowski, Pei-Hao Su, Stefan Ultes, Lina M. Rojas-Barahona, Bo-Hsiang Tseng, and Milica Gaši ́c. Feudal reinforcement learning fordialogue management in large domains. InProceedings of the 2018 Conferenceof the North American Chapter of the Association for Computational Linguistics:Human Language Technologies, Volume 2 (Short Papers), pages 714–719, New Or-leans, Louisiana, June 2018. Association for Computational Linguistics.
[87] Shang-Yu Su, Xiujun Li, Jianfeng Gao, Jingjing Liu, and Yun-Nung Chen. Discrimi-native deep Dyna-Q: Robust planning for dialogue policy learning. InProceedings ofthe 2018 Conference on Empirical Methods in Natural Language Processing, pages71
3813–3823, Brussels, Belgium, October-November 2018. Association for Compu-tational Linguistics.
[88] David Abel, John Salvatier, Andreas Stuhlmüller, and Owain Evans. Agent-agnostichuman-in-the-loop reinforcement learning, 2017.
[89] Lu Chen, Xiang Zhou, Cheng Chang, Runzhe Yang, and Kai Yu. Agent-awaredropout DQN for safe and efficient on-line dialogue policy learning. InProceedingsof the 2017 Conference on Empirical Methods in Natural Language Processing,pages 2454–2464, Copenhagen, Denmark, September 2017. Association for Com-putational Linguistics. |