Name 王睿揚 (Jui-Yang Wang)   Department Graduate Institute of Software Engineering
Thesis title 應用角色感知於深度神經網路架構之對話行為分類
(Dialog act Classification with Role awareness in DNN Framework)
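Abstract (Chinese, translated) In natural language processing, chatbot applications are developing rapidly. One of the problems that must be overcome is natural language understanding: knowing what kind of question the user is asking and inferring the information hidden in the text are essential for enabling a machine to understand the intent behind the user's question. Downstream components such as dialogue management and the generation of corresponding answers all build on intent understanding, so achieving a higher recognition rate is a major challenge.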
Abstract (English) In the field of natural language processing, dialogue robot applications are growing rapidly. One of the problems that must be overcome in this field is natural language understanding. Knowing what kind of question the user is asking and inferring the information hidden between the words are essential for making the machine understand the user's intent. Follow-up components such as dialogue management and the generation of corresponding answers also build on intent understanding, so achieving a better recognition rate is a major challenge.
In this study, we train deep learning models on dialogue data to predict the dialog act. We apply various neural networks to this problem and compare their differences. At the same time, we introduce role information into the model to accommodate the short-text nature of Chinese sentences. In addition, adding pre-trained word embeddings to the model handles unknown Chinese words more effectively, which can reduce the possibility of misidentification. Finally, this thesis compares several kinds of deep learning models and introduces role information to identify dialog acts, scoring nearly 1.2% higher than the typical neural network model on a telecom-domain dialogue dataset.
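Keywords (Chinese) ★ 對話行為 (dialog act)
★ 詞向量 (word embedding)
★ 深度學習 (deep learning)
★ 卷積類神經網路 (convolutional neural network)
★ 長短期記憶模型 (long short-term memory model)
★ 注意力機制 (attention mechanism)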
Keywords (English) ★ Dialog act
★ Word embedding
★ Deep learning
★ Convolutional neural network
★ Long Short-Term Memory
★ Attention mechanism
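Table of contents Abstract
Acknowledgements
Contents
List of Figures
List of Tables
Chapter 1 Introduction
1.1 Research Background
1.2 Research Motivation
1.3 Chapter Overview
Chapter 2 Literature Review
2.1 Dialog Act Classification
2.1.1 Dialogue Systems
2.1.2 Natural Language Understanding
2.1.3 Dialog Acts
2.2 Deep Learning
2.2.1 Word Embeddings (Word2vec)
2.2.2 Convolutional Neural Network
2.2.3 Recurrent Neural Network
2.2.4 GRU (Gated Recurrent Unit)
2.2.5 Long Short-Term Memory
2.2.6 Attention Mechanism
Chapter 3 System Architecture
3.1 Module Architecture
3.1.1 Preprocessing Module
3.1.2 Pre-trained Word Embedding Module
3.1.3 DNN Model Framework
3.2 Model Description
3.2.1 Sentence Encoder
3.2.2 Context Encoder
3.2.3 Classifier
3.2.4 Model 1: Sentence Model
3.2.5 Model 2: Sentence Model with Role Information
3.2.6 Model 3: Context Model
3.2.7 Model 4: Context Model with Role Information
Chapter 4 Experimental Method
4.1 Data Description
4.1.1 Dialogue Data
4.1.2 Pre-trained Word Embeddings
4.2 Parameter Settings
4.3 Experimental Results
4.3.1 Baseline
4.3.2 Sentence Models: Model 1 and Model 2
4.3.3 Context Models: Model 3 and Model 4
4.4 Error Analysis
Chapter 5 Conclusion and Future Work
5.1 Experimental Outcomes
5.2 Future Directions
Appendix
References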
Advisor 蔡宗翰 (Tzong-Han Tsai)   Approval date 2018-1-26
