使用多模態架構進行深度學習模型分析之研究

以作者查詢圖書館館藏

、以作者查詢臺灣博碩士

、以作者查詢全國書目

、勘誤回報

、線上人數：90

、訪客IP：3.143.254.224

姓名

鄧鈺翰(Yu-Han Teng) 查詢紙本館藏

畢業系所

資訊管理學系

論文名稱

使用多模態架構進行深度學習模型分析之研究
(Using a multimodal architecture Research on Deep Learning Model Analysis)

相關論文

★ 探討國內田徑競賽資訊系統－以103年全國大專田徑公開賽資訊系統為例	★ 生物晶片之基因微陣列影像分析之研究
★ 台灣資訊家電產業IPv6技術地圖與發展策略之研究	★ 台灣第三代行動通訊產業IPv6技術地圖與發展策略之研究
★ 影響消費者使用電子書閱讀器採納意願之研究	★ 以資訊素養映對數位學習平台功能之研究
★ 台商群聚指標模式與資料分析之研究	★ 未來輪輔助軟體發展之需求擷取研究
★ 以工作流程圖展現未來研究方法配適於前瞻研究流程之研究	★ 以物件導向塑模未來研究方法配適於前瞻研究之系統架構
★ 應用TRIZ 探討核心因素建構電子商務新畫布	★ 企業策略資訊策略人力資源管理策略對組織績效的影響
★ 採用Color Petri Net方法偵測程式原始碼緩衝區溢位問題	★ 簡單且彈性化的軟體代理人通訊協定之探討與實作
★ 利用分析層級程序法探討台灣中草藥製造業之關鍵成功因素	★ 利用微陣列資料分析於基因調控網路之建構與預測

檔案

[Endnote RIS 格式]

[Bibtex 格式]

[相關文章]

[文章引用]

[完整記錄]

[館藏目錄]

至系統瀏覽論文 (全文檔遺失)
請聯絡國立中央大學圖書館資訊系統組 TEL:(03)422-7151轉57422，或E-mail聯絡

摘要(中)

隨著社交網路與電子商務網站的普及，使用者從被動的接收訊息轉變為主動傳播訊息，評論以及網路訊息所呈現的價值也越來越重要，過去幾年的分析研究，試圖去分析了解有關具體的輿論產品、主題、評論與推文的趨勢，在各個方面發揮著重要作用。本研究利用不同的向量化處理，對多模態分析模型進行驗證比對，確認模型可有效提升準確度。本研究提出一種由兩種模型組成之結合特徵，並將此特徵結合深度學習神經網路建構建立多模態分析模型。模型一是基於Glove向量、注意力機制與GRU神經網路架構之深度學習模型，模型二是基於Word2Vec向量、注意力機制與CNN神經網路架構之深度學習模型，多模態分析模型經由K折交叉驗證、F1測量方法進行模型驗證。實驗結果證明本研究提出之多模態分析模型，準確率高於相關研究，利用高層級多模態結合法，將多個模型的特徵取出並加以結合形成結合特徵，並將此特徵進行神經網路訓練，可使特徵集有互相輔助之效果，透過兩種向量與最佳神經網路架構並搭配多模態方法可以得到91.56%的準確率，並在模型驗證得到了93%的驗證值，證明本研究提出之多模態分析模型用於評論文本領域，可有效提升模型預測準確率，使其準確率有顯著的提升。

摘要(英)

With the popularity of social networks and e-commerce sites, users have switched from passively receiving messages to actively disseminating messages. The value of comments and online messages is also becoming more and more important. Analysis and research over the past few years. Trying to analyze trends about specific product products, topics, reviews, and tweets. Play an important role in all aspects. This study uses different vectorization processes to verify the multimodal analysis model and confirm that the model can effectively improve the accuracy. This study proposes a combination of two models. This feature is combined with deep learning neural network construction to build a multimodal analysis model. Model 1 is a deep learning model based on Glove vector, attention mechanism and GRU neural network architecture. Model 2 is a deep learning model based on Word2Vec vector, attention mechanism and CNN neural network architecture. Multimodal analysis model is validated by K-fold cross validation and F1 measurement method. The experimental results prove that the multimodal analysis model proposed in this study has higher accuracy than related research. Using the high-level multi-modal combination method, the features of multiple models are extracted and combined to form a combined feature, and this feature is trained in neural network. The feature set can be mutually assisted, and the accuracy can be 91.56% through the two vectors and the optimal neural network architecture combined with the multi-modal method. And the model verification shows 93% verification value, which proves that the multimodal analysis model proposed in this study is used in the field of comment texts, which can effectively improve the accuracy of model prediction and improve its accuracy.

關鍵字(中)

★ 多模態深度學習、GRU、CNN、Ｗord2Vec、Glove、注意力機制

關鍵字(英)

★ Multimodal deep learning, GRU, CNN, Word2Vec, Glove, Attention mechanism

論文目次

摘要 i
Abstract ii
誌謝 iii
目錄 iv
圖目錄 vii
表目錄 ix
第一章、緒論 1
1-1 研究背景與動機 1
1-2 研究目的 2
1-3 論文架構 3
第二章、文獻探討 4
2-1 詞向量 4
2-1-1 Word2Vec 4
2-1-2 全局向量 6
2-2 類神經網路 6
2-2-1 卷積神經網路 7
2-2-1-1 卷積層 8
2-2-1-2 池化層 8
2-2-1-3 全連接層 9
2-2-2 長短期記憶網路 10
2-2-3 GRU 12
2-3 激活函數 13
2-3-1 Sigmoid 13
2-3-2 ReLU 14
2-4 注意力機制 16
2-5 多模態深度學習 17
2-6 K折交叉驗證 20
2-7 F1測量驗證 21
2-8 準確度驗證 22
第三章、研究方法與架構 23
3-1 實驗架構 23
3-2 實驗準備 25
3-3 實驗比較對象 26
3-3-1 實驗比較對象一 26
3-3-2 實驗比較對象二 28
3-4 實驗流程 30
3-4-1 前置實驗 30
3-4-1-1 詞向量訓練 31
3.4.1.1.1. Word2Vec字詞模型訓練 31
3.4.1.1.2. Glove字詞模型訓練 31
3-4-2 實驗一 32
3-4-2-1 GRU與長短期記憶網路模型建構 32
3-4-2-2 CNN模型建構 33
3-4-2-3 注意力機制 34
3-4-3 實驗二 35
3-4-3-1 多模態特徵結合與神經網路建構 35
第四章、實驗結果 37
4-1 實驗一結果 37
4-2 實驗二結果 42
4-3 實驗總結 45
第五章、研究結論 46
5-1 結論 46
5-2 研究貢獻 46
5-3 研究限制 47
5-4 未來研究方向 47
參考文獻 48
附錄一: 模型一程式碼 51
附錄二: 模型二程式碼 59
附錄三: 多模態分析模型程式碼 65

參考文獻

Andrew. (2011). Learning word vectors for sentiment analysis.
Arras & Montavon. (2016). Explaining predictions of non-linear classifiers in NLP.
Azimi & Abdolrashidi. (2019). Deep-Sentiment: Sentiment Analysis Using Ensemble of CNN and Bi-LSTM Models.
Bagnall, A., Lines, J., Hills, J., & Bostrom, A. (2015). Time-series classification with COTE: the collective of transformation-based ensembles.
Bargal&Sclaroff. (2018). Top-down neural attention by excitation backprop.
Bengio & Grandvalet. (2014). No unbiased estimator of the variance of k-fold cross-validation.
Cerisara & Lenc. (2018). On the effects of using word2vec representations in neural networks for dialogue act recognition.
Chua&Sun. (2015). Topical word embeddings.
Dayan & Abbott. (2001). Theoretical neuroscience: computational and mathematical modeling of neural systems.
Dhariyal & Ravi. (2018). Sentiment analysis via Doc2Vec and Convolutional Neural Network hybrids.
Hansen & Simonsen. (2019). Neural Speed Reading with Structural-Jump-LSTM.
Hinton&Salakhutdinov. (2006). Reducing the dimensionality of data with neural networks.
Hochreiter & Schmidhuber. (1997). Long short-term memory. Neural computation.
Ji, L., Gong, P., & Yao, Z. (2019). A text sentiment analysis model based on self-attention mechanism.
Kai Sheng Tai & Richard Socher. (2015). Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks.
Khosla & Ng. (2011). Multimodal deep learning.
KimY. (2014). Convolutional neural networks for sentence classification.
Kingma & Ba. (2014). Adam: A method for stochastic optimization.
Koley & Dey. (2012). An ensemble system for automatic sleep stage classification using single channel EEG signal.
Lines & Bostrom. (2017). Time series classification from scratch with deep neural networks: A strong baseline.
Liu & Xiong. (2018). Attention Aware Bidirectional Gated Recurrent Unit Based Framework for Sentiment Analysis.
Mikolov & Dean. (2013). Efficient estimation of word representations in vector space.
Nanopoulos & Manolopoulos. (2001). Feature-based classification of time-series data.
Peng & Zhao. (2017). Object-part attention model for fine-grained image classification.
Pennington & Manning. (2014). Glove: Global vectors for word representation.
Ravi&Dhariyal. (2018). Sentiment analysis via Doc2Vec and Convolutional Neural Network hybrids.
Ren&Bao. (2018). Investigating Lstm with k-Max Pooling for Text Classification.
RosenblattF. (1958). The perceptron: a probabilistic model for information storage and organization in the brain.
Shazeer & Jones. (2017). Attention is all you need.
Simard & Frasconi. (1994). Learning long-term dependencies with gradient descent is difficult.
Subarno & Ghosh. (2018). Sentiment Analysis in the Light of LSTM Recurrent Neural Networks.
Sutskever & Hinton. (2012). Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems.
Torralba&Fidler. (2015). Skip-Thought Vectors.
Tsuruoka. (2016). A joint many-task model: Growing a neural network for multiple nlp tasks.
Wei & Keogh. (2006). Semi-supervised time series classification.
Williams & Zipser. (1989). A learning algorithm for continually running fully recurrent neural networks.
Xianghua. (2018). Lexicon-enhanced LSTM with attention for general sentiment analysis.
Xiao & Zhao. (2018). A deep learning-based multi-model ensemble method for cancer prediction.
Xinpeng & Jingyuan. (2018). Fine-grained Video Attractiveness Prediction Using Multimodal.
Zhicheng Cui. (2016). Multi-scale convolutional neural networks for time series classification.

指導教授

薛義誠

審核日期

2019-7-10

推文