臺灣博碩士論文加值系統

English |FB 專頁 |Mobile

免費會員登入| 註冊

功能切換導覽列

(216.73.216.24) 您好！臺灣時間：2026/04/07 23:11

字體大小：

:::

詳目顯示

第 1 筆 / 共 1 筆

/1頁

論文基本資料
摘要
外文摘要
目次
參考文獻
電子全文
紙本論文
論文連結
QR Code

本論文永久網址:

研究生:

鄧鈺翰

研究生(外文):

Yu-Han Teng

論文名稱:

使用多模態架構進行深度學習模型分析之研究

論文名稱(外文):

Using a multimodal architecture Research on Deep Learning Model Analysis

指導教授:

薛義誠

學位類別:

碩士

校院名稱:

國立中央大學

系所名稱:

資訊管理學系

學門:

電算機學門

學類:

電算機一般學類

論文種類:

學術論文

論文出版年:

2019

畢業學年度:

107

語文別:

中文

論文頁數:

中文關鍵詞:

多模態深度學習、GRU、CNN、Ｗord2Vec、Glove、注意力機制

外文關鍵詞:

Multimodal deep learning、GRU、CNN、Word2Vec、Glove、Attention mechanism

相關次數:

被引用:0
點閱:1690
評分:
下載:69
書目收藏:0

隨著社交網路與電子商務網站的普及，使用者從被動的接收訊息轉變為主動傳播訊息，評論以及網路訊息所呈現的價值也越來越重要，過去幾年的分析研究，試圖去分析了解有關具體的輿論產品、主題、評論與推文的趨勢，在各個方面發揮著重要作用。本研究利用不同的向量化處理，對多模態分析模型進行驗證比對，確認模型可有效提升準確度。本研究提出一種由兩種模型組成之結合特徵，並將此特徵結合深度學習神經網路建構建立多模態分析模型。模型一是基於Glove向量、注意力機制與GRU神經網路架構之深度學習模型，模型二是基於Word2Vec向量、注意力機制與CNN神經網路架構之深度學習模型，多模態分析模型經由K折交叉驗證、F1測量方法進行模型驗證。實驗結果證明本研究提出之多模態分析模型，準確率高於相關研究，利用高層級多模態結合法，將多個模型的特徵取出並加以結合形成結合特徵，並將此特徵進行神經網路訓練，可使特徵集有互相輔助之效果，透過兩種向量與最佳神經網路架構並搭配多模態方法可以得到91.56%的準確率，並在模型驗證得到了93%的驗證值，證明本研究提出之多模態分析模型用於評論文本領域，可有效提升模型預測準確率，使其準確率有顯著的提升。

With the popularity of social networks and e-commerce sites, users have switched from passively receiving messages to actively disseminating messages. The value of comments and online messages is also becoming more and more important. Analysis and research over the past few years. Trying to analyze trends about specific product products, topics, reviews, and tweets. Play an important role in all aspects. This study uses different vectorization processes to verify the multimodal analysis model and confirm that the model can effectively improve the accuracy. This study proposes a combination of two models. This feature is combined with deep learning neural network construction to build a multimodal analysis model. Model 1 is a deep learning model based on Glove vector, attention mechanism and GRU neural network architecture. Model 2 is a deep learning model based on Word2Vec vector, attention mechanism and CNN neural network architecture. Multimodal analysis model is validated by K-fold cross validation and F1 measurement method. The experimental results prove that the multimodal analysis model proposed in this study has higher accuracy than related research. Using the high-level multi-modal combination method, the features of multiple models are extracted and combined to form a combined feature, and this feature is trained in neural network. The feature set can be mutually assisted, and the accuracy can be 91.56% through the two vectors and the optimal neural network architecture combined with the multi-modal method. And the model verification shows 93% verification value, which proves that the multimodal analysis model proposed in this study is used in the field of comment texts, which can effectively improve the accuracy of model prediction and improve its accuracy.

摘要 i
Abstract ii
誌謝 iii
目錄 iv
圖目錄 vii
表目錄 ix
第一章、緒論 1
1-1 研究背景與動機 1
1-2 研究目的 2
1-3 論文架構 3
第二章、文獻探討 4
2-1 詞向量 4
2-1-1 Word2Vec 4
2-1-2 全局向量 6
2-2 類神經網路 6
2-2-1 卷積神經網路 7
2-2-1-1 卷積層 8
2-2-1-2 池化層 8
2-2-1-3 全連接層 9
2-2-2 長短期記憶網路 10
2-2-3 GRU 12
2-3 激活函數 13
2-3-1 Sigmoid 13
2-3-2 ReLU 14
2-4 注意力機制 16
2-5 多模態深度學習 17
2-6 K折交叉驗證 20
2-7 F1測量驗證 21
2-8 準確度驗證 22
第三章、研究方法與架構 23
3-1 實驗架構 23
3-2 實驗準備 25
3-3 實驗比較對象 26
3-3-1 實驗比較對象一 26
3-3-2 實驗比較對象二 28
3-4 實驗流程 30
3-4-1 前置實驗 30
3-4-1-1 詞向量訓練 31
3.4.1.1.1. Word2Vec字詞模型訓練 31
3.4.1.1.2. Glove字詞模型訓練 31
3-4-2 實驗一 32
3-4-2-1 GRU與長短期記憶網路模型建構 32
3-4-2-2 CNN模型建構 33
3-4-2-3 注意力機制 34
3-4-3 實驗二 35
3-4-3-1 多模態特徵結合與神經網路建構 35
第四章、實驗結果 37
4-1 實驗一結果 37
4-2 實驗二結果 42
4-3 實驗總結 45
第五章、研究結論 46
5-1 結論 46
5-2 研究貢獻 46
5-3 研究限制 47
5-4 未來研究方向 47
參考文獻 48
附錄一: 模型一程式碼 51
附錄二: 模型二程式碼 59
附錄三: 多模態分析模型程式碼 65

Andrew. (2011). Learning word vectors for sentiment analysis.
Arras & Montavon. (2016). Explaining predictions of non-linear classifiers in NLP.
Azimi & Abdolrashidi. (2019). Deep-Sentiment: Sentiment Analysis Using Ensemble of CNN and Bi-LSTM Models.
Bagnall, A., Lines, J., Hills, J., & Bostrom, A. (2015). Time-series classification with COTE: the collective of transformation-based ensembles.
Bargal&Sclaroff. (2018). Top-down neural attention by excitation backprop.
Bengio & Grandvalet. (2014). No unbiased estimator of the variance of k-fold cross-validation.
Cerisara & Lenc. (2018). On the effects of using word2vec representations in neural networks for dialogue act recognition.
Chua&Sun. (2015). Topical word embeddings.
Dayan & Abbott. (2001). Theoretical neuroscience: computational and mathematical modeling of neural systems.
Dhariyal & Ravi. (2018). Sentiment analysis via Doc2Vec and Convolutional Neural Network hybrids.
Hansen & Simonsen. (2019). Neural Speed Reading with Structural-Jump-LSTM.
Hinton&Salakhutdinov. (2006). Reducing the dimensionality of data with neural networks.
Hochreiter & Schmidhuber. (1997). Long short-term memory. Neural computation.
Ji, L., Gong, P., & Yao, Z. (2019). A text sentiment analysis model based on self-attention mechanism.
Kai Sheng Tai & Richard Socher. (2015). Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks.
Khosla & Ng. (2011). Multimodal deep learning.
KimY. (2014). Convolutional neural networks for sentence classification.
Kingma & Ba. (2014). Adam: A method for stochastic optimization.
Koley & Dey. (2012). An ensemble system for automatic sleep stage classification using single channel EEG signal.
Lines & Bostrom. (2017). Time series classification from scratch with deep neural networks: A strong baseline.
Liu & Xiong. (2018). Attention Aware Bidirectional Gated Recurrent Unit Based Framework for Sentiment Analysis.
Mikolov & Dean. (2013). Efficient estimation of word representations in vector space.
Nanopoulos & Manolopoulos. (2001). Feature-based classification of time-series data.
Peng & Zhao. (2017). Object-part attention model for fine-grained image classification.
Pennington & Manning. (2014). Glove: Global vectors for word representation.
Ravi&Dhariyal. (2018). Sentiment analysis via Doc2Vec and Convolutional Neural Network hybrids.
Ren&Bao. (2018). Investigating Lstm with k-Max Pooling for Text Classification.
RosenblattF. (1958). The perceptron: a probabilistic model for information storage and organization in the brain.
Shazeer & Jones. (2017). Attention is all you need.
Simard & Frasconi. (1994). Learning long-term dependencies with gradient descent is difficult.
Subarno & Ghosh. (2018). Sentiment Analysis in the Light of LSTM Recurrent Neural Networks.
Sutskever & Hinton. (2012). Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems.
Torralba&Fidler. (2015). Skip-Thought Vectors.
Tsuruoka. (2016). A joint many-task model: Growing a neural network for multiple nlp tasks.
Wei & Keogh. (2006). Semi-supervised time series classification.
Williams & Zipser. (1989). A learning algorithm for continually running fully recurrent neural networks.
Xianghua. (2018). Lexicon-enhanced LSTM with attention for general sentiment analysis.
Xiao & Zhao. (2018). A deep learning-based multi-model ensemble method for cancer prediction.
Xinpeng & Jingyuan. (2018). Fine-grained Video Attractiveness Prediction Using Multimodal.
Zhicheng Cui. (2016). Multi-scale convolutional neural networks for time series classification.

電子全文

國圖紙本論文

連結至畢業學校之論文網頁點我開啟連結
註: 此連結為研究生畢業學校所提供，不一定有電子全文可供下載，若連結有誤，請點選上方之〝勘誤回報〞功能，我們會盡快修正，謝謝！

推文
網路書籤
推薦
評分
引用網址
轉寄

top

相關論文
相關期刊
熱門點閱論文

1.	ConvGRU 預判影像中物體之運動軌跡
2.	深度學習文本分類的比較研究
3.	基於深度學習的問題分類——以知乎問題資料為例
4.	應用卷積神經網路與長短期記憶神經網路之Twitter輿情分析
5.	數種結合詞向量與字典資源之方法用於字義相似度測量

無相關期刊

1.	以體驗因素結合科技接受模型探討社群媒體聊天機器人使用體驗對意圖之影響
2.	以供給層面探討台灣發展智慧醫療之關鍵成功要素
3.	卷積模糊神經系統於財金市場數據之研究
4.	臉書粉絲專頁互動與選舉結果之相關性研究
5.	非對稱因果式類神經模糊系統於時間序列預測之研究
6.	複數模糊集成學習方法於趨勢預測之研究
7.	特徵選取對智慧型時間序列預測之效能研究
8.	高斯鯨群演算法於最佳化問題之研究
9.	郵件系統異常使用行為偵測與處理-以T公司為例
10.	社群媒體粉絲專頁貼文策略研究:以醫療產業為例
11.	運用計畫行為理論探討聊天機器人使用意願之研究：以衛教諮詢為例
12.	情感分析應用於社群媒體輿論分析之研究
13.	私立護理專科學校學生選校因素分析
14.	基於注意力機制的開放式對話系統
15.	大學生課堂多工動機與學習投入對課堂多工行為之影響

簡易查詢 | 進階查詢 | 熱門排行 | 我的研究室