臺灣博碩士論文加值系統

English |FB 專頁 |Mobile

免費會員登入| 註冊

功能切換導覽列

(216.73.217.46) 您好！臺灣時間：2026/05/31 04:44

字體大小：

:::

詳目顯示

第 1 筆 / 共 1 筆

/1頁

論文基本資料
摘要
外文摘要
目次
參考文獻
電子全文
紙本論文
QR Code

本論文永久網址:

研究生:

李柏賢

研究生(外文):

Bo,Xian LI

論文名稱:

利用seq2seq機器學習於延伸機器人對答內容

論文名稱(外文):

Apply seq2seq to machine learning for extend Chatbot content

指導教授:

陳重臣

指導教授(外文):

JONG-CHEN CHEN

口試委員:

英家慶、陳昭宏

口試委員(外文):

YING,JIA-CHING、CHENG, JAO-HONG

口試日期:

2019-06-26

學位類別:

碩士

校院名稱:

國立雲林科技大學

系所名稱:

資訊管理系

學門:

電算機學門

學類:

電算機一般學類

論文種類:

學術論文

論文出版年:

2019

畢業學年度:

106

語文別:

中文

論文頁數:

中文關鍵詞:

自然語言處理、遞歸神經網路、字詞轉向量、問題生成、長短期記憶網路

外文關鍵詞:

Natural Language Processing、Recurrent Neural Network、Word2Vec、Question Generator、Long Short-Term Memory

相關次數:

被引用:0
點閱:282
評分:
下載:3
書目收藏:1

近年來深度學習蓬勃發展，讓機器學習方式學習到人類的行為模式，而在深度學習薰陶下，機器人對話又慢慢的浮出水面，在傳統機器人對話中以自己建立的資料庫來打造機器人，必須依賴於強大問答資料庫一個問題一個答案的方式進行，往往產生出資料所花費時間與理想超出許多，在面對客戶會想知道客戶問什麼樣的問題，產生出問題成為了重要關鍵，本研究利用seq2seq模型進行訓練打造一個產生問句模型，使機器人可以學習人造句模式產生問句，以利於發現客戶所會提的問題，本研究使用八種模式進行模型訓練來產生問句，研究測試在多領域與單領域中字元級別seq2seq達到90%及85%準確率，與上述條件相同下seq2seq加入注意力機，單領域seq2seq則為80%準確率多領域為90%準確率，在多領域詞級別seq2seq加入注意力機制可達86%準確率，單領域下則80%準確率。

In recent years, deep learning has flourished, allowing machine learning to learn human behavior patterns. Under the deep learning, robot dialogue has slowly surfaced, and in the traditional robot dialogue, the robot is built with its own database. It must rely on the powerful question and answer database to answer one question and one answer. It often takes a lot of time and ideals to generate the information. In the face of the customer, they will want to know what kind of problem the customer asks, and the problem becomes an important key. This study uses the seq2seq model to train to create a question-making model, so that the robot can learn the artificial sentence pattern to generate questions, in order to find out the problems that the customer will ask. This study uses eight models to carry out model training to generate questions. The research test achieved 90% and 85% accuracy in the character level seq2seq in multi-domain and single-field. Under the above conditions, seq2seq was added to the attention machine, and the single-field seq2seq was 80% accurate. The multi-field was 90% accurate. The multi-domain word level seq2seq adds 86% accuracy to the attention mechanism and 80% accuracy in the single field.

摘要 i
Abstract ii
目錄 iii
表目錄 v
圖目錄 vi
壹、緒論 1
一、研究動機 1
二、研究目的 2
貳、文獻探討 3
一、 Word Embedding 3
二、Word2Vec 3
三、遞歸神經網路（Recurrent neural network, RNN） 5
四、長短期記憶網路（Long Short Term Memory Network, LSTM） 7
五、seq2seq模型 9
六、注意力機制（Attention） 10
柒、批量標準化（Batch Normalization） 11
參、實驗方法 12
一、單領域與多領域 13
二、 Char-level字元級and word-level詞級 13
三、 word2vec 13
四、 Bi-LSTM 雙向長短期記憶, Attention注意力機制, BN批量標準化 14
肆、實驗結果 15
一、 character-level字元級+BI-LSTM雙向長短期記憶 15
二、 word2vec+BI-LSTM雙向長短期記憶 16
三、 character-level字元級+BI-LSTM雙向長短記憶+Attention注意力機制 16
四、 word-level詞級+ BI-LSTM雙向長短期記憶+ Attention注意力機制 18
五、輸出問句結果 19
5.1 character-level字元級+ BI-LSTM雙向長短期記憶+單領域 19
5.2 character-level詞級+ BI-LSTM雙向長短期記憶+多領域 21
5.3 word2vec +BI-LSTM雙向長短期記憶+單領域 22
5.4 word2vec +BI-LSTM雙向長短期記憶+多領域 23
5.5 character-level字元級+BI-LSTM雙向長短期記憶+Attention注意力機制+單領域 25
5.6 character-level字元級+BI-LSTM雙向長短期記憶 + Attention注意力機制+多領域 26
5.7 word-level詞級+ BI-LSTM雙向長短期記憶+ Attention注意力機制+單領域 27
5.8 word-level詞級+BI-LSTM雙向長短期記憶+Attention注意力機制+多領域 29
伍、討論與未來展望 31
參考文獻 33

Bellman, R. (2013). Dynamic programming: Courier Corporation.
Donahue, J., Anne Hendricks, L., Guadarrama, S., Rohrbach, M., Venugopalan, S., Saenko, K., & Darrell, T. (2015). Long-term recurrent convolutional networks for visual recognition and description. Paper presented at the Proceedings of the IEEE conference on computer vision and pattern recognition.
Elman, J. L. (1990). Finding structure in time. 14(2), 179-211.
Graves, A., Mohamed, A.-r., & Hinton, G. (2013). Speech recognition with deep recurrent neural networks. Paper presented at the 2013 IEEE international conference on acoustics, speech and signal processing.
Hinton, G. E., McClelland, J. L., & Rumelhart, D. E. (1984). Distributed representations: Carnegie-Mellon University Pittsburgh, PA.
Hopfield, J. J. (1982). Neural networks and physical systems with emergent collective computational abilities. 79(8), 2554-2558.
Ioffe, S., & Szegedy, C. (2015). Batch normalization: Accelerating deep network training by reducing internal covariate shift.
Jordan, M. (1986). Attractor dynamics and parallelism in a connectionist sequential machine. Paper presented at the Proc. of the Eighth Annual Conference of the Cognitive Science Society (Erlbaum, Hillsdale, NJ), 1986.
Lilleberg, J., Zhu, Y., & Zhang, Y. (2015). Support vector machines and word2vec for text classification with semantic features. Paper presented at the 2015 IEEE 14th International Conference on Cognitive Informatics & Cognitive Computing (ICCI* CC).
Liu, X., Xia, T., Wang, J., Yang, Y., Zhou, F., & Lin, Y. (2016). Fully convolutional attention networks for fine-grained recognition.
Mikolov, T., Chen, K., Corrado, G., & Dean, J. (2013). Efficient estimation of word representations in vector space.
Mikolov, T., Yih, W.-t., & Zweig, G. (2013). Linguistic regularities in continuous space word representations. Paper presented at the Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies.
Mnih, V., Heess, N., & Graves, A. (2014). Recurrent models of visual attention. Paper presented at the Advances in neural information processing systems.
Redmon, J., Divvala, S., Girshick, R., & Farhadi, A. (2016). You only look once: Unified, real-time object detection. Paper presented at the Proceedings of the IEEE conference on computer vision and pattern recognition.
Schuster, M., & Paliwal, K. (1997). Bidirectional recurrent neural networks. 45(11), 2673-2681.
Stroh, E., Student, S., & Mathur, P. (2016). Question answering using deep learning. In.
Sutskever, I., Vinyals, O., & Le, Q. V. (2014). Sequence to sequence learning with neural networks. Paper presented at the Advances in neural information processing systems.
Venugopalan, S., Rohrbach, M., Donahue, J., Mooney, R., Darrell, T., & Saenko, K. (2015). Sequence to sequence-video to text. Paper presented at the Proceedings of the IEEE international conference on computer vision.
Colah(2015) Understanding LSTM Networks .Available from http://colah.github.io/posts/2015-08-Understanding-LSTMs/
Googleblog(2015) Computer, respond to this email. Available from https://ai.googleblog.com/2015/11/computer-respond-to-this-email.html
YJango(民107) YJango的循环神经网络——介绍.民國107年12月20號取自: https://zhuanlan.zhihu.com/p/24720659

電子全文

國圖紙本論文

推文
網路書籤
推薦
評分
引用網址
轉寄

top

相關論文
相關期刊
熱門點閱論文

1.	基於長短期記憶遞迴類神經網路之新台幣兌美元匯率預測模型
2.	不同模型組合的多重注意力機制於影像標題生成之應用
3.	類神經網路在行銷主軸與產品文案應用
4.	聊天機器人之研製-以PTT八卦板文章為知識庫
5.	以循環神經網路用於台灣即時地震偵測
6.	LSTM為基礎的財經新聞與美股股價指數對台灣股價指數期貨趨勢預測之研究
7.	自然語言處理之深度學習模型於股市消息面情緒判別分析之研究
8.	財經新聞標題情感維度預測之研究
9.	基於嵌入式系統實現即時影像情境辨識系統
10.	以多語系自然語言理解與機器學習為基之智慧型專利摘要系統
11.	基於主題正規化遞歸神經網路的自動名詞解釋
12.	對話系統應用於中文線上客服助理:以電信領域為例
13.	馬可夫遞迴神經網路於時序性深度學習之研究
14.	基於資料驅動學習的無袖式血壓量測系統
15.	台灣財經新聞之雙向長短期記憶語意判別模型

無相關期刊

1.	透過詞向量相關模型及網路搜尋以提升聊天機器人之延伸對答能力
2.	使用深度學習Seq2seq方法處理短文本對話生成
3.	基於SCOOP演算法結合R-CNN 於多鏡頭即時影像人員追蹤定位及熱點分析
4.	利用Arduino自製足壓感測器於下肢骨折術後漸進式負重復健之研究分析
5.	利用基因演算法於人臉辨識區別與特徵分析
6.	利用機器人教育於國小學童科技知識能力之質性研究
7.	應用於無線電力驅動光耦合二極體之無線測試介面電路
8.	利用自製曲度與壓力感測器於不同日常手部使用動作手指彎度與壓力搜集與分析研究
9.	藏頭詩生成系統:Seq2Seq 控制訊號的應用
10.	RNN、LSTM與Seq2Seq with Attention演算法為基礎的台灣股價趨勢預測之研究
11.	聊天機器人之研製-以PTT八卦板文章為知識庫
12.	陪伴型機器人在失智症照護的效果：系統性文獻回顧與統合分析
13.	不同NLP模型對意圖辨識聊天機器人效能影響之研究
14.	高齡健康監測穿戴產品之污名感知研究
15.	閩南諺語與生活教育之探究

簡易查詢 | 進階查詢 | 熱門排行 | 我的研究室