National Digital Library of Theses and Dissertations in Taiwan (臺灣博碩士論文加值系統)


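Graduate student: Chiung-Chih Chang (張瓊之)
Thesis title: Style-Changeable Chatbot (可轉變對話風格的聊天機器人)
Advisor: Hung-yi Lee (李宏毅)
Oral defense committee: Yun-Nung Chen (陳縕儂), Tsung-Han Tsai (蔡宗翰), Yu Tsao (曹昱), Ying-Hui Lai (賴穎暉)
Oral defense date: 2019-01-03
Degree: Master's
Institution: National Taiwan University (國立臺灣大學)
Department: Graduate Institute of Electrical Engineering (電機工程學研究所)
Discipline: Engineering
Field: Electrical and Computer Engineering
Thesis type: Academic thesis
Publication year: 2019
Graduation academic year: 107 (ROC calendar, i.e. 2018–2019)
Language: Chinese
Number of pages:
Keywords: natural language processing, chatbot, reinforcement learning, variational recurrent autoencoder, cycle generative adversarial network
DOI: 10.6342/NTU201900404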


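Abstract:

This thesis studies how to train a chatbot that can produce responses in different styles, and compares the results obtained with several models. Demand for conversational agents has grown in recent years: many companies hope to use them to reduce the staff needed to communicate with customers, and many chatbots built for entertainment or education have also appeared. Most of these agents are trained without regard to their own personality or conversational style; they aim only at grammatical output or at returning relevant information. This thesis argues that a chatbot with a distinctive style is a topic worth exploring. A chatbot that can adapt the style of its replies to the situation may feel more like a real person, and a chatbot biased toward one particular style can serve as preliminary work toward that goal. This thesis therefore investigates the problem.

The models studied fall into two categories: those that require modifying the chatbot and those that do not. The first category comprises the persona-based model and the reinforcement learning model; the second comprises the plug-and-play model and the CycleGAN model. The experiments try a range of parameters and methods for these four models, then evaluate them and present examples of generated sentences, with the aim of finding a better way to train a chatbot with a distinctive conversational style.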

Table of contents:

Oral Defense Committee Certification
Acknowledgements
Chinese Abstract
Chapter 1: Introduction
1.1 Research background
1.2 Related work
1.3 Research direction
1.4 Chapter organization
Chapter 2: Background
2.1 Reinforcement Learning (RL)
2.1.1 Reinforcement learning example
2.1.2 Policy Gradient algorithm
2.2 Variational Recurrent AutoEncoder (VRAE)
2.2.1 Autoencoder
2.2.2 Variational AutoEncoder (VAE)
2.2.3 Recurrent Neural Network (RNN)
2.2.4 Variational Recurrent AutoEncoder (VRAE)
2.3 Cycle Generative Adversarial Network (CycleGAN)
2.3.1 Generative Adversarial Network (GAN)
2.3.2 Cycle Generative Adversarial Network (CycleGAN)
2.4 Chapter summary
Chapter 3: Style-Changeable Chatbot
3.1 Introduction
3.1.1 Motivation
3.1.2 Model overview
3.2 Models that require modifying the chatbot
3.2.1 Persona-Based Model
3.2.2 Reinforcement Learning model
3.3 Models that do not require modifying the chatbot
3.3.1 Plug and Play Model
3.3.2 CycleGAN model
3.4 Chapter summary
Chapter 4: Training Data and Evaluation Methods
4.1 Training sets
4.1.1 Chinese Emotional Conversation Generation (CECG) dataset
4.1.2 PTT dataset
4.2 Model evaluation: automatic evaluation
4.2.1 Language Model Score (LM)
4.2.2 Coherence Score 1 (Coh1)
4.2.3 Coherence Score 2 (Coh2)
4.2.4 Style Score (Style)
4.3 Model evaluation: human evaluation
4.4 Chapter summary
Chapter 5: Experiments and Analysis
5.1 Experimental results on the CECG dataset
5.1.1 Style classifier results
5.1.2 Persona-based model results
5.1.3 Reinforcement learning model results
5.1.4 Plug-and-play model results
5.1.5 CycleGAN model results
5.1.6 Generated sentence examples and model comparison
5.2 Experimental results on the PTT dataset
5.2.1 Style classifier results
5.2.2 Generated sentence examples and model comparison
5.3 Analysis of experimental results
5.4 Chapter summary
Chapter 6: Conclusion and Future Work
6.1 Conclusion
6.2 Future research directions
References

