National Digital Library of Theses and Dissertations in Taiwan

Detailed Record

Author: 林聰
Author (English): Tsung Lin
Title: 兼顧新聞立場之標題產生方法研究
Title (English): Learning to Generate News Headlines with Media’s Stance
Advisor: 陳信希 (Hsin-Hsi Chen)
Committee members: 陳冠宇, 鄭卜壬, 蔡宗翰
Oral defense date: 2019-07-17
Degree: Master's
Institution: National Taiwan University
Department: Computer Science and Information Engineering
Discipline: Engineering
Field: Electrical and Computer Engineering
Thesis type: Academic thesis
Publication year: 2019
Graduation academic year: 107 (2018–2019)
Language: English
Number of pages: 56
Keywords (Chinese): automatic headline generation, automatic text summarization, controllable text generation, stance, Transformer
DOI:10.6342/NTU201902714
Usage statistics:
  • Cited by: 0
  • Views: 239
  • Rating:
  • Downloads: 0
  • Bookmarked: 0
Abstract (translated from Chinese): With the rise of neural networks, many areas of natural language processing research have made brand-new progress. Text generation is one of them: neural networks can grasp complex linguistic logic and produce sentences that resemble human writing. Besides using neural networks to strengthen traditional text generation tasks such as machine translation and text summarization, other studies have begun to inject various conditions, such as tense, length, and sentiment, into the generation process. Beyond generation, neural networks are also widely applied to NLP classification tasks, where stance detection and classification is a popular line of research. Inspired by text generation and stance classification, this thesis attempts to generate news headlines that match the stances of specific Taiwanese media outlets.
Abstract (English): As neural network models thrive, natural language processing has entered a new chapter. Powerful models motivate the innovation and renovation of text generation tasks: no longer limited to tasks such as text summarization and machine translation, they now generate text under a variety of novel conditions, e.g., sentence length, tense, and sentiment. Neural models have also achieved great success in classification tasks, among which stance classification is a popular research topic. Inspired by conditional text generation and stance classification, we introduce a task of generating news headlines with the specific stances of Taiwan’s news media.
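
The table of contents below lists several conditioning variants (Sections 4.2.1 to 4.2.4), including a "Stance Token" method. As a rough sketch of how that kind of conditioning is commonly wired into a sequence-to-sequence pipeline (the token strings and the helper here are illustrative assumptions, not the thesis's actual code), a marker for the source media outlet can be prepended to each article:

    # Illustrative sketch of the "stance token" idea named in Section 4.2.3:
    # prepend a special marker for the source media outlet so a single
    # sequence-to-sequence model learns outlet-conditioned headlines.
    # Token strings and this helper are assumptions, not the thesis's code.

    STANCE_TOKENS = {"LTN": "<ltn>", "CTS": "<cts>", "UDN": "<udn>"}  # outlets named in Chapter 3

    def make_pair(article: str, headline: str, media: str) -> tuple[str, str]:
        """Return one (source, target) training pair with the stance token prepended."""
        return f"{STANCE_TOKENS[media]} {article}", headline

    src, tgt = make_pair("行政院今日宣布新能源政策...", "政院拍板能源新政", "UDN")
    print(src)  # <udn> 行政院今日宣布新能源政策...

At inference time, swapping the token asks the same trained model for a headline in a different outlet's voice for the same article body, which is what makes the conditioning controllable.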
Acknowledgements
Abstract (Chinese)
Abstract
Table of Contents
List of Figures
List of Tables
Chapter 1 Introduction
1.1 Motivation
1.2 Organization
Chapter 2 Related Work
2.1 Text Summarization and Headline Generation
2.1.2 Attention Mechanism
2.1.3 Models for Sequence to Sequence
2.1.5 Chinese Text Summarization
2.2.1 Condition Transfer or Controllable Text Generation
2.2.2 Controllable Abstractive Summarization
2.3 NLP Research about Stance
2.3.1 Stance Classification
2.3.2 Stance Transfer
Chapter 3 Dataset
3.1 Data Scraping & Cleaning
3.1.1 LTN
3.1.2 CTS & UDN
3.2 News Alignment
3.3 Character as Token
Chapter 4 Methodology and Experiments
4.1 Transformer
4.2 Generate News Headlines with Stance
4.2.1 Independent Model
4.2.2 Independent Decoder
4.2.3 Stance Token
4.2.4 Stance Query
4.3 Experimental Settings
4.3.1 Tools
4.3.2 Training Settings
4.3.3 Testing Settings
4.3.4 Metrics
4.4 Pre-Experiment
4.4.1 Bi-LSTM versus Transformer
4.4.2 Number of Paragraphs
4.4.3 Addition of Aligned Data
Chapter 5 Method Improvement
5.1 Ensemble Generation
5.1.1 Methodology
5.1.2 Experiments
5.2 Multi-Task Learning
5.2.1 Methodology
5.2.2 Experiments
Chapter 6 Human Evaluation and Examples
6.1 Metric
6.2 Generation Pairs and Dataset
6.3 Experimental Results
6.6.1 China Concern
6.6.2 NTU President Amid Concern
6.6.3 Nuclear Power Concern
6.6.4 Hong Kong Anti-Extradition Bill Protests
Chapter 7 Conclusion and Future Work
References