
National Digital Library of Theses and Dissertations in Taiwan


Detailed Record

Author: CHEN, HUNG-PEI (陳虹霈)
Title: Integrating Convolutional Neural Network and Recurrent Neural Network for Automatic Text Classification
Advisor: YEH, HSIANG-YUAN (葉向原)
Committee members: HSU, CHIN-HSIUNG (許晉雄); LIN, SZU-YIN (林斯寅)
Oral defense date: 2019-06-26
Degree: Master
Institution: Soochow University (東吳大學)
Department: Department of Mathematics
Discipline group: Mathematics and Statistics
Discipline: Mathematics
Thesis type: Academic thesis
Year of publication: 2019
Graduation academic year: 108
Language: Chinese
Number of pages: 30
Keywords (Chinese): 文本分類; 卷積神經網路; 遞迴神經網路; 詞頻與逆向文件頻率
Keywords (English): text classification; convolutional neural network; recurrent neural network; term frequency–inverse document frequency
Statistics:
  • Cited: 6
  • Views: 656
  • Rating:
  • Downloads: 140
  • Saved to bookshelf: 2
With the rapid development of the big data field, the demand for processing textual data grows continually, yet how to effectively classify large volumes of text automatically remains an active research topic in natural language processing. Traditional text mining often adopts the "bag-of-words" model, which discards the order of words in a sentence and their semantic relatedness, focusing mainly on word occurrence frequencies; a technique commonly used for feature extraction in text mining, term frequency–inverse document frequency (TF-IDF), is then applied to convert the text into feature vectors before training each classifier. This thesis adopts a pre-trained word vector model (GloVe) and combines a convolutional neural network with a recurrent neural network to capture both the semantics and the order of the words, using the 20Newsgroups dataset as test data. The classification results show that the best model is the combined convolutional and recurrent neural network, which achieves an accuracy of 86.3% on the test data, an improvement of about 3% over the traditional bag-of-words model.
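The TF-IDF weighting mentioned in the abstract can be sketched directly from its definition. This is an illustrative implementation of the textbook variant, weight(t, d) = tf(t, d) · log(N / df(t)); the thesis does not specify which TF-IDF variant it uses, so this is an assumption.

```python
import math

def tf_idf(docs):
    """Compute TF-IDF weights for a list of tokenized documents.

    Uses raw term frequency and the textbook idf(t) = log(N / df(t)),
    where N is the number of documents and df(t) is the number of
    documents containing term t.
    """
    n_docs = len(docs)
    # Document frequency: in how many documents each term appears.
    df = {}
    for doc in docs:
        for term in set(doc):
            df[term] = df.get(term, 0) + 1
    weights = []
    for doc in docs:
        # Raw term frequency within this document.
        tf = {}
        for term in doc:
            tf[term] = tf.get(term, 0) + 1
        weights.append({t: tf[t] * math.log(n_docs / df[t]) for t in tf})
    return weights
```

Note that a term appearing in every document gets weight 0 under this variant, which is why libraries such as scikit-learn default to a smoothed IDF instead.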
With the rapid development of big data research, the demand for processing textual information is increasing, and text classification remains an active research topic in natural language processing. Traditional text mining often uses the "bag-of-words" model, which discards the order of the words in a sentence and is mainly concerned with their frequency of occurrence; TF-IDF (term frequency–inverse document frequency) is one of the feature-extraction techniques commonly used in text mining and classification. We therefore combine a convolutional neural network and a recurrent neural network to capture both the semantics and the order of the words in a sentence for text classification, using the 20Newsgroups dataset as our test data. The best model achieves an accuracy of 86.3% on the test set, an improvement of about 3% over the traditional model.
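The traditional baseline the thesis compares against — bag-of-words/TF-IDF features fed to a classical classifier — can be sketched as a short pipeline. The tiny two-class corpus below is purely illustrative (the thesis uses the 20Newsgroups dataset), and Naïve Bayes is one of the baseline classifiers the thesis's literature review covers.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

# Toy stand-in for 20Newsgroups: two categories, four training texts.
train_texts = [
    "the rocket launch was delayed by weather",
    "nasa plans another shuttle mission",
    "the team scored in the final minute",
    "a great playoff game last night",
]
train_labels = ["space", "space", "sport", "sport"]

# Bag-of-words/TF-IDF features (with stop-word removal) + Naive Bayes.
model = make_pipeline(TfidfVectorizer(stop_words="english"), MultinomialNB())
model.fit(train_texts, train_labels)
```

The CNN+RNN model replaces the TF-IDF vectorizer in this pipeline with GloVe word embeddings, so that word order and semantics — which TF-IDF discards — reach the classifier.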
Table of Contents
Acknowledgements.............................................................i
Chinese Abstract............................................................ii
Abstract...................................................................iii
Table of Contents...........................................................iv
List of Figures.............................................................vi
List of Tables.............................................................vii
Chapter 1  Introduction......................................................1
  1.1 Research Motivation and Objectives.....................................1
  1.2 Thesis Organization....................................................2
Chapter 2  Literature Review.................................................3
  2.1 Automatic Text Classification..........................................3
  2.2 Stop Words.............................................................3
  2.3 Bag-of-Words Model.....................................................4
  2.4 TF-IDF (Term Frequency–Inverse Document Frequency).....................6
  2.5 Naïve Bayes (NB) Classifier............................................7
  2.6 Support Vector Machine (SVM)...........................................7
  2.7 Word Vectors (Word2vec)................................................8
Chapter 3  Methodology......................................................13
  3.1 Research Process and Architecture.....................................13
  3.2 Global Vectors for Word Representation (GloVe)........................15
  3.3 Convolutional Neural Network (CNN)....................................19
  3.4 Recurrent Neural Network (RNN)........................................22
  3.5 Long Short-Term Memory (LSTM).........................................24
Chapter 4  Experimental Design and Results..................................26
  4.1 Experimental Data.....................................................26
  4.2 Experimental Procedure................................................28
  4.3 Experimental Results..................................................29
Chapter 5  Conclusion.......................................................30
References..................................................................31



