(3.237.97.64) 您好!臺灣時間:2021/03/03 05:12
字體大小: 字級放大   字級縮小   預設字形  
回查詢結果

詳目顯示:::

我願授權國圖
: 
twitterline
研究生:劉博榮
研究生(外文):Liu, Po-Jung
論文名稱:近體詩自動分類研究
論文名稱(外文):The Study of Chinese Jintishi Categorization
指導教授:梁婷梁婷引用關係
指導教授(外文):Liang, Tyne
學位類別:碩士
校院名稱:國立交通大學
系所名稱:資訊科學與工程研究所
學門:工程學門
學類:電資工程學類
論文種類:學術論文
論文出版年:2010
畢業學年度:99
語文別:中文
論文頁數:49
中文關鍵詞:文件分類語意消歧詩作分類特徵選擇
外文關鍵詞:Text ClassificationWord Sense DisambiguationPoetry ClassificationFeature Selection
相關次數:
  • 被引用被引用:2
  • 點閱點閱:190
  • 評分評分:系統版面圖檔系統版面圖檔系統版面圖檔系統版面圖檔系統版面圖檔
  • 下載下載:0
  • 收藏至我的研究室書目清單書目收藏:2
近體詩是華人社會中一項重要的文化資產,然而很多詩作中皆含有隱喻,使得近體詩對於學生而言不容易了解其中含義。在本論文中,我們提出幾個有效的方法來做近體詩的自動分類,藉以幫助學習者對於詩作的理解。我們利用法則式的方法搭配同義詞詞林來做語意標記,以及SVM的分類模型來做詩作分類。並從詩作的語料中探勘七種特徵來做為分類特徵,再利用Forward Sequential Selection Algorithm來做為選取特徵的演算法,而我們所提出的方法經過217首的五言絕句來做六個類別近體詩的詩作分類實驗,可達到72.35%的正確率。
Chinese Jintishi is one important heritage in Chinese societies. Nevertheless, many poets use metaphors while composing their poems. So it becomes hard to understand Jintishi for high school students. In this thesis, an effective approach to automate Jintishi is presented with the aim to facilitate poem comprehension. We propose a method to tackle with semantic role labeling based on Tongyici Cilin and a SVM-based model to handle poem categorization. The categorization employs seven kinds of features mined from training corpus. Best set of features is selected by using forward sequential selection algorithm. The approach is justified in terms of 72.35% accuracy by categorizing 217 five-character quatrains into six types of Jintishi.
摘要 i
ABSTRACT ii
誌謝 iii
目錄 iv
表目錄 v
第一章 緒論 1
1.1 研究目的與動機 1
1.2 問題定義 1
1.3 論文架構 4
第二章 相關研究 6
2.1 同義詞詞林 6
2.2 詞義處理 7
2.3 詩作分類 9
第三章 詩作處理 13
3.1 語料前置處理 13
3.2 詞彙語意處理 18
3.2.1 語意辭典比對與未知詞彙處理 18
3.2.2 啟發式規則概念歧義處理 20
3.3 詞彙歧義消解實驗 23
第四章 詩作分類 28
4.1 分類特徵 28
4.2 分類實驗 31
第五章 結論 38
參考文獻 40
附錄 44


[1] Anna Korhonen, Yuval Krymolowski, Nigel Collier (2006), “Automatic Classification of Verbs in Biomedical Texts.” In Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics, Sydney, Australia, pp. 345-352.
[2] Canasai Kruengkrai, Kiyotaka Uchimoto, Jun’ichi Kazama,YiouWang, Kentaro Torisawa, Hitoshi Isahara (2009), “An Error-Driven Word-Character Hybrid Model for Joint Chinese Word Segmentation and POS Tagging.” In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, Suntec, Singapore, pp. 513-521.
[3] Catherine Plaisant, James Rose (2006), “Exploring erotics in Emily Dickinson's correspondence with text mining and visual interfaces.”Proceedings of the 6th ACM/IEEE-CS joint conference on Digital libraries, Chapel Hill, NC, USA, pp. 141-150.
[4] Chih-Chung Chang and Chih-Jen Lin, LIBSVM : a library for support vector machines (2001). Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm
[5] Corinna Cortes and Vladimir Vapnik (1995), "Support-Vector Networks", Machine Learning, Vol. 20, pp. 273-297.
[6] Gerard Escudero and Llu?瀏 M?跫quez and German Rigau (2004),“An Empirical Study of the Domain Dependence of Supervised Word Sense Disambiguation Systems.”Joint SIGDAT Conference on Empirical Methods in NLP and Very Large Corpor, Hong Kong.
[7] Huan Liu and Rudy Setiono(1995), “Chi2: Feature selection and discretization of numeric attributes.” In Proceedings of the Seventh International Conference on Tools with Artificial Intelligence, Washington, DA, USA, pp.388-391.
[8] Ian Niles and Adam Pease(2003), “Linking Lexicons and Ontologies: Mapping WordNet to the Suggested Upper Merged Ontology”, In Proceedings of the 2003 International Conference on Information and Knowledge Engineering, Las Vegas, p.p. 23-26
[9] Jyrki Kivinen amd Manfred K. Warmuth (1995),“Additive versus exponentiated gradient updates for linear prediction”, Proceedings of the twenty-seventh annual ACM symposium on Theory of computing, Las Vegas, Nevada, United States, pp. 209-218.
[10] Keh-Jiann Chen, Shu-Ling Huang, Yueh-Yin Shih, Yi-Jun Chen(2005), “Extended-HowNet: A Representational Framework for Concepts” , In Proceedings of IJCNLP-05 Workshop on Lexical Semantic, Jeju Island, South Korea, p.p 1-6.
[11] Le Cuong Anh, Shimazu Akira. (2004), “High WSD Accuracy Using Na?e Bayesian Classifier with Rich Features”. PACLIC 18, Waseda University, Tokyo, pp. 105-113.
[12] Liang-Yan Li, Zhong-Shi He, Yong Yi (2004), “Poetry stylistic analysis technique based on term connections.”, In Proceedings of the Third International Conference on Machine Learning and Cybernetics, Shanghai, China, vol.5, pp. 2713- 2718.
[13] Michael Gamon (2004), “Linguistic correlates of style: authorship classification with deep linguistic analysis features”, The 20th International Conference on Computational Linguistics, Geneva, pp. 611-617.
[14] Moshe Koppel, Shlomo Argamon, and Anat R. Shimoni (2003),“Automatically Categorizing Written Texts by Author Gender.”Literary and Linguistic Computing, Volume 17, Number 2, pp 401-412.
[15] Oi Yee Kwong, Benjamin K. Tsou (2005), “Data Homogeneity and Semantic Role Tagging in Chinese.” In Proceedings of the ACL-SIGLEX Workshop on Deep Lexical Acquisition, Ann Arbor, Michigan, pp. 1-9.
[16] Roberto Navigli (2006), “Consistent Validation of Manual and Automatic Sense Annotations with the Aid of Semantic Graphs.” Association for Computational Linguistics, Vol. 32, No.2, pp. 273-281.
[17] Xiaojun Wan (2009), “Co-Training for Cross-Lingual Sentiment Classification.” In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, Suntec, Singapore, pp. 235–243.
[18] Yang, Y., Pedersen J.P (1997), “A Comparative Study on Feature Selection in Text Categorization”. Proceedings of the Fourteenth International Conference on Machine Learning, Nashville, TN, USA , pp. 412-420.
[19] Yee Seng Chan, Hwee Tou Ng (2007), “Domain Adaptation with Active Learning for Word Sense Disambiguation.” In Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, pp. 49-56.
[20] Yong Yi, Zhong-Shi He, Liang-Yan Li, Tian Yu, Elaine Yi (2005), “Advanced studies on traditional Chinese Poetry style identification.” In Proceedings of the Fourth International Conference on Machine Learning and Cybernetics, Guangzhou, China, vol.5, pp. 2936- 2939.
[21] 王??砥A“唐詩之詩風探勘”,國立交通大學,碩士論文,2006年6月。
[22] 古遠清,詩歌分類學,高雄:復文圖書出版社,1991年9月。
[23] 朱我芯,「深秋猿鳥來心上,夜靜松杉到眼前」─華文詩歌情境再現,第五屆全球華文網路教育國際研討會,台北,2007年6月。
[24] 李支舜,高考古詩詞鑑賞與應考指導,上海辭書出版社,2007年7月。
[25] 柯淑津,黃居仁,洪嘉馡,劉詩音,簡卉伶,蘇依莉,“中文詞義全文標記語料庫之設計與雛形製作”,第十九屆自然語言與語音處理研討會,2007年9月,台灣大學,台灣。
[26] 梅家駒等編著,同義詞詞林,臺灣東華書局股份有限公司,1997年3月。
[27] 許清雲,部編大學用書-近體詩創作理論,臺北市:洪葉文化,1997。
[28] 許嘉妮,“詞風與情境判斷專家系統”,國立交通大學,碩士論文,2007年6月。
[29] 陳紹宜,“建構一個中文對聯創作的知識評價架構”,國立交通大學,碩士論文,2010年6月。
[30] 楊昌樺,陳信希,“以部落格文本進行情緒分類之研究”,第十八屆自然語言與語音處理研討會,新竹,台灣,2006年9月。
[31] 羅鳳珠,“植基於中國詩詞語言特性所建構之語意概念分類體系研究”,第九屆海峽兩岸圖書資訊學學術研討會,武漢大學,2008年7月3-6日。
[32] 龔霽芃,唐詩分類鑑賞,江西人民出版社,2003年12月1日。

連結至畢業學校之論文網頁點我開啟連結
註: 此連結為研究生畢業學校所提供,不一定有電子全文可供下載,若連結有誤,請點選上方之〝勘誤回報〞功能,我們會盡快修正,謝謝!
QRCODE
 
 
 
 
 
                                                                                                                                                                                                                                                                                                                                                                                                               
第一頁 上一頁 下一頁 最後一頁 top
系統版面圖檔 系統版面圖檔