|
參考文獻 中文參考文獻 范長康, & 蔡文祥(1987)。以鬆弛法作中文斷詞。全國計算機會議論文集,423-431。 許菱祥(1970)。中文文法。大中國圖書公司。 彭維謙、劉士綱、杜協昌、翁稷安、項潔(2012)。自動擷取中文典籍中人名之嘗試:以PMI (Pointwise Mutual Information) 斷詞於《資治通鑑》的應用為例。第四屆數位典藏與數位人文國際研討會,台北。 英文參考文獻 Breiman, L., Friedman, J., Stone, C. J., & Olshen, R. A. (1984). Classification and regression trees. CRC press. Breiman, L. (2001). Random forests. Machine learning, 45(1), 5-32. Chen, C. J., Bai, M. H., & Chen, K. J., (1997). Category Guessing for Chinese Unknown Words. Proceedings of the Natural Language Processing Pacific Rim Symposium, 35-40. Chen, K. J., & Bai, M. H. (1998). Unknown word detection for Chinese by a corpus-based learning method. Journal of Computational Linguistics and Chinese Language Processing, 3(1), 27-44. Chen, K. J., & Liu, S. H. (1992). Word identification for Mandarin Chinese sentences. Proceedings of the 14th conference on Computational linguistics 1, 101-107. Association for Computational Linguistics. Chen, K. J., & Ma, W. Y. (2002). Unknown word extraction for Chinese documents. Proceedings of the 19th international conference on Computational linguistics 1, 1-7. Association for Computational Linguistics. Chiang, T. H., Chang, J. S., Lin, M. Y., & Su, K. Y. (1992). STATISTICAL MODELS~ FOR WOFID SEGMENTATION AND UNKNOWN WORD RESOLUTION. Han, J., & Kamber, M. (2000). Data mining: concepts and techniques (the Morgan Kaufmann Series in data management systems). Li, M., Gao, J., Huang, C., & Li, J. (2003). Unsupervised training for overlapping ambiguity resolution in Chinese word segmentation. Proceedings of the second SIGHAN workshop on Chinese language processing 17, 1-7. Association for Computational Linguistics. Luo, X., Sun, M., & Tsou, B. K. (2002). Covering ambiguity resolution in Chinese word segmentation based on contextual information. Proceedings of the 19th international conference on Computational linguistics 1, 1-7. Association for Computational Linguistics. Ma, W. Y., & Chen, K. J. (2003). A bottom-up merging algorithm for Chinese unknown word extraction. Proceedings of the second SIGHAN workshop on Chinese language processing 17, 31-38. Association for Computational Linguistics. Neyman, J., & Pearson, E. S. (1966). Joint statistical papers. University of California Press. Nie, J. Y., Hannan, M. L., Jin, W. (1995). Combining dictionary, rules, and statistical information in segmentation of Chinese. Journal of Computer Processing of Chinese and Oriental Languages, 9(2), 125–143. Peng, F., Feng, F., & McCallum, A. (2004). Chinese segmentation and new word detection using conditional random fields. Proceedings of the 20th international conference on Computational Linguistics, p. 562. Association for Computational Linguistics. Quinlan, J. (1993). C4. 5: Programs for Machine Learning. C4. 5-programs for machine learning/J. Ross Quinlan. Shannon, C. E., & Weaver, W. (1949). University of Illinois Press. Urbana, 104-107. Sproat, R., & Shih, C. (1990). A statistical method for finding word boundaries in Chinese text. Journal of Computer Processing of Chinese and Oriental Languages, 4(4), 336-351. Tseng, H., Chang, P., Andrew, G., Jurafsky, D., & Manning, C. (2005). A conditional random field word segmenter for sighan bakeoff 2005. Proceedings of the fourth SIGHAN workshop on Chinese language Processing 171. Vapnik, V. N. (1995). The Nature of Statistical Learning Theory. Springer-Verlag, New York. Yeh, C. L., & Lee, H. J. (1991). Rule-Based Word Identification for Mandarin Chinese Sentences - A Unification Approach. Journal of Computer Processing of Chinese and Oriental Languages, 5(2), 97-118. Zhang, K., Liu, Q., Zhang, H., & Cheng, X. Q. (2002). Automatic recognition of Chinese unknown words based on roles tagging. Proceedings of the first SIGHAN workshop on Chinese language processing 18, 1-7. Association for Computational Linguistics. Zheng, J. H., & Wu, F. F. (1999). Study on Segmentation of Ambiguous Phrases with the Combinatorial Type. Collections of Papers on Computational Linguistics. Tsinghua University Press, Beijing, 129-134. 網路資源 曾元顯(2012)。圖書館學與資訊科學大辭典。國家教育研究院。http://terms.naer.edu.tw/detail/1678997/ Quinlan, J. R. (2003), C5.0: An Informal Tutorial, Retrieved from https://www.rulequest.com/see5-unix.html. Tsai, C. H. (1996), MMSEG: A Word Identification System for Mandarin Chinese Text Based on Two Variants of the Maximum Matching Algorithm, http://technology.chtsai.org/mmseg/.
|