|
[1]A. Aizawa, “Linguistic Techniques to Improve the Performance of Automatic Text Categorization”, Proceedings of the Sixth Natural Language Processing Pacific Rim. Symposium (NLPRS), pp. 307-314, 2001. [2]S. Chen and J. Goodman, “An Empirical Study of Smoothing Techniques for Language Modeling”, Proceedings of the Thirty-Fourth Annual Meeting of the Association for Computational Linguistics, pp. 310-318, 1998 [3]W. Cavnar and J. Trenkle, “N-Gram-Based Text Categorization”, Proceedings of SDAIR-94, 3rd Annual Symposium on Document Analysis and Information Retrieval, pp. 161-175, 1994. [4]M. Damashek, “Gauging Similarity with N-Grams: Language-Independent Categorization of Text”, Science, Vol. 267, pp. 843-848, 1995. [5]S. Dumais, J. Platt, D. Heckerman, and M. Sahami, “Inductive Learning Algorithms and Representations for Text Categorization”, Proceedings of the 7th International Conference on Information and Knowledge Management, pp. 148-155, 1998 [6]J. He, A. Tan, and C. Tan, “On Machine Learning Methods for Chinese Document Categorization”. Applied Intelligence, Vol.18, pp. 311-322, 2003. [7]E. Jiang, “Learning to Semantically Classify Email Messages”, Proceeding of 2nd International Conference on Intelligent Computing, pp. 664-675, 2006 [8]T. Joachims, “Text Categorization with Support Vector Machines: Learning with Many Relevant Features”, Proceedings of the ECML, pp. 137-142, 1998. [9]W. Lam, M. Ruiz and P. Srinivasan, “Automatic text categorization and its application to text retrieval”, IEEE Transactions on Knowledge and Data Engineering, Vol. 11, pp.865-879, 1999. [10]C. D. Manning and H. Schuetze, Fundations of Statistical Natural Language Processing, MIT Press, pp.191-227, 2004. [11]F. Peng, X. Huang, D. Schuurmans, and N. Cercone, “Investigating the Relationship of Word Segmentation Performance and Retrieval Performance in Chinese IR” Proceedings of COLING, pp. 72-78, 2002. [12]F. Peng, X. Huang, D. Schuurmans, and S. Wang, “Text Classification in Asian Languages without Word Segmentation”, Proceedings of the Sixth International Workshop on Information Retrieval with Asian Languages (IRAL), Vol. 18, pp. 41-48, 2003. [13]F. Peng and D. Schuurmans, “Combining Naive Bayes and N-Gram Language Models for Text Classification”, Proceedings of ECIR2003, pp. 335-350, 2003. [14]F. Sebastian, “Machine Learning in Automated Text Categorization”, ACM Computing Surveys, Vol.34, pp.1-47, 2002 [15]C. Silva and B. Ribeiro, “Scaling Text Classification with Relevance Vector Machines”, Proceeding of IEEE Conference on Systems, Man, and Cybernetics (SMC), pp. 4186-4191, 2006. [16]W. Teahan and D. Harper, “Using Compression-Based Language Models for Text Categorization”, Proceedings of LMIR, pp. 83-88, 2001. [17]M. Tipping, ”Sparse Bayesian Learning and the Relevance Vector Machine”, Journal of Machine Learning Research, 1, pp. 211-214, 2001. [18]V. Vapnik, “The Nature of Statistical Learning Theory”, Springer-Verlag, 1995. [19]Y.C. Wu, “Chinese text categorization with term clustering”, M.S. thesis, Mining-Chuan University, 2003. [20]Y. Yang, “An Evaluation of Statistical Approaches to Text Categorization”, Information Retrieval Journal, Vol. 1, pp.69-90, 1999. [21]Y. Yang and X. Liu, “A Re-examination of Text Categorization Methods”, Proceedings of the 22nd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp.42-49, 1999. [22]S. Yen, Y. Lee, C. Lin, J. Ying, “Investigating the Effect of Sampling Methods for Imbalanced Data Distributions,” Proceedings of IEEE International Conference on Systems, Man, and Cybernetics (SMC), pp. 4163-4168, 2006.
|