|
[1] C.-C. Huang, K.-M. Lin, L.-F. Chien. Automatic Training Corpora Acquisition through Web Mining. In 2005 IEEE/WIC/ACM Conference on Web Intelligence, July 2005. [2] C.-C. Huang, S.-L. Chuang, L.-F. Chien. LiveClassifier: Creating Hierarchical Text Classifiers through Web Corpora. In World Wide Web Conference, 2004. [3] Chen-Ming Hung and Lee-Feng Chien. Web-Based Text Classication in the Absence of Manually Labeled Training Documents. In Journal of the American Society for Information Science and Technology, 2007. [4] Y. Qui and H. Frei. Concept based query expansion. In Proceedings of the 16th Annual International ACM SIGIR Conference, pages 160–169, 1993. [5] J. Xu and W. Croft. Query expansion using local and global document analysis. In Proceedings of the 19th Annual International ACM SIGIR Conference, pages 412–420, 1996. [6] C. Carpineto, R. De Mori, G. Romano, and B. Bigi. An information-theoretic approach to automatic query expansion. ACM Transactions on Information Systems, 19(1):1–27, 2001. [7] K. Nigam, A. K. McCallum, S. Thrun, and T. M. Mitchell. Text classification from labeled and unlabeled documents using EM. Machine Learning, 39(2/3):103–134, 2000. [8] A. McCallum and K. Nigam. Text classification by bootstrapping with keywords. In ACL Workshop for Unsupervised Learning in Natural Language Processing, 1999. [9] J. H. H. Yu, C. Zhai. Text classification from positive and unlabeled documents. In Proceedings of the 12th Annual International ACM Conference on Information and Knowledge Management, pages 232–239, 2003. [10] H. Yu. SVMC: Single-class classification with support vector machines. In Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence, 2003. [11] Reuters-21578 Text Categorization Test Collection. http://www.daviddlewis.com/resources/testcollections/reuters21578/ [12] Google Search Engine. http://www.google.com [13]The Lemur Toolkit for Language Modeling and Information Retrieval. http://www.lemurproject.org/ [14] Rainbow. http://www.cs.cmu.edu/~mccallum/bow/rainbow/ [15] D. W. C. Kwok, O. Etzioni. Scaling question answering to the web. In Proceedings of the 10th international conference on World Wide Web, pages 150–161, 2001. [16] R. Goldman and J. Widom. A practical approach for combined querying of databases and the web. In Proceedings of the ACM SIGMOD International Conference on Management of Data, pages 285–296, 2000. [17] A. Kilgarriff and G. Greffenstette. Introduction to the special issue on web as corpus. Computational Linguistics, 29(3), 2003. [18] Dmoz Open Directory Project, http://www.dmoz.org/ [19] Javed Aslam, Katya Pelekhov, and Daniela Rus. A Practical Clustering Algorithm for Static and Dynamic Information Organization". In SODA: ACM-SIAM Symposium on Discrete Algorithms, 1999. [20] Yahoo! Directory. http://dir.yahoo.com/ [21] S.-L. Chuang, L.-F. Chien. Towards Automatic Generation of Query Taxonomy: A Hierarchical Query Clustering Approach. In Proc. the 2002 IEEE International Conference on Data Mining (ICDM), pages 75-82, Dec. 2002. [22] Classifier for Computer Science. http://irlab.csie.org/~ccy/cgi-bin/classifier/. [23] Yahoo! Directory for Computer science. http://dir.yahoo.com/Science/Computer_Science/ [24] Microsoft Libra for Computer Science Directory. http://libra.msra.cn/ [25] File::Random Perl Module. http://search.cpan.org/~bigj/File-Random-0.17/Random.pm
|