[1] Andrew Moore, "Statistical Data Mining Tutorials", http://www.autonlab.org/tutorials/
[2] C. Cortes and V. Vapnik. "The Nature of Statistical Learning Theory", Springer, 1995.
[3] Trevor Hastie, Robert Tibshirani, and Jerome Friedman. "Hierarchical clustering". The Elements of Statistical Learning, New York, Springer, Section 14.3.12, pages 472-479, 2001.
[4] SVMlight, http://svmlight.joachims.org/
[5] J. Z. Liang, "SVM Multi-classifier and web document classification", Proceedings of the International Conference on Machine Learning and Cybernetics, 2004 (ICMLC'04).
[6] J.Q. Zou, G.L. Chen, and W.Z. Guo, "Chinese Web Page Classification Using Noise-tolerant Support Vector Machines", IEEE International Conference on Natural Language Processing and Knowledge Engineering, 2005 (NLP-KE'05).
[7] J. D. M. Rennie and R. Rifkin, "Improving Multi-class Text Classification with the Support Vector Machine", Massachusetts Institute of Technology, Tech. Rep. AIM-2001-026, 2001.
[8] J.A. Hartigan. "Clustering Algorithms", Wiley, 1975.
[9] Kunal Punera, Suju Rajan, and Joydeep Ghosh. "Automatically learning document taxonomies for hierarchical classification", WWW: Special interest tracks and posters of the 14th International Conference on World Wide Web, pages 1010-1011, 2005.
[10] Lei Tang, Jianping Zhang, and Huan Liu. "Acclimatizing Taxonomic Semantics for Hierarchical Content Classification", Knowledge Discovery and Data Mining Conference, August 20-23, 2006, Philadelphia, Pennsylvania, USA (KDD 2006).
[11] Kunal Punera, Suju Rajan, and Joydeep Ghosh. "Automatic Construction of N-ary Tree Based Taxonomies", Sixth IEEE International Conference on Data Mining Workshops (ICDMW'06).
[12] Susan Dumais and Hao Chen. "Hierarchical classification of web content", SIGIR, 2000.
[13] Ke Wang, Senqiang Zhou, and Shiang Chen Liew. "Building hierarchical classifiers using class proximity", In Proc. of the 25th International Conference on Very Large Data Bases, pages 363-374, 1999 (VLDB'99).
[14] Daphne Koller and Mehran Sahami. "Hierarchically classifying documents using very few words", International Conference on Machine Learning, pages 170-178, 1997 (ICML'97).
[15] Tie-Yan Liu, Yiming Yang, Hao Wan, Hua-Jun Zeng, Zheng Chen, and Wei-Ying Ma. "Support vector machines classification with a very large-scale taxonomy", ACM Special Interest Group on Knowledge Discovery and Data Mining Explor. Newsl., 2005 (ACM SIGKDD'05).
[16] I. S. Dhillon, S. Mallela, and R. Kumar. "Enhanced word clustering for hierarchical text classification", Knowledge Discovery and Data Mining Conference, pages 191-200, 2002 (KDD'02).
[17] D. Lewis. Reuters-21578 Text Categorization Test Collection, Distribution 1.0, Manuscript, 1997, http://www.daviddlewis.com/resources/testcollections/reuters21578.
[18] G. Salton and C. Buckley. "Term Weighting Approaches in Automatic Text Retrieval", Information Processing and Management, 1988.
[19] Data mining from Wikipedia, http://en.wikipedia.org/wiki/Data_mining
[20] Wikipedia, http://www.wikipedia.org/
[21] Lijuan Cai and Thomas Hofmann. "Hierarchical document categorization with support vector machines", Conference on Information and Knowledge Management, pages 78-87, 2004 (CIKM'04).
[22] A.K. Jain and R.C. Dubes. "Algorithms for Clustering Data", Prentice Hall, Englewood Cliffs, NJ, 1988.
[23] M. Aghagolzadeh, H. Soltanian-Zadeh, B. Araabi, and A. Aghagolzadeh. "A Hierarchical Clustering Based on Mutual Information Maximization", IEEE International Conference on Image Processing, 2007 (ICIP'07).