|
|