|
[1] I-Chen Wu, Jui-Yuan Su, Loon-Been Chen, "A Web Data Extraction Description Language and Its Implementation," compsac, pp. 293-298, 29th Annual International Computer Software and Applications Conference (COMPSAC'05) Volume 1, 2005 [2] http://www.worldwidewebsize.com [3] http://www.google.com [4] http://www.google.com/products [5] http://www.scholar.com [6] Cai, D., Yu, S., Wen, JR, Ma, WY, 2003. “VIPS: A Vision-base Page Segmentation Algorithm.”, Technical Report, MSR-TR-2003-79, Microsoft Research Asia. [7] J. Kleinberg, “Authoritative sources in a hyperlinked environment”, Journal of the ACM, Vol. 46, No. 5, pp. 604-622,1999. [8] L. Page, S. Brin, R. Motwani, and T. Winograd, “The PageRank citation ranking: Bringing order to the web”, Technical report, Stanford University, Stanford, CA, 1998. [9] Deng Cai, Xiaofei He, Ji-Rong Wen and Wei-Ying Ma., “Block-level Link Analysis”, Microsoft Technical Report MSR-TR-2004-50, 2004. [10] Deng Cai, Shipeng Yu, Ji-Rong Wen, Wei-Ying Ma., “Block-based Web Search.”, In Proc. of the SIGIR’04 Conf., pages 456-463, 2004. [11] Z. Nie, Y. Zhang, JR Wen, and WY Ma., “Object- level ranking: Bringing order to web objects.”, In Proceedings of WWW Conference, 2005. [12] Ruihua Song, Haifeng Liu, Ji-Rong Wen and Wei-Ying Ma, “Learning Block Importance Models for Web Pages[A].” In proceeding of the Thirteenth World Wide Web conference[C], New York, NY: ACM Press, 2004, 203-211. [13] Shian-Hua Lin,Jan-Ming Ho. “Discovering Informative Content Blocks from Web Documents”, KDD-02, 2002 [14] http://www.websiteoptimization.com/speed/tweak/clickstream/ [15] Chang, C.-H., and Shao-Chen, L. IEPAD: Information extraction based on pattern discovery. In Proceedings of the tenth international conference on World Wide Web(2001) [16] Arasu, A., Garcia-Molina, H.: Extracting Structured Data from Web Pages. In: Proceedings of ACM SIGMOD International Conference on Management of Data (SIGMOD 2003), San Diego, California, USA, ACM Press (2003) [17] Hung-Yu Kao, Shian-Hua Lin, Jan-Ming Ho, Ming-Syan Chen, Mining Web Information Structures and Contents based on Entropy Analysis, IEEE Transactions on Knowledge and Data Engineering , volume 16, issue 1, pages 41-55, Jan 2004. [18] Hung-Yu Kao, Jan-Ming Ho, Ming-Syan Chen, WISDOM : Web Intra-page Informative Structure Mining based on Document Object Model, IEEE Transactions on Knowledge and Data Engineering, volume 17, issue 5, pages 614- 627, May 2005. [19] Mendez-Torreblanca, A., Montes-y-Gomez, M., and Lopez-Lopez, A.: A Trend Discovery. System for Dynamic Web Content Mining. Proceedings of the 11. th. International Confer-. ence on Computing, Mexico City, Mexico (2002) [20] S. Debnath, P. Mitra, N. Pal, and C. L. Giles, “Automatic Identification of Informative Sections of Web Pages,” IEEE Transactions on Knowledge and Data Engineering 17, 9, Sep. 2005. [21] Deng Cai, Xiaofei He, Zhiwei Li, Wei-Ying Ma and Ji-Rong Wen, “Hierarchical Clustering of WWW Image Search Results Using Visual, Textual and Link Analysis”,12th ACM International Conference on Multimedia, Oct. 2004 . [22] Zaiqing Nie, Yunxiao Ma, Shuming Shi, Ji-Rong Wen and Wei-Ying Ma, Web Object Retrieval, The 16th international World Wide Web conference (WWW 2007) [23] CHEN, Z, LI, T, WANG, J, LIU, W Y and MA, W Y, "A Unified Framework for Web Link Analysis", Proceedings of the 3rd International Conference on Web Information Systems Engineering (WISE 2002), Singapore, December 2002, pp 63-72.
|