|
|