|
[Altschul and Lipman, 1989] Altschul, S. F. and Lipman, D. J. (1989). Trees, stars, and multiple biological sequence alignment. SIAM Journal on Applied Mathematics, 49(1):197-209. [Ashish and Knoblock, 1997] Ashish, N. and Knoblock, C. (1997). Semi-automatic wrapper generation for Internet information sources. In Proceedings of Cooperative Information Systems. [Bunke and Sanfeliu, 1990] Bunke, H. and Sanfeliu, A., editors (1990). Syntactic and Structure Pattern Recognition: Theory and Application. World Scientific. [Califf and Mooney, 1997] Califf, M. and Mooney, R. (1997). Relational learning of pattern-match rules for information extraction. Working Papers of the ACL-97 Workshop in Natural Language Learning, pages 9-15. [Cardie, 1997] Cardie, C. (1997). Empirical methods in information extraction. AI Journal, 18(4):65-79. [Clark, 1997] Clark, J. (1997). ``SP'' - an SGML system conforming to international standard ISO 8879 - standard generalized markup language. http://www.jclark.com/sp/index.htm. [Cowie and Lehnert, 1996] Cowie, J. and Lehnert, W. (1996). Information extraction. Communications of the ACM, 39(1):80-91. [Doorenbos et al., 1997] Doorenbos, R., Etzioni, O., and Weld, D. (1997). A scalable comparison-shopping agent for the World-Wide Web. In Proceedings of Autonomous Agents, pages 39-48. [Freitag, 1998] Freitag, D. (1998). Information extraction from HTML: Application of a general machine learning approach. In Proceedings of the Fifteenth National Conference on Artifical Intelligence (AAAI-98), pages 517-523. [Fukuda and kamata, 1984] Fukuda, H. and Kamata, K. (1984). Inference of tree automata from sample set of trees. International Journal of Computer and Information Sciences, 13(3):177-196. [Hao et al., 1996] Hao, X., Wang, J. T. L., and Ng, P. A. (1996). Information extraction from the structured part of office documents. Information Sciences, 91:245-274. [Hsu and Dung, 1998] Hsu, C.-N and Dung, M.-T. (1998). Generating finite-state transducers for semi-structured data extraction from the Web. Journal of Information Systems, 23(8):521-538. Special issue on Semi-structured Data. [Hsu and Yih, 1997] Hsu, J. Y.-J and Yih, W.-T. (1997). Template-based information mining from HTML documents. In Proceedings of AAAI-97, pages 256-262. [Huffman, 1995] Huffman, S. (1995). Learning information extraction patterns from examples. Workshop on new approaches to learning for natural language processing (ijCai-95), pages 127-142. [Kushmerick, 1997] Kushmerick, N. (1997). Wrapper induction for information extraction. PhD thesis, Department of Computer Science, University of Washington. Technical Report UW-CSE-97-11-04. [Kushmerick et al., 1997] Kushmerick, N., Weld, D., and Doorenbos, R. (1997). Wrapper induction for information extraction. In Proceedings of Fifteenth International Joint Conference on Artificial Intelligence (IJCAI-97), pages 729-735. [Leu, 1998] Leu, C.-H (1998). Implementation and application of approximate tree matching for information extraction from HTML documents. Master's thesis, Department of Computer science and Information Engineering, National Taiwan University. [Muslea et al., 1998] Muslea, I., Minton, S., and Knoblock, C. A. (1998). STALKER: Learning extraction rules for semi-structured, Web-based information sources. Technical Report WS-98-01, AAAI Press, Menlo Park, CA. [Myers et al., 1997] Myers, G., Selznick, S., Zhang, Z., and Miller, W. (1997). Progressive multiple alignment with constraints. In Proceedings of the First Annual International Conference on Computational Molecular Biology, pages 220-225. [Pevzner, 1992] Pevzner, P. A. (1992). Multiple alignment, communication cost, and graph matching. SIAM Journal on Applied Mathematics, 52(6):1763-1779. [Reinert et al., 1997] Reinert, K., Lenhof, H.-P., Mutzel, P., Mehlhorn, K., and Kececioglu, J. (1997). Branch-and-cut algorithm for multiple sequence alignment. In Proceedings of the First Annual International Conference on Computational Molecular Biology, pages 241-250. [Riloff, 1993] Riloff, E. (1993). Automatically constructing a dictionary for information extraction tasks. In Proceedings of the Eleventh Annual Conference on Artificial Intelligence, pages 811-816. [Riloff, 1996] Riloff, E. (1996). Automatically generating extraction patterns from untagged text. In Proceedings of the Thirteenth Annual Conference on Artificial Intelligence, pages 1044-1049. [Rus and Subramanian, 1997] Rus, D. and Subramanian, D. (1997). Customizing information capture and access. ACM Transactions on Information Systems, 15(1):67-101. [Shibuya and Imai, 1997] Shibuya, T. and Imai, H. (1997). New flexible approaches for multiple sequence alignment. In Proceedings of the First Annual International Conference on Computational Molecular Biology, pages 267-276. [Soderland, 1997] Soderland, S. (1997). Learning to extract text-based information from the World-Wide Web. In Proceedings of Third International Conference on Knowledge Discovery and Data Mining (KDD-97), pages 251-254. [Soderland, 1999] Soderland, S. (1999). Learning information extraction rules for semi-structured and free text. Machine Learning, pages 1-44. [Soderland et al., 1995] Soderland, S., Fisher, D., Aseltine, J., and Lehnert, W. (1995). CRYSTAL: Inducing a conceptual dictionary. In Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence (IJCAI-95), pages 1314-1319. [Waterman, 1995] Waterman, M. S. (1995). Introduction to Computational Biology: Maps, Sequences and Genomes. Chapman & Hall. [Yih, 1997] Yih, W.-T. (1997). Template-based information extraction from tree-structured HTML documents. Master's thesis, Department of Computer science and Information Engineering, National Taiwan University. [Zhang et al., 1994] Zhang, K., Shasha, D., and Wang, J. (1994). Approximate tree matching in the presence of variable length don't care. Journal of Algorithms, 16:33-66.
|