|
[1] T. H. Davenport and L. Prusak, Working knowledge: How organizations manage what they know. Harvard Business Press, 1998. [2] W. A. Gale and K. W. Church, “A program for aligning sentences in bilingual corpora,” in Proceedings of Association for Computational Linguistics, 1991. [3] P. F. Brown, J. C. Lai, and R. L. Mercer, “Aligning sentences in parallel corpora,” in Proceedings of the 29th annual meeting on Association for Computational Linguistics, pp. 169–176, 1991. [4] M. M. Hasan and Y. Matsumoto, “Multilingual document alignment-a study with chinese and japanese,” in Proceedings of Natural Language Processing Pacific Rim Symposium, pp. 617–623, 2001. [5] R. Nazar, “Parallel corpus alignment at the document, sentence and vocabulary levels,” Procesamiento del lenguaje natural, vol. 47, pp. 129–136, 2011. [6] T. Lee and S.-w. Hwang, “Bootstrapping entity translation on weakly comparable corpora,” in Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL 2013), vol. 1, pp. 631–640, 2013. [7] D. Nguyen, A. Overwijk, C. Hauff, D. R. Trieschnigg, D. Hiemstra, and F. De Jong, “Wikitranslate: query translation for cross-lingual information retrieval using only wikipedia,” in Evaluating Systems for Multilingual and Multimodal Information Access, pp. 58–65, Springer, 2009. [8] J. Lehmann, R. Isele, M. Jakob, A. Jentzsch, D. Kontokostas, P. N. Mendes, S. Hellmann, M. Morsey, P. van Kleef, S. Auer, et al., “Dbpedia – a large-scale, multilingual knowledge base extracted from wikipedia,” Semantic Web Journal, vol. 1, pp. 1–5, 2014. [9] P. Sorg and P. Cimiano, “Enriching the crosslingual link structure of wikipedia - a classification-based approach,” in Proceedings of the AAAI 2008 Workshop on Wikipedia and Artifical Intelligence, pp. 49–54, 2008. [10] J.-H. Oh, D. Kawahara, K. Uchimoto, J. Kazama, and K. Torisawa, “Enriching multilingual language resources by discovering missing cross-language links in wikipedia,” in Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology (WI/IAT 2008), vol. 1, pp. 322–328, 2008. [11] M. Jiang, “The business and politics of search engines: A comparative study of baidu and google’s search results of internet events in china,” New Media & Society, vol. 16, no. 2, pp. 212–233, 2014. [12] H.-T. Liao, “How does localization influence online visibility of user-generated encyclopedias? a study on chinese-language search engine result pages (serps),” in Proceedings of the 9th International Symposium on Open Collaboration, p. 27, ACM, 2013. [13] M. Diab and S. Finch, “A statistical word-level translation model for comparable corpora,” in Proceedings of Conference on Content-based Multimedia Information Access, 2000. [14] L. Shao and H. T. Ng, “Mining new word translations from comparable corpora,”in Proceedings of the 20th international conference on Computational Linguistics, Association for Computational Linguistics, 2004. [15] C.-J. Lee, J. S. Chang, and J.-S. R. Jang, “Extraction of transliteration pairs from parallel corpora using a statistical transliteration model,” Information Sciences, vol. 176, no. 1, pp. 67–90, 2006. [16] M. Nagata, T. Saito, and K. Suzuki, “Using the web as a bilingual dictionary,” in Proceedings of the workshop on Data-driven methods in machine translation, vol. 14, pp. 1–8, 2001. [17] W.-H. Lu, L.-F. Chien, and H.-J. Lee, “Translation of web queries using anchor text mining,” ACM Transactions on Asian Language Information Processing (TALIP), vol. 1, no. 2, pp. 159–172, 2002. [18] D. Zhou, M. Truran, T. Brailsford, and H. Ashman, “Ntcir-6 experiments using pattern matched translation extraction,” in Proceedings of NTCIR-6 Workshop Meeting, 2006. [19] K. Kwok, P. Deng, N. Dinstl, H. Sun, W. Xu, P. Peng, and J. Doyon, “Chinet: a chinese name finder system for document triage,” in Proceedings of 2005 International Conference on Intelligence Analysis, 2005. [20] G. Cao, J. Gao, J.-Y. Nie, and W. Redmond, “A system to mine large-scale bilingual dictionaries from monolingual web,” Proceedings of MT Summit XI, pp. 57–64, 2007. [21] D. Lin, S. Zhao, B. Van Durme, and M. Pas¸ca, “Mining parenthetical translations from the web by word alignment,” in Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, pp. 944–1002, 2008. [22] I. D. Melamed, “Models of translational equivalence among words,” Computational Linguistics, vol. 26, no. 2, pp. 221–249, 2000. [23] W. A. Gale and K. W. Church, “Identifying word correspondences in parallel texts,”in Proceedings of the workshop on Speech and Natural Language, vol. 91, pp. 152–157, 1991. [24] X. Wu, N. Okazaki, and J. Tsujii, “Semi-supervised lexicon mining from parenthetical expressions in monolingual web pages,” in Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pp. 424–432, 2009. [25] S. Karimi, F. Scholer, and A. Turpin, “Machine transliteration survey,” ACM Computting Surveys, vol. 43, no. 3, p. 17, 2011. [26] K. Knight and J. Graehl, “Machine transliteration,” Computational Linguistics, vol. 24, no. 4, pp. 599–612, 1998. [27] H. M. Meng, W.-K. Lo, B. Chen, and K. Tang, “Generating phonetic cognates to handle named entities in english-chinese cross-language spoken document retrieval,” in Automatic Speech Recognition and Understanding, 2001. ASRU’01. IEEE Workshop on, pp. 311–314, IEEE, 2001. [28] P. Virga and S. Khudanpur, “Transliteration of proper names in cross-language applications,” in Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval, pp. 365–366, 2003. [29] P. Virga and S. Khudanpur, “Transliteration of proper names in cross-lingual information retrieval,” in Proceedings of the ACL 2003 workshop on Multilingual and mixed-language named entity recognition-Volume 15, pp. 57–64, Association for Computational Linguistics, 2003. [30] J. S. Lee and K.-S. Choi, “English to korean statistical transliteration for information retrieval,” Computer Processing of Oriental Languages, vol. 12, no. 1, pp. 17–37, 1998. [31] J. S. Lee, An English-Korean transliteration and retransliteration model for Cross-lingual information retrieval. PhD thesis, Computer Science Dept., KAIST, 1999. [32] K. S. Jeong, S. H. Myaeng, J. S. Lee, and K.-S. Choi, “Automatic identification and back-transliteration of foreign words for information retrieval,” Information Processing and Management, vol. 35, pp. 523–540, 1999. [33] J. Kim, J. S. Lee, and K.-S. Choi, “Pronunciation unit based automatic english-korean transliteration model using neural network,” in Proceedings of Korea Cognitive Science Association (in Korean), 1999. [34] B.-J. Kang and K.-S. Choi, “Automatic transliteration and back-transliteration by decision tree learning,” in Proceedings of the 2nd International Conference on Language Resources and Evaluation, pp. 1135–1141, 2000. [35] B.-J. Kang, A resolution of word mismatch problem caused by foreign word transliterations and English words in Korean information retrieval. PhD thesis, Computer Science Dept., KAIST, 2001. [36] I.-H. Kang and G. Kim, “English-to-korean transliteration using multiple unbounded overlapping phoneme chunks,” in Proceedings of the 18th International Conference on Computational Linguistics, pp. 418–424, 2000. [37] I. Goto, N. Kato, N. Uratani, and T. Ehara, “Transliteration considering context information based on the maximum entropy method,” in Proceedings of MT-Summit IX, pp. 125–132, 2003. [38] H. Li, M. Zhang, and J. Su, “A joint source-channel model for machine transliteration,” in Proceedings of the 42nd Annual Meeting on association for Computational Linguistics, pp. 160–167, 2004. [39] M. Zhang, H. Li, and J. Su, “Direct orthographical mapping for machine transliteration,” in Proceedings of the 20th international conference on Computational Linguistics, p. 716, 2004. [40] Y. Al-Onaizan and K. Knight, “Translating named entities using monolingual and bilingual resources,” in Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, pp. 400–408, Association for Computational Linguistics, 2002. [41] J.-H. Oh, “Machine learning based english-to-korean transliteration using grapheme and phoneme information,” IEICE transactions on information and systems, vol. 88, no. 7, pp. 1737–1748, 2005. [42] J.-H. Oh, K.-S. Choi, and H. Isahara, “A hybrid model for extracting transliteration equivalents from parallel corpora,” in Proceedings of the 9th International Conference on Text, Speech and Dialogue, pp. 119–126, 2006. [43] J.-H. Oh, K.-S. Choi, and H. Isahara, “Improving machine transliteration performance by using multiple transliteration models,” in Proceedings of the 21st International Conference on Computer Processing of Oriental Languages, pp. 85–96, 2006. [44] P. McNamee and H. T. Dang, “Overview of the tac 2009 knowledge base population track,” in Text Analysis Conference (TAC), vol. 17, pp. 111–113, 2009. [45] H. Ji, R. Grishman, H. T. Dang, K. Griffitt, and J. Ellis, “Overview of the tac 2010 knowledge base population track,” in Third Text Analysis Conference (TAC 2010), 2010. [46] Y. Guo, G. Tang, W. Che, T. Liu, and S. Li, “Hit approaches to entity linking at tac 2011,” in Proceedings of the Fourth Text Analysis Conference (TAC 2011), 2011. [47] W. Zhang, J. Su, B. Chen, W. Wang, Z. Toh, Y. Sim, Y. Cao, C. Y. Lin, and C. L. Tan, “I2R-nus-msra at TAC 2011: Entity linking,” in Proceedings of Text Analysis Conference (TAC 2011), 2011. [48] A. Fahrni, T. Gockel, and M. Strube, “Hitsmonolingual and cross-lingual entity linking system at tac 2012: A joint approach,” in Proceedings of the Sixth Text Analysis Conference (TAC 2012), vol. 1, 2012. [49] B. Hachey, W. Radford, and J. R. Curran, “Graph-based named entity linking with wikipedia,” in Web Information System Engineering–WISE 2011, pp. 213–226, Springer, 2011. [50] P. McNamee, J. Mayfield, D. Lawrie, D. W. Oard, and D. S. Doermann, “Cross-language entity linking,” in Proceedings of International Joint Conference on Natural Language Processing (IJCNLP 2011), pp. 255–263, 2011. [51] J. Clarke, Y. Merhav, G. Suleiman, S. Zheng, and D. Murgatroyd, “Basis technology at tac 2012 entity linking,” in Proceedings of the Sixth Text Analysis Conference (TAC 2012), 2012. [52] Q. Miao, R. Fang, Y. Meng, and S. Zhang, “Frdc’s cross-lingual entity linking system at tac 2013,” in Proceedings of Text Analysis Conference (TAC 2013), 2013. [53] Y. Chen and C. Zong, “A structure-based model for chinese organization name translation,” ACM Transactions on Asian Language Information Processing (TALIP), vol. 7, no. 1, pp. 1–30, 2008. [54] D. Lowd and P. Domingos, “Efficient weight learning for markov logic networks,”in Lecture Notes in Computer Science, 4702, pp. 200–211, Springer, 2007. [55] U. Manber and G. Myers, “Suffix arrays: a new method for on-line string searches,”siam Journal on Computing, vol. 22, no. 5, pp. 935–948, 1993. [56] H. Poon and P. Domingos, “Sound and efficient inference with probabilistic and deterministic dependencies,” in Proceedings of the National Conference on Artificial Intelligence, vol. 6, pp. 458–463, 2006. [57] S. Jiampojamarn, C. Cherry, and G. Kondrak, “Integrating joint n-gram features into a discriminative training framework,” in Proceedings of NAACL-2010, (Los Angeles, CA), Association for Computational Linguistics, June 2010. [58] J.-H. Oh and K.-S. Choi, “An ensemble of transliteration models for information retrieval,” Information processing & management, vol. 42, no. 4, pp. 980–1002, 2006. [59] H. Li, M. Zhang, and J. Su, “A joint source-channel model for machine transliteration,” in Proceedings of the 42nd Annual Meeting on association for Computational Linguistics, p. 159, Association for Computational Linguistics, 2004. [60] Y. Jia, D. Zhu, and S. Yu, “A noisy channel model for grapheme-based machine transliteration,” in Proceedings of the 2009 Named Entities Workshop: Shared Task on Transliteration, pp. 88–91, Association for Computational Linguistics, 2009. [61] S. Wan and C. M. Verspoor, “Automatic english-chinese name transliteration for development of multilingual resources,” in Proceedings of the 17th international conference on Computational linguistics-Volume 2, pp. 1352–1356, Association for Computational Linguistics, 1998. [62] L. Jiang, M. Zhou, L.-F. Chien, and C. Niu, “Named entity translation with web mining and transliteration.,” in IJCAI, vol. 7, pp. 1629–1634, 2007. [63] C. Zhang, T. Li, and T. Zhao, “Syllable-based machine transliteration with extrac phrase features,” in Proceedings of NEWS 2012, pp. 52–56, 2012. [64] S. Jiampojamarn, G. Kondrak, and T. Sherif, “Applying many-to-many alignments and hidden markov models to letter-to-phoneme conversion,” in Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Proceedings of the Main Conference, (Rochester, New York), pp. 372–379, Association for Computational Linguistics, April 2007. [65] S. Jiampojamarn, C. Cherry, and G. Kondrak, “Joint processing and discriminative training for letter-to-phoneme conversion,” in Proceedings of ACL-08: HLT, (Columbus, Ohio), pp. 905–913, Association for Computational Linguistics, June 2008. [66] R. Zens and H. Ney, “Improvements in phrase-based statistical machine translation,”in HLTNAACL 2004: Main Proceedings, (Boston, Massachusetts, USA), pp. 257–264, 2004. [67] K. Crammer and Y. Singer, “Ultraconservative online algorithms for multiclass problems,” The Journal of Machine Learning Research, vol. 3, pp. 951–991, 2003. [68] G. Kondrak, “Phonetic alignment and similarity,” Computers and the Humanities, vol. 37, no. 3, pp. 273–291, 2003. [69] L. Cambell, Historical linguistics: an introduction. The MIT Press, 2004. [70] S. Robertson and H. Zaragoza, The probabilistic relevance framework: BM25 and beyond. Now Publishers Inc, 2009. [71] C. Cortes and V. Vapnik, “Support-vector networks,” Machine learning, vol. 20, no. 3, pp. 273–297, 1995. [72] D. M. Blei, A. Y. Ng, and M. I. Jordan, “Latent dirichlet allocation,” Journal of Machine Learning Research, vol. 3, no. 4-5, pp. 993–1022, 2003. [73] T. Zhang, K. Liu, and J. Zhao, “Cross lingual entity linking with bilingual topic model,” in Proceedings of the 23rd international joint conference on Artificial Intelligence (IJCAI 2013), pp. 2218–2224, AAAI Press, 2013. [74] T. Nguyen, V. Moreira, H. Nguyen, H. Nguyen, and J. Freire, “Multilingual schema matching for wikipedia infoboxes,” Proceedings of the VLDB Endowment, vol. 5, no. 2, pp. 133–144, 2011. [75] M. Zhang, H. Li, A. Kumaran, and M. Liu, “Report of news 2012 machine transliteration shared task,” in Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, pp. 10–20, 2012.
|