|
[1] Artetxe, M., Labaka, G., Agirre, E., Cho, K. (2018) Unsupervised Neural Machine Translation. arXiv preprint arXiv:1710.11041. [2] Badalamenti, A.F. (1991) Language and the Intuition of Meaning. Systems Research 8(4), pp. 43-66 [3] Bahdanau, D., Cho, K., Bengio, Y. (2015) Neural Machine Translation by Jointly Learning to Align and Translate. In: ICLR 2015. [4] Banchs, R.E., D''Haro, L.F., Li, H. (2015) Adequacy–Fluency Metrics: Evaluating MT in the Continuous Space Model Framework. IEEE/ACM Transactions on Audio, Speech, and Language Processing. 23(3), pp. 472–482 [5] Bengio, Y., Ducharme, R., Vincent, P., Jauvin, C. (2003) A Neural Probabilistic Language Model. Journal of Machine. Learning Research 3:1137–1155. [6] Bikel, D., & Zitouni, I. (2012). Multilingual Natural Language Processing Applications: From Theory to Practice. Indianapolis, IN: IBM Press. [7] Bordes, A., Glorot, X., Weston, J., Bengio, Y. (2012) Joint Learning of Words and Meaning Representations for Open-Text Semantic Parsing. In Proceedings of AISTATS, pp. 127–135. [8] Britz, D., Goldie, A., Luong, M.-T., Le, Q.V. (2017) Massive Exploration of Neural Machine Translation Architectures. CoRR abs/1703.03906. [9] Brown, P.F., Cocke, J., Pietra, S.A.D., Pietra, C.J.D., Jelinek, F.,Lafferty, J.D., Mercer, R.L., Roossin, P.S. (1990) A Statistical Approach To Machine Translation. Computational linguistics 16 (2), 79-85. [10]Brown, P.E., Pietra, S.A.D., Pietra, V.J.D., Mercer, R.L. (1993) The Mathematics of Statistical Machine Translation: Parameter Estimation. Computational Linguistics, 19(2):263–311. [11]Cho, K., van Merriënboer, B., Gulcehre, C., Bahdanau, D., Bougares, F., Schwenk, H., Bengio, Y. (2014) Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation. In: Proceedings of EMNLP 2014. [12]Chollet, F. (2017, September 29). A ten-minute introduction to sequence-to-sequence learning in Keras [Web blog content]. Retrieved from https://blog.keras.io/a-ten-minute-introduction-to-sequence-to-sequence-learning-in-keras.html [13]Chung, J., Cho, K., Bengio, Y. (2016) A Character-Level Decoder without Explicit Segmentation for Neural Machine Translation. CoRR abs/1603.06147 [14]Conneau, A., Schwenk, H., Barrault, L., Lecun, Y. (2017) Very Deep Convolutional Networks for Text Classification. arXiv preprint arXiv:1606.01781 [15]Devitt, M. (2006) Intuitions In Linguistics. The British Journal for the Philosophy of Science. 57(3), pp. 481–513 [16]Doddington, G. (2002) Automatic Evaluation of Machine Translation Quality Using N-gram Co-Occurrence Statistics. In Second International Conference on Human Language Technology Research, San Diego, CA. pp. 138–145. [17]Fitzgerald, G.(2010) Linguistic Intuitions. The British Journal for the Philosophy of Science. 61(1), pp. 123–160 [18]Harris, Z.S. (1954) Distributional Structure. Word, 10(2-3), 146-162. [19]Hinton, G.E. (1986) Learning distributed representations of concepts. In Proceedings of the Eighth Annual Conference of the Cognitive Science Society. Hillsdale, NJ: Erlbaum. [20]Hinton, G.E., Osindero, S., Teh, Y.-W. (2006) A fast learning algorithm for deep belief nets. Neural Computation, Volume 18, 2006, 2283-2292. [21]Hinton, G.E., Srivastava, N., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.R. (2012) Improving neural networks by preventing co-adaptation of feature detectors. arXiv preprint arXiv:1207.0580. [22]Hochreiter, S., Schmidhuber, J. (1997) Long Short-Term Memory. Neural Computation, 9(8), pp.1735–1780. oi:10.1162/neco. [23]Huang, P.-S., Wang, C., Huang, S., Zhou, D., Deng, L. (2018) Towards Neural Phrase-Based Machine Translation.arXiv preprint arXiv:1706.05565 [24]Jean, S., Cho, K., Memisevic, R., & Bengio, Y. (2015) On Using Very Large Target Vocabulary for Neural Machine Translation. In ACL-IJCNLP 2015 [25]Kaiser, L., Gomez, A.N., Chollet, F. (2017) Depthwise Separable Convolutions for Neural Machine Translation. arXiv preprint arXiv:1706.03059 [26]Kaiser, L., Gomez, A.N., Shazeer, N., Vaswani, A., Parmar, N., Jones, L., Uszkoreit, J. (2017) One Model To Learn Them All. arXiv preprint arXiv:1706.05137 [27]Kim, Y., Jernite, Y., Sontag, D., Rush, A.M. (2015) Character-Aware Neural Language Models. arXiv preprint arXiv preprint arXiv:1508.06615 [28]Koehn, P., Hoang, H., Birch, A., Callison-Burch, C., Federico, M., Bertoldi, N., ...Herbst, E. (2007) Moses: Open Source Toolkit for Statistical Machine Translation. Annual Meeting of the Association for Computational Linguistics (ACL), demonstration session, Prague, Czech Republic, June 2007. [29]Koehn, P., Och, F.J., Marcu, D. (2003) Statistical Phrase-Based Translation. In Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1. Stroudsburg, PA, USA: Association for Computational Linguistics. 2003. p. 48-54. (NAACL ''03). [30]Koehn, P. (2004) Pharaoh: A Beam Search Decoder for Phrase-Based Statistical Machine Translation Models. Conference of the Association for Machine Translation in the Americas, 115-124. [31]Krizhevsky, A., Sutskever, I., Hinton, G.E. (2012) ImageNet Classification with Deep Convolutional Neural Networks. Neural Information Processing Systems 25. [32]Kumar, A., Irsoy, O., Ondruska, P., Iyyer, M., Bradbury, J., Gulrajani, I., ...Socher, R. (2016) Ask Me Anything: Dynamic Memory Networks for Natural Language Processing. arXiv preprint arXiv:1506.07285. [33]Lample, G., Denoyer, L., Ranzato, M. (2017) Unsupervised Machine Translation Using Monolingual Corpora Only. arXiv preprint arXiv:1711.00043 [34]Lee, J., Cho, K., Hofmann, T. (2017) Fully Character-Level Neural Machine Translation without Explicit Segmentation. arXiv preprint arXiv:1610.03017. [35]Le, Q.V., Mikolov, T. (2014) Distributed Representations of Sentences and Documents. arXiv preprint arXiv:1405.4053. [36]Luong, M.-T., Pham, H., Manning, C.D. (2015) Effective Approaches to Attention-Based Neural Machine Translation. In Proceedings of the 2015 Conference on EMNLP. [37]Luong, M.-T., Sutskever, I., Le, Q.V., Vinyals, O., Zaremba, W. (2014) Addressing the Rare Word Problem in Neural Machine Translation. arXiv preprint arXiv:1410.8206 [38]Mikolov, T., Chen, K., Corrado, G., Dean, J. (2013) Efficient Estimation of Word Representations in Vector Space. arXiv preprint arXiv:1301.3781. [39]Mikolov, T., Sutskever, I., Chen, K., Corrado, G., Dean, J. (2013) Distributed Representations of Words and Phrases and their Compositionality. arXiv preprint arXiv:1310.4546 [40]Nallapati, R., Zhou, B., Santos, C.N.d., Gulcehre, C., Xiang, B. (2016) Abstractive Text Summarization using Sequence-to-sequence RNNs and Beyond. arXiv preprint arXiv:1602.06023 [41]Neubig, G. (2017) Neural Machine Translation and Sequence-to-sequence Models: A Tutorial. arXiv preprint arXiv:1703.01619 [42]Och, F.J., Ney, H. (2002) Discriminative Training and Maximum Entropy Models for Statistical Machine Translation. 40th Annual Meeting of the Association for Computational Linguistics, 295-302. [43]Och, F.J. (2003) Minimum Error Rate Training in Statistical Machine Translation. In Proceedings of the 41st Annual Meeting of Association for Computational Linguistics, pages 160-167, Sapporo, Japan, July. [44]Och, F.J., Ney, H. (2003). A systematic comparison of various statistical alignment models. Computational Linguistics, 29(1), 19-51. [45]Och, F.J., Ney, H. (2004) The Alignment Template Approach to Statistical Machine Translation. Computational Linguistics, 30(4):417-449. [46]Papineni, K., Roukos, S., Ward, T., Zhu, W.-J. (2002) BLEU: a Method for Automatic Evaluation of Machine Translation. In ACL 2002, pp. 311–318. [47]Perera, R., Nand, P. (2017) Recent Advances in Natural Language Generation: A Survey and Classification of the Empirical Literature. Computing and Informatics, 36(1), 1–32. Retrieved from http://www.cai.sk/ojs/index.php/cai/article/viewArticle/2017_1_1 [48]Peter, P.F., Cocke, J., Della Pietra, S., Pietra, D., Della Pietra, V., Jelinek, F., Lafferty, J., Mercer, R.L., Roossin, P.S. (1990) A Statistical Approach To Machine Translation. Computational Linguistics 16(2):79-85 [49]Reynolds, A.C. (1954): The conference on mechanical translation held at M.I.T., June 17-20, 1952. Mechanical Translation 1 (3), 47-55. [50]Rumelhart, D.E., Hinton, G.E., Williams, R.J. (1986) Learning representations by back-propagation errors. Nature, 323(6088), 533-536. [51]Rong, X. (2016) word2vec Parameter Learning Explained. arXiv preprint arXiv:1411.2738 [52]Saha, D., Bandyopadhyay, S. (2005) A Semantics-based English-Bengali EBMT System for translating News Headlines. In: Proceedings of the 10th Machine Translation Summit (Phuket, 12-16 December 2005), 125-133. [53]Sennrich, R., Haddow, B., Birch, A. (2016) Neural Machine Translation of Rare Words with Subword Units. arXiv preprint arXiv:1508.07909 [54]Shannon, C.E. (1948) A Mathematical Theory of Communication. The Bell System Technical Journal, 27. [55]Strassel, S., Przybocki, M., Peterson, K., Song, Z., Maeda, K. (2008) Linguistic Resources and Evaluation Techniques for Evaluation of Cross-Document Automatic Content Extraction. Proceedings of the 6th International Language Resources and Evaluation Conference (LREC-08), Marrakech, Morocco. [56]Sutskever, I., Vinyals, O., Le, Q.V. (2014) Sequence to Sequence Learning with Neural Networks. arXiv preprint arXiv:1409.3215. [57]Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., Salakhutdinov, R. (2014) Dropout: A Simple Way to Prevent Neural Networks from Overfitting. The Journal of Machine Learning Research, 15(1), 1929–1958. [58]Taskar, B., Lacoste-Julien, S., Klein, D. (2005) A Discriminative Matching Approach to Word Alignment. In Proceedings of HLT/EMNLP 2005, 73-80, Vancouver, British Columbia, Canada, October. [59]Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, L., Polosukhin, I. (2017) Attention Is All You Need. arXiv preprint arXiv:1706.03762v4 [60]Vinyals, O., Le, Q.V. (2015) A Neural Conversational Model. arXiv preprint arXiv:1506.05869 [61]Wu, Y., Schusterm, M., Chen, Z., Le, Q.V., Norouzi, M., Macherey, W., ...Dean, J. (2016) Google''s Neural Machine Translation System: Bridging the Gap between Human and Machine Translation. arXiv preprint arXiv:1609.08144 [62]Yann, L., Bengio, Y., Hinton, G. (2015) Deep learning. Nature 521.7553 (2015): 436–444. [63]Zaccone, G. (2016). Getting Started with TensorFlow. Birmingham, England: Packt Publishing Ltd. [64]鄭捷(2017)。NLP漢語自然語言處理原理與實踐。北京市:電子工業出版社
|