|
[1] I. Sutskever, O. Vinyals, Q. V. Le, “Sequence to sequence learning with neural networks.” Advances in neural information processing systems. 2014. [2] J. Gehring, M. Auli, D. Grangier, Y. N. Dauphin, “A convolutional encoder model for neural machine translation.” arXiv preprint arXiv:1611.02344 (2016). [3] Y. Cheng, et al. "Agreement-based joint training for bidirectional attention-based neural machine translation." arXiv preprint arXiv:1512.04650 (2015). [4] F. Rosenblatt, “The perceptron: A probabilistic model for information storage and organization in the brain,” Psychological Review, Vol 65(6), Nov 1958, 386-408. [5] R. O’Reilly, “Biologically Plausible Error-driven Learning using Local Activation Differences: The Generalized Recirculation Algorithm,” Neural Computation, 8:5, 895-938, 1996. [6] D. Rumelhart, G. Hinton, R. Williams, “Learning Internal Representations by Error Propagation” Technical rept., Mar-Sep, 1985. [7] 深度學習:使用激勵函數的目的、如何選擇激勵函數.[Online]. Available : http://mropengate.blogspot.tw/2017/02/deep-learning-role-of-activation.html, [Accessed: 10-Jun-2018]. [8] G. Hinton, S. Osindero, Y. Teh, “A Fast Learning Algorithm for Deep Belief Nets” Neural computation, Vol. 18, No. 7, Pages 1527-1554, 2006. [9] Y. Lecun, L. Bottou, Y. Bengio and P. Haffner, “Gradient-based learning applied to document recognition,” Proceedings of the IEEE, vol. 86, no. 11, pp. 2278-2324, Nov 1998. [10] C. Ding and D. Tao, “Robust face recognition via multimodal deep face representation,” IEEE Transactions on Multimedia, vol. 17, no. 11, pp. 2049-2058, 2015. [11] L. Pigou, S. Dieleman, P. Kindermans, and B. Schrauwen, “Sign language recognition using convolutional neural networks,” Workshop at the European Conference on Computer Vision, Springer International Publishing, 2014. [12] S. Ren, K. He, R. Girshick, J. Sun. “Faster r-cnn: Towards real-time object detection with region proposal networks.” Advances in neural information processing systems. 2015. [13] S. Sukittanon, A. Surendran, J. Platt, and C. Burges, “Convolutional networks for speech detection,” Interspeech, 2004. [14] Y. Kim, “Convolutional neural networks for sentence classification.” arXiv preprint arXiv:1408.5882 (2014). [15] C dos Santos, and M. Gatti, “Deep convolutional neural networks for sentiment analysis of short texts.” Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers. 2014. [16] X. Zhang, J. Zhao, and Y. LeCun, “Character-level convolutional networks for text classification.” Advances in neural information processing systems. 2015. [17] J. Gehring, M. Auli, D. Grangier, D. Yarats, Y. N. Dauphin, “Convolutional sequence to sequence learning.” arXiv preprint arXiv:1705.03122 (2017). [18] K. He, X. Zhang, S. Ren, J. Sun, “Deep residual learning for image recognition.” Proceedings of the IEEE conference on computer vision and pattern recognition. 2016. [19] G. Huang, Z. Liu, K. Q. Weinberger, L. van der Maaten, “Densely connected convolutional networks.” Proceedings of the IEEE conference on computer vision and pattern recognition. Vol. 1. No. 2. 2017. [20] Y. Wang, F. Tian, “Recurrent residual learning for sequence classification.” Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing. 2016. [21] F. Godin, J. Dambre, W. De Neve, “Improving Language Modeling using Densely Connected Recurrent Neural Networks.” arXiv preprint arXiv:1707.06130. 2017. [22] S. Ioffe, C. Szegedy, “Batch normalization: Accelerating deep network training by reducing internal covariate shift.” arXiv preprint arXiv:1502.03167 (2015). [23] K. Simonyan, Z. Andrew, “Very deep convolutional networks for large-scale image recognition.” arXiv preprint arXiv:1409.1556 (2014). [24] J. J. Hopfield, “Neural networks and physical systems with emergent collective computational abilities”, Proceedings of the National Academy of Sciences of the USA, vol. 79, no. 8,pp. 2554–2558, April 1982. [25] S. Hochreiter and J. Schmidhuber. “Long short-term memory”. Neural Computation, vol. 9, pp. 1735–1780, 1997. [26] Deep Learning in a Nutshell: Sequence Learning,[Online]. Available: https://devblogs.nvidia.com/parallelforall/deep-learning-nutshell-sequence-learning/. [Accessed: 10-Jun-2018]. [27] K. Cho, B. Merriënboer, C. Gulcehre, D. Bahdanau, F. Bougares, H. Schwenk, Y. Bengio, “Learning phrase representations using RNN encoder-decoder for statistical machine translation.” arXiv preprint arXiv:1406.1078 (2014). [28] J. Donahue, L. Anne Hendricks, S. Guadarrama, M. Rohrbach, S. Venugopalan, K. Saenko, T, Darrell, “Long-term recurrent convolutional networks for visual recognition and description.” Proceedings of the IEEE conference on computer vision and pattern recognition. 2015. [29] L. Pigou, A. Oord, S. Dieleman, M. Herreweghe, and J. Dambre, “Beyond Temporal Pooling: Recurrence and Temporal Convolutions for Gesture Recognition in Video,” arXiv preprint arXiv:1506.01911, 2015. [30] A. Graves, and J. Navdeep, “Towards end-to-end speech recognition with recurrent neural networks.” International Conference on Machine Learning. 2014. [31] P. Liu, X. Qiu, X. Huang, “Recurrent neural network for text classification with multi-task learning.” arXiv preprint arXiv:1605.05101 (2016). [32] L. Shang, Z. Lu, H. Li, “Neural responding machine for short-text conversation.” arXiv preprint arXiv:1503.02364 (2015). [33] R. Nallapati, B. Zhou, C. Gulcehre, B. Xiang, “Abstractive text summarization using sequence-to-sequence rnns and beyond.” arXiv preprint arXiv:1602.06023 (2016). [34] S. Lai, L. Xu, K. Liu, J. Zhao, “Recurrent Convolutional Neural Networks for Text Classification.” AAAI. Vol. 333. 2015. [35] D. Wang, and E. Nyberg. “A long short-term memory model for answer sentence selection in question answering.” Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers). Vol. 2. 2015. [36] A. Severyn, A. Moschitti. “Learning to rank short text pairs with convolutional deep neural networks.” Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 2015. [37] N. Kalchbrenner, P. Blunsom, “Recurrent continuous translation models.” Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing. 2013. [38] D. Bahdanau, K. Cho, Y. Bengio, “Neural machine translation by jointly learning to align and translate.” arXiv preprint arXiv:1409.0473 (2014). [39] K. Xu, J. Ba, R. Kiros, K. Cho, A. Courville, R. Salakhudinov, Y. Bengio, “Show, attend and tell: Neural image caption generation with visual attention.” International Conference on Machine Learning. 2015. [40] M. T. Luong, H. Pham, C. D. Manning, “Effective approaches to attention-based neural machine translation.” arXiv preprint arXiv:1508.04025 (2015). [41] Vaswani, Ashish, et al. “Attention is all you need.” Advances in Neural Information Processing Systems. 2017. [42] Z. Lin, M. Feng, C. N. Santos, M. Yu, B. Xiang, B. Zhou, Y. Bengio, “A structured self-attentive sentence embedding.” arXiv preprint arXiv:1703.03130 (2017). [43] R. Paulus, C. Xiong, and R. Socher, “A deep reinforced model for abstractive summarization.” arXiv preprint arXiv:1705.04304 (2017). [44] J. Cheng, L. Dong, and M. Lapata, Long short-term memory-networks for machine reading. In Conference on Empirical Methods in Natural Language Processing (EMNLP). Association for Computational Linguistics, 2016. [45] Y. Wu, M. Schuster, Z. Chen, Q. V. Le, M. Norouzi, W. Macherey, et al. “Google’s neural machine translation system: Bridging the gap between human and machine translation.” arXiv preprint arXiv:1609.08144, 2016. [46] O. Press, L. Wolf. “Using the output embedding to improve language models.” arXiv preprint arXiv:1608.05859(2016). [47] H. Inan, K. Khosravi, R. Socher, “Tying Word Vectors and Word Classifiers: A Loss Framework for Language Modeling.” ArXiv Preprint arXiv: 1611.01462. [48] K. Papineni, S. Roukos, T. Ward, W. J. Zhu, “BLEU: a method for automatic evaluation of machine translation.” In Proceedings of the 40th annual meeting on association for computational linguistics (pp. 311-318). Association for Computational Linguistics. 2002.
|