|
[1] D. Hakkani-Tur, G. Tur, and L. Heck. Research challenges and opportunities in mobile applications. Signal Processing Magazine, IEEE, 2011. [2] G. Hinton, Li Deng, Dong Yu, G.E. Dahl, A. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T.N. Sainath, and B. Kingsbury. Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups. Signal Processing Magazine, IEEE, 2012. [3] Ronan Collobert and Jason Weston. A unified architecture for natural language processing: deep neural networks with multitask learning. In Proceedings of the 25th international conference on Machine learning, ICML ''08. ACM, 2008. [4] G.E. Dahl, Dong Yu, Li Deng, and A. Acero. Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition. Audio, Speech, and Language Processing, IEEE Transactions on, 2012. [5] Tee Kiah Chia, Khe Chai Sim, Haizhou Li, and Hwee Tu Ng. Statistical lattice-based spoken document retrieval. ACM Trans. Inf. Syst., 2010. [6] Yi-Chen Pan, Hung-Yi Lee, and Lin-Shan Lee. Interactive spoken document retrieval with suggested key terms ranked by a markov decision process. Audio, Speech, and Language Processing, 2012. [7] S. E. Johnson, P. Jourlin, G.L. Moore, K.S. Jones, and P.C. Woodland. The cambridge university spoken document retrieval system. In Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on, 1999. [8] Diane J. Litman and Scott Silliman. Itspoke: an intelligent tutoring spoken dialogue system. In Demonstration Papers at HLT-NAACL 2004, 2004. [9] Stephanie Seneff and Joseph Polifroni. Dialogue management in the mercury flight reservation system. In Proceedings of the 2000 ANLP/NAACL Workshop on Conversational systems, 2000. [10] Teruhisa Misu and Tatsuya Kawahara. Bayes risk-based dialogue management for document retrieval system with speech interface. Speech Commun., January 2010. [11] Berlin Chen and Yi-Ting Chen. Extractive spoken document summarization for information retrieval. Pattern Recognition Letters, 2008. [12] Lin shan Lee and B. Chen. Spoken document understanding and organization. Signal Processing Magazine, IEEE, 2005. [13] Jerome R. Bellegarda. Statistical language model adaptation: review and perspectives. Speech Communication, 2004. [14] Aaron Heidel and Lin-Shan Lee. Robust topic inference for latent semantic language model adaptation. In Proc. on ASRU, 2007. [15] Tsung-Hsien Wen, Hung-Yi Lee, Tai-Yuan Chen, and Lin-Shan Lee. Personalized language modeling by crowd sourcing with social network data for voice access of cloud applications. In Proc. on IEEE SLT workshop, 2012. [16] Hung-Yi Lee, Tsung-Hsien Wen, and Lin-Shan Lee. Improved semantic retrieval of spoken content by language models enhanced with acoustic similarity graph. In SLT, 2012. [17] Yun-Nung Chen, Chia-Ping Chen, Hung-Yi Lee, Chun-An Chan, and Lin-Shan Lee. Improved spoken term detection with graph-based re-ranking in feature space. In Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on, pages 5644--5647. IEEE, 2011. [18] Hung-yi Lee and Lin-shan Lee. Integrating recognition and retrieval with user feedback: A new framework for spoken term detection. In Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on, pages 5290--5293. IEEE, 2010. [19] Tsung-Hsien Wen, Hung-Yi Lee, and Lin-Shan Lee. Interactive spoken content retrieval with different types of actions optimized by a markov decision process. In Interspeech, 2012. [20] Tsung-Hsien Wen, Hung-yi Lee, Pei-hao Su, and Lin-Shan Lee. Interactive spoken content retrieval by extended query model and continuous state space markov decision process. In ICASSP, 2013. [21] C. J. Leggetter and P. C. Woodland. Maximum likelihood linear regression for speaker adaptation of continuous density hidden markov models. Computer Speech and Language, 1995. [22] luc Gauvain Jean and Chin-Hui Lee. Maximum a posteriori estimation for multivariate gaussian mixture observations of markov chains. IEEE Transactions on Speech and Audio Processing, 1994. [23] P. C. Woodland. Speaker adaptation for continuous density hmms: A review. In Proc. on ITRW on Adaptation Methods for Speech Recognition, 2001. [24] Peter F. Brown, Peter V. deSouza, Robert L. Mercer, Vincent J. Della Pietra, and Jenifer C. Lai. Class-based n-gram models of natural language. Computational Linguistics, 1992. [25] William Gale. Good-Turing Smoothing Without Tears. Technical report, AT&T Bell Laboratories, 1994. [26] Frankie James. Modified kneser-ney smoothing of n-gram models modified kneserney smoothing of n-gram models. Technical report, 2000. [27] Georey Hinton. A Practical Guide to Training Restricted Boltzmann Machines. Technical report, 2010. [28] Yoshua Bengio, R ejean Ducharme, Pascal Vincent, and Christian Janvin. A neural probabilistic language model. Journal of Machine Learning Research, 2003. [29] Junho Park, Xunying Liu, Mark J. F. Gales, and Philip C. Woodland. Improved neural network based language modeling and adaptation. In Proc. on InterSpeech, 2010. [30] Hai-Son Le, Ilya Oparin, Alexandre Allauzen, Jean-Luc Gauvain, and Francois Yvon. Structured Output Layer neural network language model. In Proc. on ICASSP, 2011. [31] Tomas Mikolov, Martin Karafiat, Lukas Burget, Jan Cernocky, and Sanjeev Khudanpur. Recurrent neural network based language model. In Proc. on InterSpeech, 2010. [32] T. Mikolov, S. Kombrink, L. Burget, J.H. Cernocky, and S. Khudanpur. Extensions of recurrent neural network language model. In Proc. on ICASSP, 2011. [33] T. Mikolov and G. Zweig. Context dependent recurrent neural network language model. In Proc. on IEEE SLT workshop, 2012. [34] R.M. Iyer and M. Ostendorf. Modeling long distance dependence in language: topic mixtures versus dynamic cache models. IEEE Transactions on Speech and Audio Processing, 1999. [35] Aaron Heidel, Hung-An Chang, and Lin-Shan Lee. Language model adaptation using latent dirichlet allocation and an efficient topic inference algorithm. In Proc. on InterSpeech, 2007. [36] Marcello Federico. Efficient language model adaptation through mdi estimation. In Proc. on EuroSpeech, 1999. [37] Ciprian Chelba and Frederick Jelinek. Structured language modeling. Computer Speech and Language, 2000. [38] Hung yi Lee and Lin-Shan Lee. Enhanced spoken term detection using support vector machines and weighted pseudo examples. IEEE Transactions on Audio, Speech & Language Processing, 2013. [39] Hung yi Lee, Tsung wei Tu, Chia-Ping Chen, Chao-Yu Huang, and Lin-Shan Lee. Improved spoken term detection using support vector machines based on lattice context consistency. In ICASSP, 2011. [40] John Lafferty and Chengxiang Zhai. Document language models, query models, and risk minimization for information retrieval. SIGIR ''01. ACM, 2001. [41] Richard S. Sutton and Andrew G. Barto. Reinforcement learning: An introduction. Cambridge Journal, 1999. [42] Richard Bellman. Dynamic programming. 1957. [43] Stuart Dreyfus. Richard bellman on the birth of dynamic programming. Oper. Res., January 2002. [44] Anhai Doan, Raghu Ramakrishnan, and Alon Y. Halevy. Crowdsourcing systems on the world-wide web. Communications of the ACM, 2011. [45] Munro and Robert. Crowdsourcing and language studies: the new generation of linguistic data. In Proc. on NAACL, 2010. [46] Jingjing Liu, Scott Cyphers, Panupong Pasupat, Ian McGraw, and Jim Glass. A conversational movie search system based on conditional random field. In Proc. on InterSpeech, 2012. [47] Ian McGraw, Scott Cyphers, Panupong Pasupat, Jingjing Liu, and Jim Glass. Automating crowd-supervised learning for spoken language systems. In Proc. on InterSpeech, 2012. [48] Tsung-Hsien Wen, Aaron Heidel, Hung-Yi Lee, Yu Tsao, and Lin-Shan Lee. Recurrent neural network based language model personalization by social network crowdsourcing. In to be published. [49] John Paolillo. The virtual speech community: Social network and language variation on irc. Journal of Computer-Mediated Communication, 1999. [50] Devan Rosen and Margaret Corbit. Social network analysis in virtual environments. In Proc. on ACM Hypertext, 2009. [51] Thomas L. Griffiths and Mark Steyvers. Finding scientific topics. Proceedings of the National Academy of Sciences of the United States of America, 2004. [52] Hsu Bo-June and James Glass. Style and topic language model adaptation using hmm-lda. In Proc. on EMNLP, 2006. [53] Tam Yik-Cheung and Tanja Schultz. Unsupervised language model adaptation using latent semantic marginals. In Proc. on InterSpeech, 2006. [54] David M. Blei, Andrew Y. Ng, and Michael I. Jordan. Latent dirichlet allocation. J. Mach. Learn. Res., 2003. [55] Gregor Heinrich. Parameter estimation for text analysis. Technical report, 2004. [56] Ian Porteous, David Newman, Alexander Ihler, Arthur Asuncion, Padhraic Smyth, and Max Welling. Fast collapsed gibbs sampling for latent dirichlet allocation. In KDD. ACM, 2008. [57] Teh Yee Whye, David Newman, and Max Welling. A collapsed variational bayesian inference algorithm for latent dirichlet allocation. In NIPS, 2006. [58] Hsu Winston H., Lyndon S. Kennedy, and Chang Shih-Fu. Video search reranking through random walk over document-level context graph. In MULTIMEDIA. ACM, 2007. [59] Chen Yun-Nung, Chen Chia-Ping, Lee Hung-Yi, Chan Chun-An, and Lee Lin-Shan. Improved spoken term detection with graph-based re-ranking in feature space. In ICASSP, 2011. [60] Chen Yun-Nung, Yu Huang, Ching-Feng Yeh, and Lee Lin-Shan. Spoken lecture summarization by random walk over a graph constructed with automatically extracted key terms. In Interspeech, 2011. [61] Andreas Stolcke. Srilm - an extensible language modeling toolkit. In Proc. on Spoken Language Processing, 2002. [62] Steve J. Young, D. Kershaw, J. Odell, D. Ollason, V. Valtchev, and P. Woodland. The HTK Book Version 3.4. Cambridge University Press, 2006. [63] Mikael Bod’en. A guide to recurrent neural networks and backpropagation. 2002. [64] Yangyang Shi, Pascal Wiggers, and Catholijn M. Jonker. Towards recurrent neural networks language models with linguistic and contextual features. In Proc. on InterSpeech, 2012. [65] Yoshua Bengio, Patrice Simard, and Paolo Frasconi. Learning long-term dependencies with gradient descent is difficult, 1994. [66] Xunying Liu, Mark J. F. Gales, and Philip C. Woodland. Improving lvcsr system combination using neural network language model cross adaptation. In Proc. on InterSpeech, 2011. [67] Yehuda Koren, Robert Bell, and Chris Volinsky. Matrix factorization techniques for recommender systems. Computer, 2009. [68] Thomas K. Landauer, Peter W. Foltz, and Darrell Laham. An Introduction to Latent Semantic Analysis. Discourse Processes, 1998. [69] Keh-Jiann Chen and Shing-Huan Liu. Word identification for mandarin chinese sentences. In Proceedings of the 14th conference on Computational linguistics - Volume 1, COLING ''92. Association for Computational Linguistics, 1992. [70] Wei-Yun Ma and Keh-Jiann Chen. A bottom-up merging algorithm for chinese unknown word extraction. In Proceedings of the second SIGHAN workshop on Chinese language processing - Volume 17, SIGHAN ''03. Association for Computational Linguistics, 2003. [71] Tomas Mikolov, Stefan Kombrink, Anoop Deoras, Lukas Burget, and Jan Cernocky. Rnnlm - recurrent neural network language modeling toolkit. In Proc. on ASRU, 2011. [72] George A. Miller. Wordnet: A lexical database for english. Communications of the ACM, 1995. [73] Yi-Lun Yang. Plutalk: A spoken social network application with speaker adaptation and error correction. 2012. [74] L. Bahl, P. Brown, P.V. De Souza, and R. Mercer. Maximum mutual information estimation of hidden markov model parameters for speech recognition. In Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP ''86.,1986. [75] D. Povey and P.C. Woodland. Minimum phone error and i-smoothing for improved discriminative training. In Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on, 2002. [76] David Robins. Interactive information retrieval: Context and basic notions. Informing Science Journal, 2000. [77] Ian Ruthven. Interactive information retrieval. Annual Review of Information Science and Technology, 2008. [78] Teruhisa Misu and Tatsuya Kawahara. Speech-based interactive information guidance system using question-answering technique. In ICASSP, 2007. [79] Chengxiang Zhai and John Lafferty. Model-based feedback in the language modeling approach to information retrieval. CIKM ''01. ACM, 2001. [80] Tao Tao and ChengXiang Zhai. Regularized estimation of mixture models for robust pseudo-relevance feedback. In SIGIR''06, 2006. [81] Thomas Hofmann. Probabilistic latent semantic indexing. In ACM SIGIR, 1999. [82] Richard Bellman and Sherman Dreyfus. Functional approximation and dynamic programming. Mathematical Tables and Other Aids to Computation, 1959. [83] R emi Munos and Csaba Szepesv ari. Finite-time bounds for fitted value iteration. Journal of Machine Learning Research, 2008. [84] Csaba Szepesv ari and R emi Munos. Finite time bounds for sampling based fitted value iteration. ICML ''05. ACM, 2005. [85] Ben He and Iadh Ounis. Query performance prediction. Inf. Syst., 2006. [86] Ying Zhao, Falk Scholer, and Yohannes Tsegay. Effective pre-retrieval query performance prediction using similarity and variability evidence. In Advances in Information Retrieval. 2008. [87] Yun Zhou and W. Bruce Croft. Query performance prediction in web search environments. SIGIR ''07. ACM, 2007.
|