|
[1] D. Huggins-Daines, M. Kumar, A. Chan, A. Black, M. Ravishankar, and A. Rudnicky, “Pocketsphinx: A free, real-time continuous speech recognition system for hand-held devices,” 2006. [2] 陳鴻彬,陳柏琳,林順喜,語音辨識及資訊檢索技術於數位典藏多媒體文物之 應用,第三屆數位典藏技術研討會,頁239-246。 [3] X. L. Aubert, “An overview of decoding techniques for large vocabulary continuous speech recognition,” Computer Speech and Language, vol. 16, 2002. [4] M. Mohri, F. Pereira, and M. Riley, “Speech recognition with weighted finite-state transducers,” Springer Handbook of Speech Processing., vol. 3, 2007. [5] M. Mohri, F. Pereira, and M. Riley, “Weighted finite-state transducers in speech recognition,” in ASR2000-Automatic Speech Recognition: Challenges for the new Millenium ISCA Tutorial and Research Workshop (ITRW), ISCA, 2000. [6] C. Allauzen, M. Mohri, M. Riley, and B. Roark, “A generalized construction of integrated speech recognition transducers,” in IEEE International Conference on Acoustics, Speech, and Signal Processing, 2004. Proceedings.(ICASSP’04), vol. 1, 2004. [7] I. Hetherington, “PocketSUMMIT: small-footprint continuous speech recognition,” in Proc. of INTERSPEECH, pp. 1465–1468, 2007. [8] C. H. Yu, “Large Vocabulary Continuous Mandarin Speech Recognition Using Finite-State Machine,” Master’s thesis, National Taiwan University, 2004. [9] L. Rabiner, “A tutorial on hidden Markov models and selected applications in speech recognition,” Proceedings of the IEEE, vol. 77, no. 2, pp. 257–286, 1989. [10] J. T. Huang, “Improved large vocabulary continuous mandarin speech recognition by prosody modeling,” Master’s thesis, National Taiwan University, 2006. [11] S. Young, N. Russell, and J. Thornton, Token passing: a simple conceptual model for connected speech recognition systems. University of Cambridge, Department of Engineering, 1989. [12] D. Jurafsky, J. Martin, A. Kehler, K. Vander Linden, and N. Ward, Speech and language processing: An introduction to natural language processing, computational linguistics, and speech recognition. MIT Press, 2000. [13] E. Matusov, S. Kanthak, and H. Ney, “On the integration of speech recognition and statistical machine translation,” in Ninth European Conference on Speech Communication and Technology, ISCA, 2005. [14] M. Mohri and M. Riley, “A weight pushing algorithm for large vocabulary speech recognition,” in Seventh European Conference on Speech Communication and Technology, ISCA, 2001. [15] M. Mohri, “Semiring Frameworks and Algorithms for Shortest-Distance Problem,” Journal of Automata, Languages and Combinatorics, vol. 7. [16] M. Mohri, “Generic Epsilon-Removal and Input Epsilon-Normalization Algorithms forWeighted Transducers,” International Journal of Foundations of Computer Science, vol. 13, no. 1, pp. 129–143, 2002. [17] “HTK Toolkit, http://htk.eng.cam.ac.uk/.” [18] C. Allauzen, M. Mohri, and B. Roark, “Generalized algorithms for constructing statistical language models,” in Proceedings of the 41st Annual Meeting on Association for Computational Linguistics-Volume 1, pp. 40–47, Association for Computational Linguistics Morristown, NJ, USA, 2003. [19] T. Takezawa, E. Sumita, F. Sugaya, H. Yamamoto, and S. Yamamoto, “Toward a broad-coverage bilingual corpus for speech translation of travel conversations in the real world,” in Proc. of the Third Int. Conf. on Language Resources and Evaluation (LREC), pp. 147–152, 2002. [20] A. Stolcke, “SRILM-an extensible language modeling toolkit,” in Seventh International Conference on Spoken Language Processing, ISCA, 2002. [21] E. Bocchieri and D. Blewett, “A decoder for LVCSR based on fixed-point arithmetic,” in 2006 IEEE International Conference on Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings, vol. 1, 2006. [22] T. Kohler, C. Fugen, S. St ‥ uker, and A. Waibel, “Rapid porting of ASR-systems to mobile devices,” in Ninth European Conference on Speech Communication and Technology, ISCA, 2005.
|