|
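I. Chinese-language references
張志豪 (2005). 強健性和鑑別力語音特徵擷取技術於大詞彙連續語音辨識之研究 [Robust and discriminative speech feature extraction techniques for large-vocabulary continuous speech recognition]. Graduate Institute of Computer Science and Information Engineering, National Taiwan Normal University, Taipei.
范育菖 (2007). 語音辨識在數位娛樂之應用與研究 [Application and study of speech recognition in digital entertainment]. Department of Information and Design, Asia University, Taichung.
劉鳳萍 (2008). 使用鑑別式語言模型於語音辨識結果重新排序 [Reranking speech recognition results with discriminative language models]. Graduate Institute of Computer Science and Information Engineering, National Taiwan Normal University, Taipei.
潘吉安 (2007). 強健性語音辨識中能量相關特徵之改良式正規化技術的研究 [A study of improved normalization techniques for energy-related features in robust speech recognition]. Graduate Institute of Electrical Engineering, National Chi Nan University, Nantou County.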
II. English-language references
Beth Logan (2000). "Mel Frequency Cepstral Coefficients for Music Modeling", Int. Symp. Music Information Retrieval (ISMIR).
David Huggins-Daines, Mohit Kumar, Arthur Chan, Alan W. Black, Mosur Ravishankar, Alex I. Rudnicky (2006). "PocketSphinx: A Free Real-Time Continuous Speech Recognition System for Hand-Held Devices", ICASSP 2006, pp. 185-188.
W. H. DeLone, E. R. McLean (2003). "The DeLone and McLean Model of Information Systems Success: A Ten-Year Update", Journal of Management Information Systems, Vol. 19, No. 4, pp. 9-30.
G. David Forney Jr. (1973). "The Viterbi Algorithm", Proceedings of the IEEE, Vol. 61, No. 3, pp. 268-278.
Hsin-Min Wang, Berlin Chen, Jen-Wei Kuo, Shih-Sian Cheng (2005). "MATBN: A Mandarin Chinese Broadcast News Corpus", Computational Linguistics and Chinese Language Processing, Vol. 10, No. 2, pp. 219-236.
Kai-Fu Lee, Hsiao-Wuen Hon, Raj Reddy (1990). "An Overview of the SPHINX Speech Recognition System", IEEE Trans. on Acoustics, Speech, and Signal Processing, Vol. 38, No. 1, pp. 35-45.
Lawrence R. Rabiner (1989). "A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition", Proceedings of the IEEE, Vol. 77, No. 2.
Peter F. Brown, Peter V. deSouza, Robert L. Mercer, Vincent J. Della Pietra, Jenifer C. Lai (1992). "Class-Based n-gram Models of Natural Language", Computational Linguistics, Vol. 18, No. 4, pp. 467-479.
Lawrence Rabiner, Biing-Hwang Juang (1993). "Fundamentals of Speech Recognition", Prentice Hall, ISBN 0-13-015157-2.
Steven B. Davis, Paul Mermelstein (1980). "Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences", IEEE Trans. on Acoustics, Speech, and Signal Processing, Vol. 28, No. 4, pp. 357-366.
Sadaoki Furui (1981). "Cepstral Analysis Technique for Automatic Speaker Verification", IEEE Trans. on Acoustics, Speech, and Signal Processing, Vol. 29, No. 2, pp. 254-272.
Willie Walker, Paul Lamere, Philip Kwok, Bhiksha Raj, Rita Singh, Evandro Gouvea, Peter Wolf, Joe Woelfel (2004). "Sphinx-4: A Flexible Open Source Framework for Speech Recognition", Sun Microsystems Inc., SMLI TR-2004-139.
Dbagnall (2012). "Basic concepts of speech", CMUSphinx Wiki, retrieved from http://cmusphinx.sourceforge.net/wiki/tutorialconcepts
|