[1] Lawrence Rabiner and Biing-Hwang Juang, “Fundamentals of Speech Recognition,” Prentice Hall, 1993.
[2] 涂家章, “Mandarin Speech Recognition Using the MAT2000 Corpus,” Master's thesis, National Chiao Tung University, June 2000.
[3] D. A. Reynolds, “Speaker Identification and Verification Using Gaussian Mixture Speaker Models,” Speech Communication, Vol. 17, pp. 91-108, March 1995.
[4] A. P. Dempster, N. M. Laird, and D. B. Rubin, “Maximum Likelihood from Incomplete Data via the EM Algorithm,” Harvard University and Educational Testing Service, Dec. 1976.
[5] 鄭志民, “Speaker Identification Based on Gaussian Mixture Models,” Master's thesis, National Tsing Hua University, June 2000.
[6] Douglas A. Reynolds, “Robust Text-Independent Speaker Identification Using Gaussian Mixture Speaker Models,” IEEE Trans. on Speech and Audio Processing, Vol. 3, No. 1, January 1995.
[7] Biing-Hwang Juang, Wu Chou, and Chin-Hui Lee, “Minimum Classification Error Rate Methods for Speech Recognition,” IEEE Trans. on Speech and Audio Processing, Vol. 5, No. 3, May 1997.
[8] W. Chou, B. H. Juang, and C. H. Lee, “Segmental GPD Training of HMM Based Speech Recognizer,” in Proceedings of ICASSP, IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol. 1, pp. 473-476, 1992.
[9] C. M. del Alamo, F. J. Caminero Gil, C. de la Torre Munilla, and L. Hernandez Gomez, “Discriminative Training of GMM for Speaker Identification,” in Proceedings of ICASSP, IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol. 1, pp. 89-92, 1996.
[10] Li Lee and Richard Rose, “A Frequency Warping Approach to Speaker Normalization,” IEEE Trans. on Speech and Audio Processing, Vol. 6, No. 1, January 1998.
[11] L. Welling, S. Kanthak, and H. Ney, “Improved Methods for Vocal Tract Normalization,” IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol. 2, pp. 761-764, 1999.