|
[1] Lawrence Rabiner and Bing-Hwang Juang, “Fundamentals of speech recognition”,Prentice Hall, 1993.
[2] D.A. Reynolds, “Speaker identification and verification using Gaussian mixture speaker models,” Speech Communication 17. pp.91-108 , March 1995
[3] A. P. Dempster, N. M. Laira and D. B. Rubin, “Maximum Likelihood fromIncomplete Data via the EM Algorithm,” Harvard University and Educational Testing Service, Dec. 1976.
[4] Biing-Hwang Juang, Wu Chou, and Chin-Hui Lee, ”Minimum Classification ErrorRate Methods for Speech Recognition,” IEEE Trans. On Speech and Audio Processing. Vol. 5, NO. 3, May 1997.
[5] W. Chou, B.H. Juang and C.H Lee, “Segmental GPD Training of HMM based Speech Recognizer,” In proceedings of ICASSP, IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol. 1, page(s): 473 -476, 1992.41
[6] del Alamo, C.M.; Caminero Gil, F.J.; dela Torre Munilla, C.; Hernandez Gomez, L.“Discriminative Training of GMM for Speaker Identification,” In proceedings ofICASSP, IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol. 1, page(s): 89 -92 , 1996.
[7] Li Lee, Richard Rose, “A Frequency Warping Approach to Speaker Normalization,” IEEE Trans. On Speech and Audio Processing. Vol. 6, NO.1,January 1998.
[8] Welling, L.; Kanthak, S.; Ney, H., “Improved Method For Vocal TractNormalization,” IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol.2, page(s): 761 –764, 1999.
[9] W.M. Fisher, G.R. Doddingdon “The DARPA Speech Recognition Research Database: Specifications And Status”, In Proc. DARPA Workshop Speech Recognition, Feb. 1986, pp93-99
[10]L.F. Lemal, J.L Cauvain “Cross-Lingual Experiments With Phone Recognition”, In Proc. Int. Conf. Acoustic Speech Signal Processing, 1993, pp507-510
[11]John R. Deller, John G. Prooakls, John H. Hansen, “Discrete-Time Processing Of Speech Signals”, Maxwell Macmillan international
[12] Y. Linde, A. Buzo & R. Gray, “An Algorithm For Vector Quantizer Design”, IEEE Transactions on Communications, Vol. 28, pp.84-95, 1980
[13]Moody. J, Slomka .S, Pelecanos. J, “On The Convergence Of Gasssain Mixture Models: Improvements Through Vector Quantization”, ICSLP98. [14]A. P. Dempster, N. M. Laird, “Maximum-Likelihood For Incomplete Data Via The EM Algorithm”, J. Royal Statist. Soc. SerB., pp39, 1977.
[13]L. Rabiner, B. H. Juang, “Fundamentals of Speech Recognition”, Prentice Hall Signal Processing Series, 1993.
[14]王小川,”語音訊號處理”,全華,民國93年.
[15]戴顯權,”資料壓縮”,紳藍,民國91年.
|