|
[1] V. Zue, “Speech in Oxygen” Technical Report, Computer Science Lab., MIT, Cambridge, MA, USA, May 2001. [2] Y. Gong, “Speech Recognition in Noisy Environments: A Survey”, Speech Communication 16, 1995. [3] M.J.F. Gales, “Model-based Techniques for Noise Robust Speech Recognition”, University of Cambridge, Sep. 1995. [4] Boll, S. F, “Suppression of Acoustic Noise in Speech Using Spectral Subtraction”, IEEE Trans. on ASSP, Vol. 27, No. 2, pp.113-120.1979 [5] P. Lockwood and J. Boudy, “Experiments with a Nonlinear Spectral Subtractor (NSS), Hidden Markov Models and the Projection, for Robust Speech Recognition in Cars”, Eurospeech 1991. [6] ITU-T Recommendation G.729 — Annex B: A silence compression sceme for G. 729 optimized for terminals conforming to Recommendation V.70 [7] B.A. Mellor and A.P. Varga, “Noise Masking in the MFCC Domain for the Recognition of Speech in Background Noise”, ICASSP 1992. [8] Y. Ephraim and H.L. Van Trees, “A Signal Subspace Approach for Speech Enhancement”, IEEE Trans. on Speech and Audio Processing, 1995. [9] S. Furui, “Cepstral Analysis Technique for Automatic Speaker Verification”. IEEE Trans. Acoust. Speech Signal Process. 1981 [10] O. Viikki and K. Laurila, “Noise Robust HMM-based Speech Recognition Using Segmental Cepstral Feature Vector Normalization,” in ESCA NATO Workshop Robust Speech Recognition Unknown Communication Channels, Pont-a-Mousson, France, 1997, pp. 107—110. [11] H. Hermansky and N. Morgan, “RASTA Processing of Speech”. IEEE Trans. on Speech and Audio Processing. 2, pp. 578-589, 1994 [12] Kuo-Hwei Yuo and Hsiao-Chuan Wang, “Robust Features for Noisy Speech Recognition Based on Temporal Trajectory Filtering of Short-Time Autocorrelation Sequences”, Speech Communication 28, 1999. [13] J.W. Hung, J.L. Shen, L.S. Lee, “New Approaches for Domain Transformation and Parameter Combination for Improved Accuracy in Parallel Model Combination (PMC) Techniques”, IEEE Trans. on Speech and Audio Processing, Nov. 2001. [14] J.L. Gauiain and C.H.Lee, “Maximum a Posteriori Estimation for Multivariate Gaussian Mixture Observations of Markov Chains”, IEEE Trans. on Speech and Audio Processing, 1994. [15] C.J. Leggetter and P.C. Woodland, “Maximum Likelihood Linear Regression for Speaker Adaptation of Continuous Density Hidden Markov Models”, Computer Speech and Language, 1995. [16] John, R.Deller, John G.Proaskis, John H.L.Hansen,“Discrete-Time Processing of Speech Signals”. [17] Y. K. Muthusamy and R. A. Cole, “Automatic Segmentation and Identification of Ten Languages Using Telephone Speech,” in Proc. ICSLP ’92, vol. 2, Oct. 1992, pp.1007-1010 [18] C. Nadeu, D. Macho, and J. Hernando, “Time and frequency filtering of filter-bank energies for robust HMM speech recognition”, Speech Communication, 2001. [19] N. Kanedera, T. Arai, H. Hermansky, and M. Pavel “On The Importance of Various Modulation Frequencies for Speech Recognition,” Proc. Eurospeech ’97, Rhodes, Greece,pp. 1079 — 1082. [20] N. Kanedera, T. Arai, H. Hermansky, “Desired Characteristics of Modulation Spectrum for Robust Automatic Speech Recognition” ICASSP 1998 [21] R. Hariharan, I. Kiss, and O. Viikki, ”Noise robust speech parameterization using multiresolution feature extraction,” IEEE Trans. on Speech and Audio Processing, Nov.2001, pp856-865 [22] Sarel van Vuuren and H. Hermansky, “Data-Driven Design of RASTA-Like Filters”, ICSLP 1996. [23] J-W. Hung, L.S. Lee “Comparative Analysis for Data-Driven Temporal Filters Obtained Via Principal Component Analysis (PCA) and Linear Discriminant Analysis (LDA) In Speech Recognition”, Eurospeech 2001 [24] K. Fukunaga, “Introduction to statistical Pattern Recognition”, E.2nd, Academic Press, 1990 [25] B. Flury, “A First Course in Multivariate Statistics”, Springer, 1997 [26]B-H Juang, Wu Chou, and C-H Lee, “Minimum Classification Error Rate Methods for Speech Recognition,” IEEE Trans. on Speech and Audio Processing, Vol 5,No 3,May 1997 [27]J-W. Hung, L.S. Lee, “Data-Driven Temporal Filters for RobustFeatures in Speech Recognition Obtained Via Minimum Classification Error (MCE)”, ICASSP 2002 [28]Thomas M. Cover, Joy A. Thomas, Elements of Information Theory ,Wiley, NewYork, NY, 1997. [29] N-C Wang, J-W. Hung, and L.S. Lee, “Data-Driven Temporal Filters Based on Multi-Eigenvector for Robust Features in Speech Recognition”, ICASSP 2003
|