|
[1] Jari Turunen, Damjan Vlaj: "A Study of Speech Coding Parameters in Speech Recognition", Proc. EUROSPEECH 2001, pp. 2363-2366, 2001 [2] An-Tzyh Yu, Hsiao-Chuan Wang, “A Study on the Recognition of Low Bit-Rate Encoded Speech”, Proc. ICSLP 1998, pp. 38-41, 1998 [3] Euler, S. and Zinke, J. “The Influence of Speech Coding Algorithms on Automatic Speech Recognition”. ICASSP-94, Vol. 1, pp. 621-624. 1994. [4] Lilly, B. T. and Paliwal, K. K. "Effect of Speech Coders on Speech Recognition Performance". ICSLP-96, Vol 4, pp. 2344-2347. 1996. [5] J.M. Huerta and R.M. Stern, “Speech Recognition from GSM Coder Parameters", Proc. ICSLP-98, Vol 4, pp. 1463-1466, 1998 [6] Kim, H.K., and Cox, R. (2000), “Bitstream-based feature extraction for wireless speech recognition”, Proc. ICASSP 2000, Vol 3, pp. 1607 -1610, 2000 [7] Raj, B.; Migdal, J.; Singh, R., "Distributed Speech Recognition with Codec Parameters", IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), December 2001 (ASRU 2001) [8] Gallardo-Antolin, A., Diaz-de-Maria, F., and Valverde- Albacete, F., “Recognition from GSM Digital Speech”, Proc. ICSLP 1998, pp. 584-587, 1998 [9] M. Naito, S. Kuroiwa, T. Kato, T. Shimizu and N. Higuchi : "Rapid CODEC Adaptation for Cellular Phone Speech Recognition," Proc. of EUROSPEECH 2001, Vol. II, pp. 1099-1102, 2001 [10] M.G. Kuitert & L. Boves, “Speaker verification with GSM coded telephone speech”, Proc. EUROSPEECH 1997, Rhodes, Vol.2, pp. 975-978, 1997 [11] T.F. Quatieri, E. Singer, R.B. Dunn, D.A. Reynolds, J.P. Campbell, “Speaker and Language Recognition Using Speech Codec Parameters”, Proc. EUROSPEECH 1999, Vol.2, pp. 787-790, 1999 [12] Besacier, L., Grassi, S., Dufaux, A., Ansorge, M., Pellandini, F.,” GSM speech coding and speaker recognition”, ICASSP-00, Vol. 2, pp. 1085-1088, 2000. [13] Quatieri T.F., Dunn R.B., Reynolds D.A., Campbell J.P., Singer E., “Speaker Recognition using G.729 speech codec parameters”, Proc. ICASSP '00, Vol. 2, pp. 1089-1092, 2000 [14] C. Mokbel, L. Mauuary, L. Karray, D. Jouvet, J. Monne, J. Simonin,K. Bartkova, "Toward improving ASR robustness for PSN and GSM telephone applications," Speech Communication, vol. 23, no. 1, pp.141?59, Oct. 1997. [15] ETSI standard document, “Speech Processing, Transmission and Quality aspects (STQ); Distributed speech recognition; Front-end feature extraction algorithm; Compression algorithm”, ETSI ES 201 108 v1.1.2 (2000-04), April 2000 [16] P. Thevenaz & H. Hugli, "Usefulness of the LPC-Residue in text-independent speaker verification". Speech Communication, Vol. 17, pp. 145-157. 1995. [17] J. He, L. Liu, and G. Palm, "On the use of residual cepstrum in speech recognition," Proc. IEEE of ICASSP'96, Vol. 1, pp. 5-8, May, 1996, Atlanta,USA. [18] J. He, L. Liu, and G. Palm, "On the use of features from prediction residual signals in speaker identification," Proc. of EUROSPEECH'95, Vol. 1, pp. 313-316, Sept. 1995, Madrid, Spain [19] http://kbs.cs.tu-berlin.de/~jutta/toast.html [20] ETSI standard document, “European digital telecommunications system (Phase 2+), full rate speech transcoding (GSM 06.10 version 8.1.1 Release 1999), http://www.etsi.org [21] ITU-T Recommendation, G.729, “Coding of speech at 8 kbit/s using conjugate structure algebraic-code-excited linear-prediction (CS-ACELP)” [22] ITU-T Recommendation, G.729 Annex A, “Coding of speech at 8 kbit/s using conjugate structure algebraic-code-excited linear-prediction (CS-ACELP), Annex A: Reduced complexity 8 kbit/s CS-ACELP speech codec” [23] Douglas O’Shaughnessy, Speech Communications: Human and Machine, 2nd ed., IEEE Press, 2000 [24] Thomas F. Quatieri, Discrete-Time Speech Signal Processing: Principles and Practice, Prentice Hall, 2002 [25] D. A. Reynolds and R. C. Rose. “Robust Text-Independent Speaker Identification Using Gaussian Mixture Speaker Models”, IEEE Trans. on Speech and Audio Processing, 3(1):72 - 83, 1995. [26] Roland Auckenthaler , Eluned S Parris , Michael J Carey, “Improving a GMM Speaker Verification System by Phonetic Weighting”, ICASSP 1999, pp. 313-316, 1999 [27] Douglas A. Reynolds, “Speaker identification and verification using Gaussian mixture speaker models”, Speech Communication, 17, pp. 91-108, 1995 [28] Rosenberg, A. E., DeLong, J., Lee, C. H., Juang, B. H., and Soong, F. K., “The use of cohort normalized scores for speaker verification”.ICSLP-92, November 1992, pp. 599—602. [29] Rosenberg, A. E. and Parthasarathy, S., “Speaker background models for connected digit password speaker verification”, ICASSP 96, May 1996, pp. 81—84. [30] Deller J., Proakis J., Hansen J., Discrete-Time Processing of Speech Signals, McMillan Publishing Company, New York, 1993. [31] Rabiner, L. and Juang, B.-H., Fundamentals of Speech Recognition, Prentice Hall, Englewood Cliffs 1993. [32] S. B. Davis, P. Mermelstein, “Comparison of parametric representations of monosyllabic word recognition in continuously spoken sentences”, IEEE Trans. Acoust., Speech, Signal Processing, vol ASSP-28, pp. 357-366, Aug. 1980 [33] F. K. Soong und A. E. Rosenberg, “On the use of instantaneous and transitional spectral information in speaker recognition”, IEEE Trans. Acoustics, Speech and Signal Proc., vol. 1, ASSP-36, no. 6, pp. 871-879, 1988 [34] K. Sonmez, E. Shriberg, L. Heck & M. Weintraub, “Modeling Dynamic Prosodic Variation for Speaker Verification”, ICSLP-98, vol. 7, pp. 3189-3192, Sydney [35] Fukunaga, Keinosuke, Introduction to Statistical Pattern Recognition, 2nd ed., Academic Press, 1990 [36] http://www.nist.gov/speech/tests/spk/1999/spkrec99.html [37] D. A. Reynolds, "Comparison of Background Normalization Methods for Text-independent Speaker Verification”, Proc. EUROSPEECH 1997, pp 963-966.
|