|
[1] S. N. Koh, I. Y. Soon and C. K. Yeo, Noisy speech enhancement using discrete cosine transform," Speech Communication, vol. 24, no. 3, pp. 249-257, 1998. [2] J. Yeh and C. Chen, Noise-robust speech features based on cepstral time coefficients," Conference on Computational Linguistics and Speech Processing (RO- CLING 2009), pp. 31-38, 2009. [3] N. Kanedera, T. Arai, H. Hermansky and M. Pavel, On the relative importance of various components of the modulation spectrum for automatic speech recognition," Speech Communication, vol. 28, no. 1, pp. 43-55, 1999. [4] J. Hung and L. Lee, Optimization of temporal filters for constructing robust features in speech recognition," IEEE Transactions on Audio, Speech and Language Processing, vol. 14, no. 3, pp. 808-832, 2006. [5] H. Hermansky and N. Morgan, RASTA processing of speech," IEEE Transactions on Speech and Audio Processing, vol. 2, no. 4, pp. 578-589, 1994. [6] N. Kanedera, T. Arai, H. Hermansky and M. Pavel, On the importance of various modulation frequencies for speech recognition," European Conference on Speech Communication and Technology (EUROSPEECH), pp. 1079-1082, 1997. [7] Y. Hu and C. Loizou, Speech enhancement based on wavelet thresholding the multitaper spectrum," IEEE Transactions on Speech and Audio Processing, vol. 12, no. 1, pp. 59-67, 2004. [8] G. Doblinger, Computationally efficient speech enhancement by spectral minima tracking in sub-bands," European Conference on Speech Communication and Technology (EUROSPEECH), pp. 1513-1516, 1995. [9] S. Salahuddin, S. Z. Al Islam, M. K. Hasan and M. R. Khan, Soft thresholding for DCT speech enhancement," Electronics Letters, vol. 38, no. 24, pp. 1605-1607, 2002. [10] C. Kwong, W. Pang, H. Wu, K. Ho, Simple DCT-based speech coder for internet applications," IEEE International Conference on Communications, vol. 1, pp. 344- 348, 1999. [11] U. Guz, H. Gurkan and B. S. Yarman, A novel noise robust and low bit rate speech coding algorithm," International Symposium on Computer and Information Sciences (ISCIS 2009), pp. 471-474, 2009. [12] B. Milner and X. Shao, Low bit-rate feature vector compression using transform coding and non-uniform bit allocation," IEEE International Conference on Acous- tics, Speech and Signal Processing (ICASSP 2003), vol. 2, pp. 129-132, 2003. [13] M. Y. Azar and F. Razzazi, A DCT based nonlinear predictive coding for feature extraction in speech recognition systems," IEEE International Conference on Computational Intelligence for Measurement Systems and Applications (CIMSA 2008), pp. 19-22, 2008. [14] Q. Zhu and A. Alwan, An efficient and scalable 2D DCT-based feature coding scheme for remote speech recognition," IEEE International Conference on Acous- tics, Speech and Signal Processing (ICASSP 2001), vol. 1, pp. 113-116, 2001. [15] S. A. Zahorian, H. Hu, Z. Chen and J. Wu, Spectral and temporal modulation features for phonetic recognition," International Speech Communication Associa- tion (INTERSPEECH), pp. 1071-1074, 2009. [16] N. Ahmed, T. Natarajan and K. R. Rao, Discrete cosine transform," IEEE Trans- actions on Computers, vol. 23, no. 1, pp. 90-93, 1974. [17] H. Ding and I. Soon, An Adaptive time-shift analysis for DCT based speech enhancement," International Conference on Information, Communications and Sig- nal Processing (ICICS 2009), pp. 1-4, 2009. [18] M. T. Heideman, Computation of an odd-length DCT from a real-valued DFT of the same length," IEEE Transactions on Signal Processing, vol. 40, no. 1, pp. 54-61, 1992. [19] S. K. Mitra, Digital signal processing: a computer-based approach," McGraw-Hill Companies, Inc., 2006. [20] S. A. Khayam, The discrete cosine transform (DCT): theory and application," Technical Report WAVES-TR-ECE802.602, 2003. [21] G. Strang, The discrete cosine transform," SLAM Review, vol. 41, no. 1, pp. 135-147, 1999. [22] G. Aggarwal and D. Gajski, Exploring DCT implementations," Technical Report UCI-ICS-98-10, 1998. [23] J. F. Blinn, What's the deal with the DCT?," IEEE Computer Graphics and Applications, vol. 13, no. 4, pp. 78-83, 1993. [24] S. Furui, Speaker independent isolated word recognition using dynamic features of speech spectrum," IEEE Transations on Acoustics, Speech and Signal Processing, vol. 34, no. 1, pp. 52-59, 1986. [25] H. Hrmansky and P. Fousek, Multi-resolution RASTA filtering for TANDEMbased ASR," International Speech Communication Association (INTER- SPEECH), 2005. [26] ETSI standard doc., Speech Processing, transmission and quality aspects (STQ); distributed speech recognition; extended advanced front-end feature extraction algorithm; compression algorithms; back-end speech reconstruction algorithm," ETSI ES 202 212 Ver.1.1.2, 2005. [27] H. G. Hirsch and D. Pearce, The AURORA experimental framework for the performance evaluation of speech recognition systems under noisy conditions," International Conference on Spoken Language Processing (ICSLP 2000), 2000. [28] http://htk.eng.cam.ac.uk/
|