|
1. A. Waibel, P. Geutner, and L. M. Tomokiyo, “Multilinguality in speech and spoken language systems,” in Proc. IEEE, vol. 88, pp. 1297 - 1313, 2000. 2.Y. Muthusamy, E. Barnard, and R. Cole, “Reviewing automatic language identi-fication,” Signal Processing Magazine, IEEE, vol. 11, pp. 33 - 41, Oct. 1994. 3.P. Yip and R. K. R., Discrete Cosine Transform: Algorithms, Advantages and Ap-plications. Norwell, MA: Academic, 1997. 4.P. Mermelstein, “Distance measures for speech recognition, psychological and instrumental,” in Pattern Recognition and Artificial Intelligence, pp. 374 - 388, 1976. 5.B. Gold and N. Morgan, Speech and Audio Signal Processing. John Wiley & Sons, Inc., 2000. 6.H. Hermansky, “Perceptual linear predictive (plp) analysis of speech,” in Journal of Acoustical Society of America, vol. 87, pp. 1738 - 1752, 1990. 7.L. Ferrer, H. Bratt, V. R. R. Gadde, S. Kajarekar, E. Shriberg, K. S. Andreas, and S. A. Venkataraman, “Modeling duration patterns for speaker recognition,” Eu-rospeech, pp. 2017 - 2020, 2003. 8. R. Tong, B. Ma, D. Zhu, H. Li, and E. S. Chng, “Integrating acoustic, prosodic and phonotactic features for spoken language identification,” in Acoustics, 51 Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE In-ternational Conference on, vol. 1, pp. I-205 - I-208, 14-19 May 2006. 9.D. Reynolds, W. Campbell, T. Gleason, C. Quillen, D. Sturim, P. Torres-Carrasquillo, and A. Adami, “The 2004 MIT Lincoln laboratory speaker recognition system,” in ICASSP''05, vol. 1, pp. 177 - 180, March 2005. 10. P. Boersma and D. Weenink, “Praat: doing phonetics by computer,” http://www.praat.org. 11. C.-Y. Lin and H.-C. Wang, Language identification using pitch contour in- for-mation in the ergodic markov model," in Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Con- ference on, vol. 1, pp. I-193 – I-196, 14-19 May 2006. 12.M. Rizvi, B. Akram, M. Anwar, M. Baig, and M. Sheikh, “Language identifica-tion from raw speech,” in Students Conference, ISCON ''02. Proceedings. IEEE, vol. 1, pp. 27 – 33 vol.1, 16-17 Aug. 2002. 13.M. Zissman, “Comparison of four approaches to automatic language identification of telephone speech," Speech and Audio Processing, IEEE Transactions on, vol. 4, p. 31, Jan 1996. 14.M. A. Zissman and E. Singer, “Automatic language identification of telephone speech message using phoneme recognition and n-gram modeling,” ICASSP''94, vol. 1, pp. 305 - 308, Apr. 1994. 15.B. Ma and H. Li, “A phonotactic-semantic paradigm for automatic spoken docu-ment classification,” SIGIR2005, pp. 369 - 376, Aug. 2005. 16.G. McLachlan and T. Krishnan, The EM algorithm and extensions. John Wiley & Sons, 1988. 17.P. A. Torres-Carrasquillo, E. Singer, M. A. Kohler, R. J. Greene, D. A. Reynolds, and J. R. J. Deller, “Approaches to language identification using Gaussian mixture models and shifted delta cepstral features," ICSLP, pp. 89 - 92, Sep 2002. 18.S. Chen and P. Gopalakrishnan, “Speaker, environment and channel change de-tection and clustering via the Bayesian information criterion," in DARPA Speech Recognition Workshop, 1998. 19.H. Akaike, “A new look at the statistical model identification," in Automatic Con-trol, IEEE Transactions on, vol. 19, pp. 716 - 723, 1974. 20.J. Rissanen, “Modeling by shortest data description,” in Automatica, vol. 14, pp. 465 - 471, 1978. 21.B. S. Everitt, The Cambridge Dictionary of Statistics. Cambridge University Press, 1 ed., October 1998. [22] P. D. GrÄunwald, The Minimum Description Length Principle. 2007. [23] V. Vapnik, The nature of statistical learning theory. Berlin: Springer-Verlag,1995. [24] H. Abdi, A neural network primer," in Journal of Biological Systems, vol. 2,1994. [25] B. V. Dasarathy, Nearest Neighbor (NN) Norms: NN Pattern Classi‾cation Techniques. Ieee Computer Society, 1991. [26] A. Oppenheim and R. Schafer, Discrete-Time Signal Processing. Upper Sad-dle River, NJ: Prentice-Hall, 2000. [27] R. M. Hegde, A. Murthy, Hema, and V. R. R. Gadde, Signi‾cance of the modi‾ed group delay feature in speech recognition," Audio, Speech and Lan-guage Processing, IEEE Transactions on, vol. 15, pp. 190{202, Jan. 2007. [28] J. A. Hartigan, Clustering Algorithms. Wiley, 1975. [29] R. Fletcher, Optimization in Practice. John Wiley, 1987. [30] R. E. Fan, P. H. Chen, and C. J. Lin, Working set selection using second order information for training support vector machines," in The Journal of Machine Learning Research, vol. 6, pp. 1889 { 1918, December 2005. [31] B. E. Boser, I. Guyon, and V. Vapnik, A training algorithm for optimal margin classi‾ers," in Computational Learning Theory, pp. 144{152, 1992. [32] B. SchÄolkopf, A. Smola, R. C. Williamson, and P. L. Bartlett, New support vector algorithms," in Neural Computation, vol. 12, pp. 1207{1245, 2000. [33] C. J. C. Burges, A tutorial on support vector machines for pattern recogni-tion," in Data Mining and Knowledge Discovery, pp. 121{167, 1998. [34] B. SchÄolkopf, K. Sung, C. J. C. Burges, F. Girosi, P. Niyogi, T. Poggio, and V. Vapnik, Comparing support vector machines with gaussian kernels to radial basis function classi‾ers," in Signal Processing, vol. 45, pp. 2758{2765,November 1997. [35] G. Wahba, Support vector machines, reproducing kernel hilbert spaces, and the randomized gacv," in Advances in Kernel Methods: Support Vector Learn-ing (B. SchÄoelkopf, C. J. C. Burges, and A. J. Smola, eds.), pp. 69{87, MIT Press, 1999. [36] V. Vapnik, Statistical learning Theory. Wiley, New York, 1998. [37] T. Hastie and R. Tibshirani, Classi‾cation by pairwise coupling," in Ad-vances in Neural Information Processing Systems, 1998. [38] B. Bielefeld, Language identi‾cation using shifted cepstrum," In 14th Annual Speech Research Symposium, 1994.
|