|
[1] C. Zhan, W. Li and P. Ogunbona, “Face Recognition from Single Sample based on Human Face Perception,” International Conference Image and Vision Computing New Zealand, pp. 56-61, 2009. [2] http://www.apple.com/tw/ios/siri/ [3] https://cloud.google.com/speech/ [4] D. A. Reynolds, “An overview of Automatic Speaker Recognition Technology,” IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol. 4, pp. 4072-4075, 2002. [5] http://htk.eng.cam.ac.uk/ [6] http://kaldi-asr.org/doc/about.html [7] L. Rabiner and B. H. Juang, “Fundamentals of Speech Recognition,” in Prentice Hall, 1993. [8] https://en.wikipedia.org/wiki/Speaker_recognition [9] T. Stafylakis, M. J. Alam and P. Kenny, “Text-Dependent Speaker Recognition With Random Digit Strings,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, Vol. 24, Issue. 7, pp. 1194-1203, July 2016. [10] D. A. Reynolds and R. C. Rose, “Robust Text-Independent Speaker Identification Using Gaussian Mixture Speaker Models,” IEEE Transactions on Speech and Audio Processing, Vol. 3, Issue. 1, pp. 72-83, Jan 1995. [11] http://www.d-ear.com/ [12] http://www.playrobot.com/speech-recognition/88-arduino-chinese-voice-recognition-module.html [13] http://www.garmin.com.tw/m/buzz/tw/minisite/nuvi3790T/feature_02.htm [14] J. S. Lim and A. V. Oppenheim, “Enhancement and Bandwidth Compression of Noisy Speech,” Proceedings of the IEEE, Vol. 67, Issue. 12, pp. 1586-1604, Dec. 1979. [15] W. Zunjin and C. Zhigang, “Improved MFCC-based feature for robust speaker identification,” Tsinghua Science and Technology, Vol. 10, Issue. 2, pp. 158-161, April 2005. [16] N. Cristianini and J. Shawe-Taylor, “Support Vector Machines,” in Cambridge University Press, 2000. [17] B. H. Juang and T. Chen, “The Past, Present, and Future of Speech Processing,” IEEE Signal Processing Magazine, Vol. 15, Issue. 3, pp. 23-48, May 1998. [18] N. E. Huang et al, “On Instantaneous Frequency,” in World Scientific Publishing Company, pp. 177-229, 2009. [19] R. Vergin, D. O'Shaughnessy and A. Farhat, “Generalized Mel Frequency Coefficients for Large-Vocabulary Speaker-Independent Continuous-Speech Recognition,” IEEE Transactions on Speech and Audio Processing, Vol. 7, Issue. 5, pp. 525-532, Sep 1999. [20] D. O'Shaughnessy, “Speech Communications: Human and Machine,” Wiley-IEEE Press, 1999. [21] T. T. Soong, “Fundamentals of Probability and Statistics for Engineers,” Wiley, 2004. [22] X. Peng, X. Wang and B. Wang, “Speaker Clustering via Novel seudo-Divergence of Gaussian Mixture Models,” International Conference on Natural Language Processing and Knowledge Engineering, pp. 111-114, 2005. [23] http://www.datasciencelab.cn/clustering/gmm [24] L. Rabiner and B. H. Juang, “Fundamentals of Speech Recognition,” in Prentice Hall, pp. 215-219, 1993. [25] A. Bhattacharyya, “On a Measure of Divergence between Two Statistical Populations,” in Springer on behalf of the Indian Statistical Institute, pp. 99–109, 1943. [26] K. Rao and P. Yip, “Discrete Cosine Transform: Algorithms, Advantages, Applications,” in Academic Press, 1990. [27] A. Goel and A. Gupta, “Design of Satellite Payload Filter Emulator Using Hamming Window,” International Conference on Medical Imaging, m-Health and Emerging Communication Systems (MedCom), pp. 202-205, 2014. [28] J. O. Smith III, “Spectral Audio Signal Processing,” W3K Publishing, 2011. [29] H. C. Ravichandar and A. P. Dani, “Human Intention Inference Using Expectation-Maximization Algorithm With Online Model Learning,” IEEE Transactions on Automation Science and Engineering, Vol. PP, Issue. 99, December 2016. [30] http://www.sympy.org/en/index.html [31] https://www.python.org/ [32] https://www.scipy.org/ [33] https://www.hdfgroup.org/ [34] J. P. Openshaw, Z. P. Sun and J. S. Mason, “A Comparison of Composite Features under Degraded Speech in Speaker Recognition,” IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol. 2, pp. 371-374, 1993. [35] A. Chaudhari, A. Rahulkar and S. B. Dhonde, “Combining dynamic features with MFCC for text-independent speaker identification,” International Conference on Information Processing (ICIP), pp. 160-164, 2015. [36] http://www.oxfordlearnersdictionaries.com/us/about/pronunciation_english [37] http://isrc.ccs.asia.edu.tw/www/essay/essay7/essay7-008.htm
|