|
[1] D. G. Stork, G. Wolff, and E. Levine, “Neural network lipreading system for improved speech recognition,” in Proc. Int. Joint Conf. Neural Networks, pp. 285—295, 1992. [2] Simon Haykin, ”Neural Networks”, Prentice Hall, pp. 156-255,pp. 466-473,1999. [3] Ram R. Rao, Tsuhan Chen, “Audio-to-Visual Conversion for Multimedia Communication,” IEEE Transactions on Industrial Electronics, Vol. 45, No.1, pp. 15-22, Feb. 1998. [4] Ram R. Rao, Tsuhan Chen, “Audio-to-Visual Integration in Multimedia Communication,” Proceedings of the IEEE, Vol. 86, No.5, pp. 837-852, May 1998. [5] Yao-Jen Chang, Chih-Chung Chen, Jen-Chung Chou, and Yung-Chang Chen, “Virtual Talk: A Model-Based Virtual Phone Using a Layered Audio-Visual Integration,” IEEE Multimedia and Expo, 2000. ICME 2000. 2000 IEEE International Conference on , Volume: 1 , 2000, pp. 415 -418 vol.1 [6] Fabio Lavagetto, “Time-Delay Neural Networks for Estimating Lip Movements From Speech Analysis: A Useful Tool in Audio-Video Synchronization,” IEEE Transactions on Circuits and systems for Video Technology, Vol. 7, No.5, pp. 786-800, 1997. [7] R. Rao, R. Mersereau, Tsuhan Chen, “Using HMM’s in Audio-toVisual Conversion,” IEEE 1997 First Workshop on Multimedia Signal Processing, pp. 19-24, 1997. [8] KyoungHo Choi, Jeng-Neng Hwang, “Baum-Welch Hidden Markov Model Inversion For Reliable Audio-to-Visual Conversion”, IEEE 3rd Workshop on Multimedia Signal Processing (MMSP99), pp. 175 —180, Copenhagen, Denmark, Sept. 13-15, 1999 [9]John R. Deller, Jr., John G. P roakis, John H. L. Hansen, “Discrete-Time Processing of Speech Singals,”chapter 3, Macmillan Publishing Company. [10]John R. Deller, Jr., John G. P roakis, John H. L. Hansen, “Discrete-Time Processing of Speech Singals,”chapter 5, Macmillan Publishing Company. [11]John R. Deller, Jr., John G. P roakis, John H. L. Hansen, “Discrete-Time Processing of Speech Singals,”chapter 6, Macmillan Publishing Company. [12]G. A. Carpenter,S.Grossberg,“Art2:Self-organization of stable category recognition codes for analog input patterns” Applied Optics,26(23):4919-4930, 1987. [13] G. A. Carpenter,S. Grossberg, “The art of adaptive pattern recognition by self-organizing neural network ” IEEE computer,21(3):77-88,March 1988.. [14]M.J.F. Gales, D. Pye, P.C. Woodland, “Variance Compensation within the MLLR framework for Robust Speech Recognition and Speaker Adaptation”, Fourth International Conference On Spoken Language Proceedings. Vol. 3 pp. 1832-1835, 1996. [15] Carey E. Priebe, “Adaptive Mixtures,” Journal of the American Statistical Association, Vol. 89, No. 427, Sep. 1994. [16]Chih-Chung Chen “Adaptation of Gaussian Mixture Model for Multi-user Audio to Visual Conversion” 國立清華大學碩士論文,June 2000. [17]Ru-Yu Yu “Frame Based Audio to Visual Conversion Using Line Spectrum Pairs” 國立清華大學碩士論文,June 2001.
|