[1] M. Abe, S. Nakamura, K. Shikano, and H. Kuwabara, “Voice conversion through vector quantization,” Proc. IEEE ICASSP-88 Acoustics, Speech, and Signal Processing, vol. 1, pp. 655-658, 1988.
[2] Y. M. Cheng, D. O’Shaughnessy, and P. Mermelstein, “Statistical Recovery of Wideband Speech from Narrowband Speech,” IEEE Trans.
[3] Y. Stylianou, O. Cappe, and E. Moulines, “Continuous probabilistic transform for voice conversion,” IEEE Trans. Speech and Audio Processing, vol. 2, pp. 544-548, October 1994.
[4]李承龍, 「語音轉換及其在異常發聲矯正之應用」, 國立交通大學碩士論文,民國九十年。[5] R. J. McAulay and T. F. Quatieri, “Speech analysis-synthesis based on a sinusoidal representation,” IEEE Trans. Acoust., Speech and Signal Proc., ASSP-34(4):744-754, 1986.
[6] R. J. McAulay and T. F. Quatieri, “ Speech Transformations Based on a Sinusoidal Representation,” IEEE Trans. Acoustics, Speech, and Signal Processing, vol. ASSP-34, No.6, pp 1449-1464, December 1986.
[7] I.T. Jolliffe, “Principal Component Analysis, ” Springer-Verlag, 1986.
[8] Hallowell Davis and S. Richard Silverman, “Hearing and Deafness”, Edition, Holt, Rineheart and Winston.
[9] H. Sskoe and S. Chiba, “Dynamic programming optimization for spoken word recognition,” IEEE Trans. Acoustic, Speech, Signal Proc., ASSP-26(1): 43-49, February 1978.
[10] R. J. McAulay and T. F. Quatieri, “Pitch estimation and voicing detection based on a sinusoidal model,” Proc. IEEE Int. Conf. Acoust., Speech and Signal Proc., pages 249-252, Albuquerque, NM, Apr. 3-6 1990.
[11] Lawrence Rabiner, Biing-Hwang Juang, “Fundamentals of Speech Recognition”, Prentice Hall, New Jersey, 1993.
[12] R. J. McAulay and T. F. Quatieri. “Low rate speech coding based on a sinusoidal model,” In S. Furui and M. M. Sondhi, editors, Advances in Speech Signal Processing, chapter 6. Marcel Dekker, New York, 1992
[13] R. J. McAulay and T. F. Quatieri. “Computationally efficient sine-wane synthesis and its application to sinusoidal transform coding,” In Int. Conf. Acoust., Speech Proc., pages 370-373, 1988.
[14] Stephen A. Zahorian, Martin Rothenberg, “Principal-components analysis for low-redundancy encoding of speech spectra”, J. Acoust. Soc. Am. 69(3), March 1981.
[15] A. Dempster, N. Laird, and D. Rubin, “Maximum likelihood from incomplete data via the EM algorithm,” J. Royal Stat. Soc., vol. 39, pp. 1-38, 1977.
[16] S. M. Kay, Fundamentals of Statistical Signal Processing: Estimation Theory. Englewood Cliffs, NJ: Prentice-Hall, 1993.
[17] Douglas A. Reynolds, Richard C. Rose, “Robust Text-Independent Speaker Identification Using Gaussian Mixture Speaker Models”, IEEE Trans. Speech and Audio Processing, vol. 3, pp. 72-83, January 1995.
[18] L. R. Rabiner, J. G. Wilpon, B. H. Juang, “A segmental K-means training procedure for connected word recognition based on whole word reference patterns,” AT&T Technical Journal, Vol. 65, No. 3, pp. 21-31, May 1986.
[19]謝國平, “語言學概論”, 三民書局, pp71-80, 民85.
[20] Lawrence Rabiner, Biing-Hwang Juang, “Fundamentals of Speech Recognition”, p163-171, Prentice Hall, New Jersey, 1993.
[21] B. Massen and D. Provel, “The effect of segmental and suprasegmental corrections on the intelligibility of deaf speech,” J. Acoust. Soc. Am., 78, pp. 877-886, 1985.
[22] R. Nickerson, “Characteristics of the speech of deaf persons,” The Volta Review, 77, pp. 342-362, 1975.
[23] W. Kinsner, A. Langi, “Speech and image signal compression with wavelets”, IEEE Communications, Computers and Power in the Modern Environment, p368-375, 1993.
[24] S. Wang, A. Sekey, and A. Gersho. “An objective measure for predicting subjective quality of speech coders.” IEEE J. Select. Areas Commun., 10(5): 819-829, June 1992.