[1]朱孝國(2005,10月)。「低成本高彈性的特定領域語音合成實驗」。2005開放源碼國際研討會,台北。
[2]吳銘鈞(2003)。「以音節為基礎之語者識別」。清華大學碩士論文,新竹。[3]李俊毅(2002)。「語音評分」。清華大學碩士論文,新竹。[4]邵芳雯(1994)。「國語歌曲之合成」。交通大學碩士論文,新竹。[5]洪朝貴、嚴春美、鄭爵儀(2002)。「利用音框技術的國語母音辨識」。2002開放源碼國際研討會,台北,第95-100頁。
[6]陳松琳(2002)。「以類神經網路為架構之語音辨識系統」。中山大學碩士論文,高雄。[7]葉怡成(2003)。「類神經網路模式應用與實作」。儒林圖書,第1-16頁。
[8]臺灣師大國音教材編輯委員會(2001)。「國音學」。正中書局,第1-30頁。
[9]蔣昇倫(1997)。「經電話通道之國語連續411音節辨認」。交通大學碩士論文,新竹。[10]蔣為文(2000)。「解構漢字的迷思」, http://www.de-han.org/hanji/chuliau/hanjibesu.htm.
[11]Alan Black , Lenzo, K. (2000), “Building Voices in the Festival Speech Synthesis System, ” DRAFT (updated2003)
[12]Alan Black. , Lenzo, K. (2000), “Limited Domain Synthesis,” ICSLP2000, Beijing, China.
[13]Alan Black. , Lenzo, K. (2004), “Multilingual Text-to-Speech Synthesis,” ICASSP 2004, Montreal, Canada.
[14]Alan Black (1997), “Festival Speech Synthesis System,” http://www.speech.cs.cmu.edu/comp.speech/Section5/Synth/festival.html
[15]Thierry Dutoit (1999), “A Short Introduction to Text-to-Speech Synthesis,”
[16]H/Mariam, S., Kishore, S., Black, A., Kumar, R., , Sangal, R. (2004), “Unit Selection Voice for Amharic Using Festvox,” 5th ISCA Speech Synthesis Workshop, Pittsburgh, PA., pp. 103-107.
[17]B. H. Juang , L. Rabiner (1993), “Fundamentals of speech recognition,” Prentice Hall, pp. 97-117.
[18]K. R. Farrell, R. J. Mammone, , K. T. Assaleh (1994), “Speaker recognition using neural networks and conventional classifiers,” IEEE Trans. on Speech and AudionProcessing, Volume 2 , pp. 194-205.
[19]Langner B. ,Black A. (2004), “Creating A Database Of Speech In Noise For Unit Selection Synthesis,” 5th ISCA Speech Synthesis Workshop, Pittsburgh, PA., pp. 229-230.
[20]Sami Lemmetty (1999), “Review of Speech Synthesis Technology,” http://www.acoustics.hut.fi/~slemmett/dippa/.
[21]M. W. Macon, L. Jensen-Link, J. Oliverio, M. Clements , E. B. George (1993), “Discrete-Time Processing of Speech Signals,” Prentice Hall, pp. 236-250.
[22]E.S Morais , F. Violaro , P.A Barbosa (1998), “Prosodic speech modifications using pitch-synchronous time-frequency interpolation,” Telecommunications Symposium, 1998. ITS ''98 Proceedings. SBT/IEEE International, Volume 1 , pp. 225-230.
[23]N. Deshmukh, A. Ganapathiraju, J. Picone(1999), “Hierarchical seaarch fo large vocabulary conversational speech recognition,” IEEE Signal Processing Magazine, pp. 84-107.
[24]J. Oglesby, J. S. Mason (1990), “Optimization of neural models for speaker identification,” Proc. ICASSP, pp. 261-264.
[25]H. Valbret , E. Moulines , J.P. Tubach (1997), “Concatenation-based MIDI-to-singing voice synthesis,” 103rd Meeting of the Audio Engineering Society, New York.
[26]H. Valbret , E. Moulines , J.P. Tubach (1992), “Voice transformation using PSOLA technique,” Acoustics, Speech, and Signal Processing, 1992. ICASSP-92, pp. 145-148.
[27]G. Velius (1988), “Variants of cepstrum based speaker identify verification,” Proc. ICASSP, pp. 583-586.