[1] M. Riley, W. Byrne, M. Finke, S. Khudanpur, and A. Ljolje, “Stochastic pronunciation modeling from hand-labelled phonetic corpora,” Speech Communication, Vol. 29, No. 2-4, pp. 209-224, 1999
[2] Y. Liu, and P. Fung, “State-Dependent Phonetic Tied Mixtures with Pronunciation Modeling for Spontaneous Speech Recognition,” IEEE Transactions on Speech and Audio Processing, Vol. 12, No.4, pp. 351-364, 2004
[3] Nanjo, H.; Kawahara, T., "Language model and speaking rate adaptation for spontaneous presentation speech recognition," Speech and Audio Processing, IEEE Transactions on , vol.12, no.4, pp. 391- 400, July 2004
[4] Måhl, Lena, “Speech recognition and adaptation experiments on children’s speech”, Master of Science thesis at the Department of Speech, Music and Hearing, KTH (The Royal Institute of Technology), 2004.
[5] K. Silverman, M. Beckman, J. Pitrelli, M. Ostendorf, C. Wightman, P. Price, J.
Pierrehumbert, and J. Hirschberg, “TOBI: A standard for Labeling English Prosody,” In Proc. of ICSLP, pp. 864-870, 1992.
[6] Aijun Li, “Chinese prosody and prosodic labeling of spontaneous speech,” in Proc. of Speech Prosody, pp. 39-46, 2002.
[7] Maekawa, K., H. Kikuchi, Y. Igarashi and J. Venditti. “X-JToBI: An extended J_ToBI for spontaneous speech,” in Proc. of ICSLP, pp. 1545-1548, 2002.
[8] M. Ostendorf, I. Shafran, S. Shattuck-Hufnagel, L. Carmichael, and W. Byrne, “A prosodically labeled database of spontaneous speech,” in Proc. of the ISCA Workshop on Prosody in Speech Recognition and Understanding , pp. 119-121, 2001.
[9] 江振宇,“非監督式中文語音韻律標記及韻律模式”,國立交通大學博士論文,民國九十八年三月。[10] 周裕倫,“中文自發性語音之韻律標記及韻律模式”,國立交通大學碩士論文,民國九十八年七月。[11] 曾淑娟, 劉怡芬, “現代漢語口語對話語料庫標註系統說明,” 中央研究院語言學研究所籌備處, September. 2002
[12] 李柏蒼,“自發性國語語音辨識”,國立交通大學碩士論文,民國九十六年八月。[13] The HTK Book (for HTK version 3.4)
[14] WaveSurfer Homepage:www.speech.kth.se/wavesurfer/
[15] 吳聲鋒,“使用於中文自發性語音辨認之聲學模式及韻律模式”,國立交通大學碩士論文,民國一零三年八月。[16] Z. Sheng, J.-H. Tao, and D.-L. Jiang, “Chinese prosodic phrasing with extended features,” Proceedings of the IEEE ICASSP , Vol. 1, pp. 492–495, 2003
[17] C.-Y. Tseng, S.-H. Pin, Y.-L. Lee, H.-M. Wang, and Y.-C. Chen, “Fluent speech prosody: Framework and modeling,” Speech Commun. special issue on quantitative prosody modeling for natural speech description and generation, 46, 284–309, 2005.
[18] C.Y. Tseng and Z.Y. Su, “Corpus approach to phonetic investigation - methods, quanitative evidence and findings of Mandarin speech prosody,” in Proc. of Oriental COCOSDA Workshop, pp. 123-138, 2006.
[19] S.H. Chen and Y.R. Wang, “Vector Quantization of Pitch Information in Mandarin Speech”, IEEE Transactions on Communications, Vol. 38, No. 9, pp. 1317-1320, 1990.
[20] D. Povey, A. Ghoshal, et al., "The Kaldi Speech Recognition Toolkit," in Proc. ASRU, 2011.
[21] Ghahremani, P., BabaAli, B., Provey, D., Riedhammer, K., Trmal, J. &;Khudanpur, S., “A Pitch Extraction Algorithm Tuned for Automatic Speech recognition”, in Proc ICASSP, Florence, 2014.
[22] S.C. Tseng, “Repairs in Mandarin Conversation,” Journal of Chinese Linguistics, Vol. 34, No.1, pp. 80-120, 2006.