|
[1] L.S. Lee, C.Y. Tseng, and M. Ouh-Young. The synthesis rules in a Chinese text-to-speech system. IEEE Transactions on Acoustics, Speech, and Signal Processing, 37(9):1309–1320, 1989.
[2] M.S. Liang, R.C. Yang, Y.C. Chiang, D.C. Lyu, and R.Y. Lyu. A Taiwanese Text-to-Speech System with Applications to Language Learning. In Proceedings of the IEEE International Conference on Advanced Learning Technologies, volume 1, pages 91–95. IEEE Computer Society, Washington, DC, USA, 2004.
[3] A.J. Hunt and A.W. Black. Unit selection in a concatenative speech synthesis system using a large speech database. In IEEE International Conference on Acoustics, Speech, and Signal Processing, 1996, volume 1, pages 373–376, 1996.
[4] 古鴻炎 and 楊仲捷. 基於VQ/HMM之國語語句基週軌跡產生之方法 [A method for generating pitch contours of Mandarin sentences based on VQ/HMM]. Master’s thesis, 國立台灣科技大學電機所 [Department of Electrical Engineering, National Taiwan University of Science and Technology], 1999.
[5] A.W. Black and K.A. Lenzo. Limited Domain Synthesis. In Proceedings of the Sixth International Conference on Spoken Language Processing. ISCA, 2000.
[6] S.J. Kim, J.J. Kim, and M. Hahn. HMM-based Korean speech synthesis system for hand-held devices. IEEE Transactions on Consumer Electronics, 52(4):1384–1390, 2006.
[7] S.H. Chen, S.H. Hwang, and Y.R. Wang. An RNN-based prosodic information synthesizer for Mandarin text-to-speech. IEEE Transactions on Speech and Audio Processing, 6(3):226–239, 1998.
[8] J. Tao, Y. Kang, and A. Li. Prosody conversion from neutral speech to emotional speech. IEEE Transactions on Audio, Speech and Language Processing, 14(4):1145–1154, 2006.
[9] M. Isogai and H. Mizuno. A New F0 Contour Control Method Based on Vector Representation of F0 Contour. In Sixth European Conference on Speech Communication and Technology. ISCA, 1999.
[10] D.T. Chappell and J.H.L. Hansen. Speaker-specific pitch contour modeling and modification. In IEEE International Conference on Acoustics, Speech, and Signal Processing, 1998, volume 2, pages 885–888, 1998.
[11] Z. Inanoglu. Transforming pitch in a voice conversion framework. Master’s thesis, St. Edmund’s College, University of Cambridge, 2003.
[12] T. Ceyssens, W. Verhelst, and P. Wambacq. On The Construction Of A Pitch Conversion System. In Proceedings of European Signal Processing Conference, volume I, pages 423–426, 2002.
[13] H. Kawahara, A. de Cheveigné, H. Banno, T. Takahashi, and T. Irino. Nearly Defect-Free F0 Trajectory Extraction for Expressive Speech Modifications Based on STRAIGHT. In Ninth European Conference on Speech Communication and Technology. ISCA, 2005.
[14] S.H. Pin, Y. Lee, Y. Chen, H. Wang, and C. Tseng. A Mandarin TTS system with an integrated prosodic model. 2004 International Symposium on Chinese Spoken Language Processing, pages 169–172, 2004.
[15] H. Tseng, P. Chang, G. Andrew, D. Jurafsky, and C. Manning. A Conditional Random Field Word Segmenter for Sighan Bakeoff 2005. In Proceedings of the Fourth SIGHAN Workshop on Chinese Language Processing, pages 168–171. Jeju Island, Korea, 2005.
[16] P.C. Chang, M. Galley, and C.D. Manning. Optimizing Chinese word segmentation for machine translation performance. In Proceedings of the Third Workshop on Statistical Machine Translation, pages 224–232, Columbus, Ohio, June 2008. Association for Computational Linguistics.
[17] N. Xue, F. Xia, F.-D. Chiou, and M. Palmer. The Penn Chinese TreeBank: Phrase structure annotation of a large corpus. Natural Language Engineering, 11(02):207–238, 2005.
[18] G. Monti and M. Sandler. Monophonic Transcription with Autocorrelation. In Proceedings of the Workshop on Digital Audio Effects (DAFx-00), volume 12, 2000.
[19] G.S. Ying, L.H. Jamieson, and C.D. Mitchell. A probabilistic approach to AMDF pitch detection. Proceedings of the Fourth International Conference on Spoken Language, 1996, 2:1201–1204, 1996.
[20] E. Moulines and F. Charpentier. Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones. Speech Communication, 9(5-6):453–467, 1990.
[21] H. Valbret, E. Moulines, J.P. Tubach, and T. Paris. Voice transformation using PSOLA technique. IEEE International Conference on Acoustics, Speech, and Signal Processing, 1992, 1:145–148, 1992.
[22] W.B. Kleijn, H. Yang, and E.F. Deprettere. Waveform Interpolation Coding With Pitch-Spaced Subbands. In Fifth International Conference on Spoken Language Processing. ISCA, 1998.
[23] M. Chu, Y. Wang, and L. He. Labeling stress in continuous Mandarin speech perceptually. In Proceedings of the 15th International Congress of Phonetic Sciences, pages 2095–2098, 2003.
[24] J.E. Hopcroft, R. Motwani, and J.D. Ullman. Introduction to Automata Theory, Languages, and Computation (3rd Edition). Addison-Wesley Longman Publishing Co., Inc., Boston, MA, USA, 2006. ISBN 0321455363.
[25] 黃奕欽 and 陳嘉平. 前後文無關文法於語音合成語料庫之應用 [Application of context-free grammars to speech synthesis corpora]. In Proceedings of the 25th Workshop on Combinatorial Mathematics and Computation Theory, pages 455–459, 2008.
[26] G.J. Lin, S.G. Chen, and T. Wu. High quality and low complexity pitch modification of acoustic signals. In IEEE International Conference on Acoustics, Speech, and Signal Processing, 1995, volume 5, pages 2987–2990, 1995.
[27] S. Lemmetty. Review of Speech Synthesis Technology. Master’s thesis, Helsinki University of Technology, 1999.
|