臺灣博碩士論文加值系統

English |FB 專頁 |Mobile

免費會員登入| 註冊

功能切換導覽列

(216.73.216.110) 您好！臺灣時間：2025/09/26 07:12

字體大小：

:::

詳目顯示

第 1 筆 / 共 1 筆

/1頁

論文基本資料
摘要
外文摘要
目次
參考文獻
電子全文
紙本論文
QR Code

本論文永久網址:

研究生:

林正甫

研究生(外文):

Zhang-fu Lin

論文名稱:

使用ANN抖音參數模型之國語歌聲合成

論文名稱(外文):

Mandarin Singing Voice Synthesis Using ANN Vibrato Parameter Models

指導教授:

古鴻炎

指導教授(外文):

Hong-yan Gu

學位類別:

碩士

校院名稱:

國立臺灣科技大學

系所名稱:

資訊工程系

學門:

工程學門

學類:

電資工程學類

論文種類:

學術論文

論文出版年:

2007

畢業學年度:

語文別:

中文

論文頁數:

中文關鍵詞:

國語歌聲合成、抖音

外文關鍵詞:

mandarin singing synthesis、vibrato

相關次數:

被引用:9
點閱:269
評分:
下載:44
書目收藏:0

本論文針對歌聲表情的一個重要因素“抖音”，研究以短時傅利葉轉換和解析信號之方法來對歌聲音節作分析，而求得抖音參數。此外我們也將這個分析方法，應用於求取波形包絡的振動參數。求得各個音節的抖音和振動參數之後，再拿去訓練各項參數分別的類神經網路(artificial neural network, ANN)模型，之後依據所建造的ANN模型的輸出，再配合滿度、下拍點等規則，去控制諧波加噪音信號模型 (HNM)作歌聲信號的合成。經由主觀的自然度聽測實驗，所得的評分顯示，同時使用抖音和振動參數合成出的歌聲信號，的確可以比原始使用HNM合成出的歌聲信號有顯著的改進。

In this thesis, analysis and synthesis of vibrato, an important factor of singing expression, are focused. We analyze the vibrato parameters of a singing syllable by using short-time Fourier transform and the method of analytic signal. In addition, we apply the same procedure to analyze the vibrating parameters from a syllable’s waveform envelope curve. When the parameter values of vibrato and amplitude vibrating are obtained for each singing syllable, they are used to train an artificial neural network (ANN) based model for each different parameter type. Then, these ANN models are used to generate the vibrato and vibrating parameters. Next, these parameters and other relevant music parameters are used together to control a harmonic-plus-noise (HNM) model to synthesize singing voice signals. With the synthetic singing voices, subjective perception tests are conducted. The result show that the singing signal synthesized with the control of vibrato and vibrating parameters is indeed apparently better than the singing signal synthesized without such controls.

摘要 I
Abstract II
致謝 III
目錄 IV
第1章緒論 1
1.1 研究動機及目的 1
1.2 歌聲合成研究之回顧 2
1.3 研究方法 4
1.4 論文架構 5
第2章信號之抖音參數分析 6
2.1 抖音參數分析前置作業 6
2.2 抖音參數求取方式回顧 7
2.3 基週峰谷法 7
2.4 瞬間頻率法 9
2.4.1 STFT分析 9
2.4.2 Analytic Signal分析 9
2.5 抖音參數分析之實驗與改進作法 11
2.5.1 基週峰谷法之實驗 11
2.5.2 瞬間頻率法之實驗 13
2.5.3 瞬間頻率分析-使用Analytic Signal 14
2.5.4 瞬間頻率分析-使用STFT 17
2.5.5 音位軌跡計算 19
2.5.6 時變抖音頻率、範圍測量 21
2.5.7 抖音參數取樣、儲存與正規化 22
2.6 波形包絡振動參數之分析 24
第3章類神經網路模型 28
3.1 類神經網路簡介 28
3.2 類神經網路結構 29
3.3 類神經網路輸出入參數 31
3.3.1 輸入參數 31
3.3.2 輸出參數 35
3.4 單元個數實驗 36
第4章國語歌聲合成 41
4.1 表情參數之產生 41
4.1.1 音高曲線的產生 41
4.1.2 波形包絡的產生 45
4.2 音樂性參數之決定 47
4.2.1 滿度處理 47
4.2.2 下拍點處理 48
4.2.3 轉音處理 49
4.2.4 歌聲音量處理 51
4.3 結合抖音與波形包絡HNM之歌聲合成系統 52
第5章實驗實驗與結論 55
5.1 歌聲合成系統 55
5.2 聽測評估 58
5.3 結論 60
參考文獻 63
作者簡介 67
附錄A 68
【訓練歌曲之歌詞】 68

[1]古鴻炎、陳安璿、廖皇量，「整合MIDI伴奏之國語歌聲合成系統」，WOCMAT 2005 電腦音樂與音訊技術研討會(台北)，Session B，2005。
[2]古鴻炎、廖皇量，「用於國語歌聲合成之諧波加噪音模型的改進研究」，WOCMAT 2006 國際電腦音樂與音訊技術研討會(台北)，session 2 (音訊處理I)，2006。
[3]G. Grindlay and D. Helmbold, “Modeling, analyzing, and synthesizing expressive piano performance with graphical models”, Springer Netherlands, Vol. 65, pp. 361-387, Dec. 2006.
[4]Seashore, C. E. “The vibrato”, in University of Iowa Studiesin the Psychology of Music(Univ. of Iowa, Iowa City), Vol. I., 1932.
[5]Sundberg, J. “Effects of the vibrato and the ‘singing formant’ on pitch”, Musica Slovaca VI, 1978, Bratislava, 51–69; also J. Res. Singing 5(2), 5–17. 1978.
[6]Horii, Y. “Acoustic analysis of vocal vibrato: a theoretical interpretation of data“, J. Voice 3, 36–43. 1989.
[7]Imaizumi, S., Saida, H., Shimura, Y., and Hirose, H. “Harmonic analysis of the singing voice:—Acoustic characteristics of vibrato“, in Proceedings of the Stockholm Music Acoustics Conference (SMAC93) Royal Swedish Academy of Music, Stockholm, pp. 197–200. 1994.
[8]Sundberg, J., Prame, E., and Iwarsson J. “Replicability and Accuracy of Pitch Patterns in Professional Singers“, in Vocal Fold Physiology, edited by P. J. Davis and N. H. Fletcher (Singular, San Diego), 1996.
[9]Shonle, J. I., and Horan, K. E. “The pitch of vibrato tones“, J. Acoust. Soc. Am. 67, 246–252. 1980.
[10]Brown, J. C., and Vaughn, K. V. “Pitch center of stringed instrument vibrato tones“, J. Acoust. Soc. Am. 100, 1728–1735. 1996.
[11]E. Prame“Vibrato extent and intonation in professional western lyric singing”, J. Acoust. Soc. Am., Vol. 102, pp. 616-621, 1997.
[12]I. Arroabarren, et al.,“Measurement of vibrato in lyric singers”, IEEE instrumentation and measurement technology conference, pp. 1529-1534, 2001.
[13]Yorum Meron and Keikichi Hirose, “Synthesis of Vibrato Singing”, Proceedings of the Acoustics, Speech, and Signal Processing on IEEE International Conference, 2000.
[14]Michael W. Macon, Leslie Jensen-Link, James Oliverio, Mark A. Clements and E. Bryan George, “Concatenation-based MIDI-to-Singing Voice Synthesis,” 103rd Meeting of the AES, Sept. 1997.
[15]Takeshi Saitou, Masashi Unoki, and Masato Akagi, “Extraction of F0 Dynamic Characteristics and Development of F0 Control Model in Singing Voice,” Proceedings of the 2002 International Conference on Auditory Display, Kyoto, Japan, July 2-5, 2002.
[16]周彥佐, 基於HNM之國語、閩南語的語音合成研究, 國立台灣科技大學資訊工程研究所碩士論文, 2007。
[17]E. Prame, “Measurements of the vibrato rate of ten singers”, J. Acoust. Soc. Am., Vol. 96, pp. 1979-1984, 1994.
[18]K. Kato, et al., “Blending vocal music with the sound field - the effective duration of autocorrelation function of western professional singing voices with different vowels and pitches”, International Symposium on Musical Acoustics (ISMA2004), Nara, Japan, 2004.
[19]B. Boashash,“Estimating and interpreting the instantaneous frequency of a signal, Part I: Fundamentals”, Proceedings of the IEEE, Vol. 80, pp. 519-538, April 1992.
[20]B. Boashash,“Estimating and interpreting the instantaneous frequency of a signal. Part 2: Algorithms and applications”, Proceedings of the IEEE, Vol. 80, pp. 539-568, April 1992.
[21]P. Howes, et al.,“The relationship between measured vibrato characteristics and perception in western operatic singing”, Journal of Voice, Vol. 18, pp. 216-230, 1997.
[22]J. Schoukens, R. Pintelon, and H. Van Hamme,“The interpolated fast fourier transform: A comparative study”, IEEE trans. Instrum. Meas., Vol. 41, pp. 226-232, April 1992.
[23]H. G. Feichtinger and T. Strohmer, Gabor analysis and algorithms theory and applications, Birkhauser, Boston, Dec. 1997.
[24]D. G. Long, “Comments on Hilbert transform based signal analysis”, Microwave Earth Sensing (MERS) Laboratory, Feb. 2004.
[25]M. Johansson, “The Hilbert transform”, Math. Dept., Växjö Universitet, Sweden, http:// w3.msi.vxu.se/exarb/
[26]古鴻炎、張小芬、吳俊欣，「仿趙氏音高尺度之基週軌跡正規化方法及其應用」，第十六屆自然語言與語音處理研討會(ROCLING XVI)，台北，第325-334頁, 2004。
[27]Hideo Suzuki, et al., “Instantaneous frequencies of signals obtained by the analytic signal method”, Acoust. Sci. & Tech, Vol. 27, pp. 163-170, 2006.
[28]T. Wakayama, et al., “Comparison of violin vibratos among four virtuost”, Proceedings of the International Symposium on Musical Acoustics (ISMA2004), Nara, Japan.
[29]C. Langton,“Hilbert transform, analytic signal and the complex envelope”,LoralSpaceSystems, http://www.complextoreal.com/tcomplex.htm.
[30]葉怡成, 類神經網路模式應用與實作, 儒林圖書公司, 2006。
[31]曹亦岑，使用小型語料類神經網路之國語語音合成韻律參數產生，國立台灣科技大學電機所。
[32]王如江，基於歌聲表情分析與單元選擇之國語歌聲合成研究，國立台灣科技大學資訊工程研究所碩士論文, 2007。

電子全文

國圖紙本論文

推文
網路書籤
推薦
評分
引用網址
轉寄

top

相關論文
相關期刊
熱門點閱論文

1.	使用小型語料類神經網路之國語語音合成韻律參數產生
2.	基於HMM模型之歌聲合成與音色轉換
3.	基於歌聲表情分析與單元選擇之國語歌聲合成研究
4.	結合HMM頻譜模型與ANN抖音模型之國語歌聲合成
5.	歌唱聲以及樂器聲合成改進之研究
6.	國語合成歌聲流暢度改進之研究
7.	整合音色變換之國語語音合成系統
8.	使用半音節單元挑選及HNM信號模型之國語歌聲合成
9.	用於名人語音合成之PCA與ANN為基礎的音色轉換方法
10.	對於歌唱聲合成器的聲音品質增進之研究

無相關期刊

1.	結合HMM頻譜模型與ANN抖音模型之國語歌聲合成
2.	基於HMM模型之歌聲合成與音色轉換
3.	具功率因數修正之自激式電源轉換器之研製
4.	行動商務採用之社會技術觀點研究
5.	第四代行動通訊網路的網路選擇機制
6.	使用頻譜演進模型之國語語音合成研究
7.	你有用過抖音嗎?從對抖音滿意的前因來了解如何影響持續使用之意願—一個雙因子中介模式的探討
8.	媒介投入感、使用動機、用戶參與對於社群使用者黏著度之影響–以抖音短視頻（抖音）為例
9.	用於單音人聲和複音音樂的抖音偵測
10.	應用文化元素之產品設計手法探討
11.	台灣技術學院學生英文閱讀動機、態度、策略運用和閱讀表現之研究
12.	風力發電機功率輸出模式之建立與應用
13.	輪型相撲機器人之設計與研製
14.	特定諧波規劃與消除之三相大電流產生器研製
15.	影響網路拍賣使用者忠誠意願因素之研究

簡易查詢 | 進階查詢 | 熱門排行 | 我的研究室