
National Digital Library of Theses and Dissertations in Taiwan


Detailed Record

Author: 朱嘉平
Author (English): Chia-Ping Chu
Title: 總體經驗模態分解及其平行化處理應用在強健性語音辨識
Title (English): Robust Speech Recognition by Ensemble Empirical Mode Decomposition and its Parallel Processing
Advisor: 潘欣泰
Advisor (English): Shing-Tai Pan
Degree: Master's
Institution: 國立高雄大學 (National University of Kaohsiung)
Department: 資訊工程學系碩士班 (Master's Program, Department of Computer Science and Information Engineering)
Discipline: Engineering
Field: Electrical and Computer Engineering
Thesis Type: Academic thesis
Year of Publication: 2012
Graduation Academic Year: 100 (2011–2012)
Language: Chinese
Number of Pages: 75
Keywords (Chinese): 平行運算、基因演算法、語音辨識、隱藏式馬可夫模型、總體經驗模態分解法
Keywords (English): Speech Recognition; Hidden Markov Model; Ensemble Empirical Mode Decomposition; Parallel Computing; Genetic Algorithms
Usage statistics: cited 0 times; viewed 281 times; downloaded 0 times; added to reading lists 0 times.
Abstract (translated from Chinese): The main goal of this thesis is to improve the noise robustness of speech signals and thereby raise the recognition rate under environmental noise. Ensemble Empirical Mode Decomposition (Ensemble EMD) is applied to decompose a noisy speech signal into a set of Intrinsic Mode Functions (IMFs); a real-coded genetic algorithm then searches for the optimal combination weights of these IMFs, and the speech signal is reconstructed from the weighted IMFs so that the influence of environmental noise on the recognition rate is minimized. In addition, to address the computational cost introduced by Ensemble EMD, this thesis proposes a parallelized implementation: on a multi-core architecture, the Ensemble EMD computation is parallelized with the parallel directives of the OpenMP library to raise its processing speed.
Abstract (English): The main purpose of this study is to improve the recognition rate of speech recognition systems operating under environmental noise. In our research, we use Ensemble Empirical Mode Decomposition (Ensemble EMD) to decompose noisy speech signals into several IMFs and then find the best weight for each IMF with a real-coded genetic algorithm. The speech signals are then recovered by summing the weighted IMFs, which reduces the effect of the noise. Since Ensemble EMD requires considerable computation time, a parallel algorithm for multi-core architectures is proposed to speed up its computation; the algorithm is implemented with parallel directives from the OpenMP library.
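
As a rough sketch of the reconstruction step described in the abstracts, the following C fragment rebuilds a de-noised speech frame as a weighted sum of the IMFs produced by Ensemble EMD, using per-IMF weights such as those found by the real-coded genetic algorithm (Section 3.4). The function name, array layout, and parameter list are illustrative assumptions, not code taken from the thesis.

/* Minimal sketch of the weighted-IMF reconstruction step: the de-noised
 * speech frame is rebuilt as a weighted sum of the IMFs obtained from
 * Ensemble EMD.  Function name and array layout are assumptions made
 * for illustration only. */
#include <stddef.h>

void reconstruct_from_imfs(const double *const *imf, /* imf[k][n]: sample n of IMF k */
                           const double *weight,     /* weight[k]: weight of IMF k   */
                           size_t num_imfs,
                           size_t num_samples,
                           double *out)              /* out[n]: reconstructed frame  */
{
    for (size_t n = 0; n < num_samples; ++n) {
        double sum = 0.0;
        for (size_t k = 0; k < num_imfs; ++k)
            sum += weight[k] * imf[k][n];  /* weighted sum over all IMFs */
        out[n] = sum;
    }
}

In a setup like this, the weight vector would serve as the chromosome optimized by the real-coded genetic algorithm; the fitness criterion and parameter encoding actually used are described in Section 3.4 of the thesis.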
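
The parallelization idea can be sketched in the same spirit: the ensemble trials of Ensemble EMD (each adding an independent white-noise realization and running EMD) are mutually independent, so they can be distributed across cores with an OpenMP parallel-for directive. In the C sketch below, add_white_noise() and emd() are hypothetical placeholders for the thesis' own routines, and NUM_TRIALS, NUM_IMFS, and FRAME_LEN are assumed values; only the #pragma omp directives are actual OpenMP usage.

/* Sketch of one way to parallelize the Ensemble EMD ensemble loop with OpenMP.
 * add_white_noise() and emd() stand in for the thesis' own routines and are
 * only declared here; NUM_TRIALS, NUM_IMFS, and FRAME_LEN are assumed values. */
#include <omp.h>
#include <string.h>

#define NUM_TRIALS 100   /* ensemble size (assumed)             */
#define NUM_IMFS     8   /* IMFs extracted per trial (assumed)  */
#define FRAME_LEN  256   /* samples per speech frame (assumed)  */

/* Placeholders for the noise-injection and EMD routines. */
void add_white_noise(const double *in, double *out, int len, unsigned seed);
void emd(const double *in, double imf[NUM_IMFS][FRAME_LEN], int num_imfs, int len);

void eemd_parallel(const double *frame, double avg_imf[NUM_IMFS][FRAME_LEN])
{
    memset(avg_imf, 0, sizeof(double) * NUM_IMFS * FRAME_LEN);

    /* Each trial adds an independent white-noise realization, runs EMD,
     * and contributes its IMFs to the ensemble average.  Trials do not
     * depend on each other, so the loop is split across the cores. */
    #pragma omp parallel for
    for (int t = 0; t < NUM_TRIALS; ++t) {
        double noisy[FRAME_LEN];
        double imf[NUM_IMFS][FRAME_LEN];

        add_white_noise(frame, noisy, FRAME_LEN, (unsigned)t);
        emd(noisy, imf, NUM_IMFS, FRAME_LEN);

        /* Serialize only the accumulation into the shared average. */
        #pragma omp critical
        for (int k = 0; k < NUM_IMFS; ++k)
            for (int n = 0; n < FRAME_LEN; ++n)
                avg_imf[k][n] += imf[k][n] / NUM_TRIALS;
    }
}

In this sketch only the accumulation into the shared average is serialized, so the per-trial EMD work, which dominates the cost, runs fully in parallel; the level at which the thesis actually partitions the work is described in Section 3.5, and the measured speed-ups appear in Section 5.4.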
Table of Contents
Chapter 1  Introduction ............................................... 1
  1.1  Research Motivation and Objectives ............................. 2
  1.2  Research Methods ............................................... 3
Chapter 2  Speech Signal Preprocessing ................................ 5
  2.1  Speech Frame Extraction ........................................ 6
  2.2  Pre-emphasis ................................................... 7
  2.3  Hamming Windowing .............................................. 7
  2.4  Fast Fourier Transform ......................................... 8
  2.5  MFCC Feature Computation ...................................... 10
    2.5.1  Mel Filter Bank ........................................... 10
    2.5.2  Logarithmic Transform ..................................... 12
    2.5.3  Discrete Cosine Transform ................................. 13
Chapter 3  Speech Signal Decomposition with Ensemble EMD ............. 14
  3.1  Instantaneous Frequency ....................................... 15
  3.2  Empirical Mode Decomposition .................................. 16
  3.3  Ensemble Empirical Mode Decomposition ......................... 22
  3.4  Mode Function Decomposition Combined with Genetic Algorithms .. 25
  3.5  Parallel Computation .......................................... 26
Chapter 4  Hidden Markov Models and HTK .............................. 31
  4.1  Hidden Markov Models .......................................... 31
  4.2  Continuous Hidden Markov Models ............................... 33
  4.3  The HTK Toolkit ............................................... 35
Chapter 5  Experimental Methods and Results .......................... 38
  5.1  Speech Corpus ................................................. 38
  5.2  Experimental Methods and Data ................................. 41
    5.2.1  Baseline Test Results ..................................... 42
    5.2.2  Test Results with EMD-Processed Speech .................... 43
    5.2.3  Test Results with Ensemble-EMD-Processed Speech ........... 45
  5.3  Analysis of Experimental Results .............................. 47
    5.3.1  Average Recognition Rate over SNR 0–20 dB ................. 47
    5.3.2  Analysis at Different SNR Levels in Different Test Environments ... 49
    5.3.3  Recognition Rate Improvement at Different SNR Levels ...... 53
  5.4  Parallel Speed-up Results ..................................... 58
Chapter 6  Conclusions and Future Work ............................... 62
  6.1  Conclusions ................................................... 62
  6.2  Future Work ................................................... 63
References ........................................................... 64