臺灣博碩士論文加值系統

English |FB 專頁 |Mobile

免費會員登入| 註冊

功能切換導覽列

(216.73.216.172) 您好！臺灣時間：2025/09/12 05:17

字體大小：

:::

詳目顯示

第 1 筆 / 共 1 筆

/1頁

論文基本資料
摘要
外文摘要
目次
參考文獻
電子全文
紙本論文
QR Code

本論文永久網址:

研究生:

古詩峰

論文名稱:

基於小波轉換特徵參數以及使用麥克風和電話語料之大量語者識別系統

論文名稱(外文):

A Large Population Speaker Identification System Based on Wavelet Transform Features by Using Microphone and Telephone Speech Corpus

指導教授:

呂仁園

指導教授(外文):

Ren-yuan Lyu

學位類別:

碩士

校院名稱:

長庚大學

系所名稱:

電機工程研究所

學門:

工程學門

學類:

電資工程學類

論文種類:

學術論文

論文出版年:

2003

畢業學年度:

語文別:

中文

論文頁數:

中文關鍵詞:

語者識別、語者辨識、小波轉換、小波特徵參數、高斯混合模型

外文關鍵詞:

speaker identification、speaker recognition、wavelet transform、wavelet feature、Gaussian mixture model

相關次數:

被引用:9
點閱:275
評分:
下載:24
書目收藏:2

本篇論文中，我們建立以離散小波轉換為基礎的特徵參數及使用高斯混合模型來代表每位語者的聲學特性。我們實驗所使用的語料分別為TIMIT麥克風語料及本實驗是自行收集的FSC-Tel 1000電話語料。在TIMIT及FSC-Tel 1000實驗中我們得到的最高辨識率分別為99.127%及95.5%。
在建立以離散小波轉換為基礎的特徵參數方面，主要是尋找適當的小波分解樹，並將經過小波分解樹後的小波係數轉換為合適的語者特徵參數，在我們的實驗中發現，不同的小波分解樹對語者識別系統會有很大的影響。在本篇論文中提出幾種有不錯效果的小波分解樹。不過我們尚未找出最佳的小波分解樹。
最後，我們在Windows 2000/XP實做了一個以小波轉換為特徵參數及結合高斯混合模型的語者識別系統。使用者只須錄音25秒即可完成語者登記，而在語者測試時，使用者只須輸入2秒的語音，系統即可辨識出語者身份。在整個系統中花費了大部分的時間在語者的特徵參數擷取，因此，我們往後必須尋找到可以加速特徵擷取的演算法。

In this thesis, we have established a large population speaker identification system which is based on Gaussian Mixture Models using wavelet based features. The corpus for experiments of this thesis are the well-known TIMIT phonetically balance speech database for microphone speech (TIMIT) and a new collected Formosa Speech Corpus for telephone speech (FSC-Tel 1000). We achieve the best speaker accuracy of 99.127% for TIMIT corpus and 95.5% for FSC-Tel 1000 corpus.
About wavelet based features, our objective is to found a suitable wavelet decomposing tree by which the speech waveform could be transformed to the wavelet coefficients. In our experiments, we found that different wavelet decomposing trees play an important role for the accuracy of a speaker identification system. In the thesis we have found several efficient types of wavelet decomposing trees. However the optimal trees have not yet been found.
We also implemented a speaker identification system on Windows 2000/XP in which a user only speaks a segment of speech of about 25 seconds for registering his voice in the system. In real-time testing, each speaker speaks only 2 seconds and then the system can correctly identify him with a high accuracy. In the system the feature extraction consumes the most time. For this reason we must found a good algorithm to speed up the feature extraction.

致謝................................................ v
目錄................................................ vii
圖目錄.............................................. x
表目錄.............................................. xiii
中文摘要............................................ xiv
英文摘要............................................ xv
第一章緒論........................................ 1
1.1 研究動機........................................ 1
1.2 語者辨識系統概論................................ 3
1.3 研究背景及目的.................................. 6
1.4 研究方法........................................ 8
1.5 章節大要........................................ 10
第二章語者辨認的基本技術.......................... 11
2.1 語音背景知識.................................... 11
2.2 特徵參數擷取..................... .............. 13
2.2.1 倒頻譜參數................................. 13
2.2.2 梅爾倒頻譜參數............................. 14
2.2.3 能量對數................................... 16
2.2.4 一階導數及二階導數......................... 16
2.3 高斯混合模型與語者辨識.......................... 17
2.3.1 高斯混合模型簡介........................... 17
2.3.2 模型描述................................... 19
2.3.3 模型參數初始化............................. 21
2.3.4 最佳相似性估測法........................... 24
2.3.5 期望值最大演算法........................... 25
2.3.6 語者識別器................................. 30
第三章小波轉換為基礎之特徵參數.................... 32
3.1 小波理論簡介.................................... 32
3.2 連續訊號小波轉換................................ 36
3.3 離散訊號小波轉換................................ 39
3.4 小波重建........................................ 42
3.5 小波特徵之設計.................................. 43
第四章實驗設計與實驗結果.......................... 48
4.1 系統架構與實驗設計.............................. 48
4.2 系統效能的評估.................................. 49
4.3 語音資料庫簡介.................................. 50
4.4 麥克風語者辨識實驗.............................. 52
4.5 電話語者辨識實驗................................ 58
第五章系統實作.................................... 64
5.1 語者識別系統架構................................ 64
5.2 語者識別系統模組................................ 67
5.3 系統展示........................................ 73
第六章結論與展望.................................. 78
6.1 結論............................................ 78
6.2 展望............................................ 79
參考文獻............................................ 81

【1】Dugelay, J.-L.; Junqua, J.-C.; Kotropoulos, C.; Kuhn, R.; Perronnin, F.; Pitas, I.,“Recent advances in biometric person authentication,” Acoustics, Speech, and Signal Processing, 2002. Proceedings. (ICASSP ''02). IEEE International Conference on , Volume: 4 , 2002.
【2】Campbell, J.P., Jr., “Speaker recognition: a tutorial,”Proceedings of the IEEE , Volume: 85 Issue: 9 , Sep 1997.
【3】Reynolds, D.A., “An overview of automatic speaker recognition technology,” Acoustics, Speech, and Signal Processing, 2002. Proceedings. (ICASSP ''02). IEEE International Conference on , Volume: 4 , 2002.
【4】Webb, J.J.; Rissanen, E.L., “Speaker identification experiments using HMMs,” Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on , Volume: 2 , 27-30 Apr 1993.
【5】Reynolds, D.A., “Large population speaker identification using clean and telephone speech,”IEEE Signal Processing Letters , Volume: 2 Issue: 3 , Mar 1995.
【6】R. Sarikaya, B. Pellom, J.H.L. Hansen, "Wavelet Packet Transform Features with Application to Speaker Identification," NORSIG-98 IEEE Norsic Signal Processing Symposium, pp. 81-84, Vigso, Denmark, June 1998.
【7】Daniel J. Mashao and N. Tinyiko Baloyi, “Improvements in the speaker identification rate using feature-sets on a large population database,”Eurospeech 2001.
【8】鍾偉仁，“語者辨認與驗證之初步研究”，臺灣大學電信工程所碩士論文，民國八十九年
【9】范世明，“高斯混合模型在語者辨識與國語語音辨認之應用”，交通大學電信工程所碩士論文，民國九十年
【10】鄭順德，“不特定語句中量語者辨識系統之設計研究”，中山大學電機所碩士論文，民國九十一年
【11】Liu, C.-H.; Chen, O.T.-C., “A text-independent speaker identification system using PARCOR and AR model,”Circuits and Systems, 2002. MWSCAS-2002. The 2002 45th Midwest Symposium on , Volume: 3 , 2002.
【12】Siew Chan Woo; Chee Peng Lim; Osman, R.,“Development of a speaker recognition system using wavelets and artificial neural networks,”Intelligent Multimedia, Video and Speech Processing, 2001. Proceedings of 2001 International Symposium on , 2001.
【13】Torres, H.M.; Rufiner, H.L.,“Automatic speaker identification by means of Mel cepstrum, wavelets and wavelet packets,” Engineering in Medicine and Biology Society, 2000. Proceedings of the 22nd Annual International Conference of the IEEE , Volume: 2 , 2000.
【14】Hsieh, C.-T.; Lai, E.; Wang, Y.-C., “Robust speech features based on wavelet transform with application to speaker identification,” Vision, Image and Signal Processing, IEE Proceedings- , Volume: 149 Issue: 2 , Apr 2002.
【15】Bovbel, E.I.; Kheidorov, I.E.; Chaikou, Y.A.,“Wavelet-based speaker identification,”Digital Signal Processing, 2002. DSP 2002. 2002 14th International Conference on , Volume: 2 , 2002
【16】Reynolds, D.A.; Rose, R.C.,“Robust text-independent speaker identification using Gaussian mixture speaker models,”Speech and Audio Processing, IEEE Transactions on , Volume: 3 Issue: 1 , Jan 1995.
【17】G. McLachlan, Mixture Models. New York：Marcel Dekker, 1988.
【18】Jeff A. Blimes,“A Gentle Tutorial Of The EM Algorithm And Its Application To Parameter Estimation For Gaussian Mixture And Hidden Markov Models,”Interational Computer Science Institute Aprial 1998.
【19】Jialong_He，http://tiger.la.asu.edu/C_lib.htm
【20】Steven Young,“The HTK Book（Version 3.0）”,Cambridge University,2000.

電子全文

國圖紙本論文

推文
網路書籤
推薦
評分
引用網址
轉寄

top

相關論文
相關期刊
熱門點閱論文

1.	語者辨認與驗證之初步研究
2.	高斯混合模型在語者辨識與國語語音辨認之應用
3.	不特定語句中量語者辨識系統之設計研究
4.	強韌式語者辨識系統：從麥克風、市話到手機
5.	以DSPBase為架構的不特定語句即時語者辨識系統
6.	以支向機為基礎並結合特徵擷取之語者辨識系統
7.	基於線性預估頻譜估計的梅爾倒頻譜參數聲紋辨識演算法
8.	基於共振峰之自動化離線語者鑑定
9.	DSP-Based非特定語言關鍵詞檢索與辨識系統
10.	高斯混合模型之語者與情緒辨識系統
11.	動態更新語者模型及語者驗證
12.	修正式魏格納-韋立分佈應用於語者辨識
13.	語者辨識系統之研究
14.	應用SOM-PNN混合神經網路在語者識別
15.	語者/歌者識別

1.	[1]宋佩瑄，“纖維混凝土實務”,現代營建雜誌社(1991)

1.	一個基於MFCC的語者識別系統
2.	低複雜度心電圖訊號壓縮架構設計
3.	在多重協定標籤交換技術網路上有效率的多重路徑負載平衡通訊協定
4.	FPGA為基礎之微步進馬達模糊電流控制器設計
5.	影像伺服在機械手臂定位上的應用
6.	採用穩健旋律追蹤演算法及多語歌詞辨識引擎之整合式歌聲辨識系統
7.	正交分頻多工系統中變動頻率偏移之副載波間互擾的消除方法
8.	SOVA式渦輪碼解碼器之低延遲電路架構
9.	資料融合在多基地台無線定位的應用
10.	在無線無基礎網路中支援多重速率及距離之群播繞徑協定
11.	可攜式資料記錄器應用於步態分析
12.	用於多天線系統之適應性部分多級平行偵測技術
13.	利用Steiner樹之滑動操作解決超大型積體電路連線合成之效能最佳化設計
14.	模糊系統晶片的設計與應用
15.	應用模糊神經架構於三維臉型辨識

簡易查詢 | 進階查詢 | 熱門排行 | 我的研究室