臺灣博碩士論文加值系統

English |FB 專頁 |Mobile

免費會員登入| 註冊

功能切換導覽列

(216.73.216.171) 您好！臺灣時間：2026/04/09 08:17

字體大小：

:::

詳目顯示

第 1 筆 / 共 1 筆

/1頁

論文基本資料
摘要
外文摘要
紙本論文
QR Code

本論文永久網址:

研究生:

魏禎德

研究生(外文):

Wei, Jen-Der

論文名稱:

建構於隱藏式馬可夫模型之語者辨識

論文名稱(外文):

HMM-Based Speaker Recognition

指導教授:

劉啟民

指導教授(外文):

Liu Chi-Min

學位類別:

碩士

校院名稱:

國立交通大學

系所名稱:

資訊工程學系

學門:

工程學門

學類:

電資工程學類

論文種類:

學術論文

論文出版年:

1998

畢業學年度:

語文別:

中文

論文頁數:

中文關鍵詞:

語者辨識、隱藏式馬可夫模型、語者判別、語者確認

外文關鍵詞:

Speaker Recognition、HMM、Speaker Identification、Speaker Verification

相關次數:

被引用:0
點閱:220
評分:
下載:0
書目收藏:1

摘要語者辨識是一種利用機
器自動辨別語者的過程，主要可以分為語者判別及語者確認兩種。在
一段語者的聲音訊號中，主要包含著兩種訊息，一是有關音素的特徵，另
一則是有關語者聲紋的特徵。在這篇論文中，主要探討此兩種訊息對語者
辨識的重要性。第一個部分，我們主要討論的主題是音素特徵在語者辨識
中的影響。在這個部份，我們對每一位語者分別建立了一個高斯模型以
及10個以數字為單位的隱藏式馬可夫模型。前者我們用來表示在語者辨識
時，不考慮音素特徵的情形；後者則代表同時考慮音素特徵及語者特徵時
的情形。我們對這兩類模型分別進行了有關混合數、訓練語句數、測試句
長度以及語者人數等的實驗。結果顯示後者的效果較好。以總混和數60為
例，前者在語者判別及語者確認的錯誤率分別為7.08% 及6.16%；後者則
為 6.69% 及5.86%。在第一部分的討論中，以同時考慮音素及語者兩
項特徵的結果較好。因此，在第二個部份中，我們以隱藏式馬可夫模型為
基礎，討論音素特徵在語者辨識中所應佔的比重，並提出四種不同的辨識
策略。在這些策略中，以組合兩類模型的方式可以達到最好的結果，其語
者判別及語者確認的錯誤率分別為 5.73%及 5.15%。其次則是利用與語者
無關的隱藏式馬可夫模型，抽取出具有辨別語者能力的音框進行辨識。這
種方式可以達到的語者判別及語者確認的錯誤率分別為 6.56% 及
5.78%。

Abstract Speaker recognition is
the process of automatically recognizingthe speaker on the basis
of information obtained from speech waves.It can be usually
divided into two subclasses: speaker identificationand speaker
verification. The speech signal contains both the phoneme and
the speakercharacteristics. While the former carries the
phoneme messages, thelater bring the information of the speaker.
This thesis considers thetwo characteristics on speaker
recognition. First, we discuss theeffects of phoneme
characteristics on speaker recognition. We constructone single
GMM and 10 digital HMMs for each speaker. The GMM is referredto
the condition of reducing the phoneme information, and the HMMs
areassociated to that of dealing with both the phoneme
information andspeaker characteristics. We exams the
performance of the two kinds ofmodels through various mixture
numbers, training data quantity, testingdata length, and the
speaker population size. With the total mixturesnumber equal to
60, the error rate (ER) of using GMMs is 7.08% in
speakeridentification, and equal error rate (EER) 6.16% in
speaker verificationsystem. While using the HMMs, we can reduce
the ER to 6.69% in speakeridentification, and the EER to 5.86%
in speaker verification. Because that the consideration of
phoneme and speaker characteristicsresult in better performance,
we provide four schemes for speakerrecognition based on the HMMs
in the second part. These four schemesconsider different
weights of the phoneme characteristics in speakerrecognition.
The best scheme is the model-combining method with the
compensationmodification. This method can lead to an ER 5.73%
in speaker identificationand an EER 5.15% in speaker
verification. The second one is theframe-refining method which
modify the reliability of each frame of aninput utterance.
Using this method can reduce the ER to 6.56% in
speakeridentification and the EER to 5.78% in speaker
verification.

國圖紙本論文

推文
網路書籤
推薦
評分
引用網址
轉寄

top

相關論文
相關期刊
熱門點閱論文

1.	語者辨認與驗證之初步研究
2.	語者辨識系統之研究
3.	基於小波轉換特徵參數以及使用麥克風和電話語料之大量語者識別系統
4.	語者辨識之研究
5.	語者/歌者識別
6.	基頻脈衝對語者鑑定影響的探討
7.	結構化語者模型之研究
8.	行動商務語音下單交易系統之身分驗證效能評估
9.	語音辨識應用於保全系統之密碼判別研究
10.	運用支持向量機在特定語句語者驗證之研究
11.	結合RFID與隱藏式馬可夫模型之即時辨識點名系統
12.	基於頻譜維度之語者辨識
13.	應用於咽喉微振動感測之語者辨識與其相關CMOS壓控振盪器設計
14.	基於模擬聲門來源波型之語者辨識系統與確認技術
15.	適合說話人辨認的強健性語音特徵參數

無相關期刊

1.	一般性的關鍵詞辨識及語句驗證系統
2.	具透明化及高效率機制之加密檔案系統
3.	具有高派發率之X86超純量微處理機解碼單元的設計
4.	具折疊功能之JavaBytecode的指令層次平行度分析
5.	利用虛擬實境技術進行個人電腦組裝訓練
6.	SSCOP協定於非同步傳輸網路UBR服務之速率方式流量控制方法
7.	以資訊流分析之程式庫解析工具的實作
8.	MPEG層次三之新位元分配方法
9.	SA-110微處理器設計功能驗證
10.	ATM通訊協定驗證
11.	具有高派發率之超純量微處理機的分離式指令分配單元
12.	一個ISO-9000標準認證作業程序之電腦輔助環境
13.	具有資料存取指令積極排序功能的X86資料存取單元
14.	有線電視系統中對於頻道保護的金匙分配方法
15.	網路聲訊標準G.723.1編解碼優化之研究

簡易查詢 | 進階查詢 | 熱門排行 | 我的研究室