(3.220.231.235) 您好!臺灣時間:2021/03/08 06:05
字體大小: 字級放大   字級縮小   預設字形  
回查詢結果

詳目顯示:::

我願授權國圖
: 
twitterline
研究生:劉孆婷
論文名稱:應用麥克風陣列技術來提昇語音辨識
論文名稱(外文):Robust speech recognition using microphone arrays
指導教授:白明憲白明憲引用關係
學位類別:碩士
校院名稱:國立交通大學
系所名稱:機械工程學系
學門:工程學門
學類:機械工程學類
論文種類:學術論文
論文出版年:2010
畢業學年度:98
語文別:英文
論文頁數:56
中文關鍵詞:語音辨識麥克風陣列相位差
外文關鍵詞:speech recognitionmicrophone arraysphase difference
相關次數:
  • 被引用被引用:1
  • 點閱點閱:526
  • 評分評分:系統版面圖檔系統版面圖檔系統版面圖檔系統版面圖檔系統版面圖檔
  • 下載下載:59
  • 收藏至我的研究室書目清單書目收藏:1
本論文提供一能提昇語音辨識率的麥克風陣列。利用指向性比一般陣列高的超指向性麥克風陣列(其為端射陣列,endfire array),能夠達到減噪的效果,特別是當噪音在陣列背後的時候。評估表現的客觀參數有三種:指向性因子(directivity index)、前後比(front-to-back ratio)以及不變的波束寬(constant beam-width),利用將上述三種客觀參數最佳化,便能得到設計出超指向性麥克風陣列的濾波器。另一方面,如果噪音並不是在陣列背後的方向,相反地,是在靠近語音訊號的位置,則需使用另一種稱之為相位差評估(phase difference estimation)的演算法來解決這種問題,此方法能在不使語音訊號失真的情況下消除噪音。本研究發現在相位差評估內的ITD threshold 對於語音辨識提昇的效果扮演著重要的角色,因此必須要將其作一最佳化的設計,在此本研究是使用GSS(Golden Section Search)來將其最佳化。此外,音量亦會影響辨識率,因此
也必須要加以控制。如果目標訊號並不是在設計的主軸上,則必須要使用beam-steering 的技術將主波束轉置目標訊號的位置上。最後將會分析實驗結果,並驗證本研究所提出之演算法能夠使語音辨識率大幅提昇。
摘 要 i
ABSTRACT ii
誌謝 iii
TABLE OF CONTENTS iv
LIST OF TABLES vi
LIST OF FIGURES vii
I. INTRODUCTION 1
II. SUPER-DIRECTIVE MICROPHONE ARRAYS 4
A. First-order difference microphone array 4
1. First order adaptive DMA 6
B. Optimization of array beampattern 8
1. Maximum for directive index (MDI) 8
2. Maximum for front-to-back ratio (MFBR) 9
3. Maximum for constant beamwidth (MCBW) 10
C. Super-directive microphone array with equalizer 11
D. Simulated and experimental results 12
III. PHASE-DIFFERENCE ESTIMATION (PDE) 13
v
A. Optimization of the ITD threshold using GSS 14
1. Golden section search 15
2. The Optimal ITD threshold varying with the included angle 16
B. Volume scaling 18
C. Beam steering 18
D. Simulated and experimental results 19
IV. CONCLUSIONS 22
REFERENCES 23

1. Y. Gong, "Speech recognition in noisy environments: a survey", Speech Commun.
16(1995), 261-291.
2. J. Bitzer, K. U. Simmer and K. D. Kammeyer, "Multi-microphone noise reduction
techniques for hands-free speech recognition –a comparative study-," in Robust
Methods for Speech Recognition in Adverse Conditions (ROBUST99), 171–174,
Tampere, Finland, May 1999.
3. M. Cooke, P. Green, L. Josifovski, A. Vizinho, "Robust automatic speech
recognition with missing and unreliable acoustic data," Speech Commun.
34(2001), 267-285.
4. S. Srinivasan, N. Roman, D.L. Wang, "Binary and ratio time-frequency masks for
robust speech recognition," Speech Commun. 48(2006), 1486-1501.
5. R. M. Stern, E. Gouvea, C. Kim, K. Kumar, and H. Park, "Binaural and
multiple-microphone signal processing motivated by auditory perception", in
Hands-Free Speech Communication and Microphone Arrys, pages 98–103, May.
2008.
6. R. M. Stern and C. Trahiotis, "Models of binaural interaction," in Hearing, B. C. J.
Moore, Ed. Academic Press,2002, pp. 347–386.
7. H. Park, and R. M. Stern, "Spatial separation of speech signals using amplitude
estimation based on interaural comparisons of zero crossings," Speech
Communication, vol. 51, no. 1, pp. 15–25, Jan. 2009.
8. K.J. Palomaki, G.J. Brown, D.L. Wang, "A binaural processor for missing data
speech recognition in the presence of noise and small-room
reverberation," ,Speech Commun. 43(2004), 361-378.
9. N. Roman, D.L. Wang, G.J. Brown, "Speech segregation based on sound
localization," J. Acoust. Soc. Am. 114, 2236-2252, 2003.
10. M. Brandstein and D. Ward, Microphone arrays (Springer, New York, 2001).
11. S.L. Gay, J. Benesty, Acoustic signal processing for telecommunication, (Kluwer
Academic Publishers, 2000).
12. C. Kim, K. Kumar, B. Raj, and R. M. Stern, "Signal Separation for Robust Speech
Recognition Based on Phase Difference Information Obtained in the Frequency
Domain," in INTERSPEECH-2009, pages 2495–2498, Sept. 2009.
13. H. Teutsch, G.W. Elko, "First- and Second-order adaptive differential microphone
arrays," 2001.
14. H. Song, J. Liu, "First-Order Differential Microphone Array for Robust Speech
Enhancement," Language and Image Processing, 2008.
15. P. H. Rogers, A. L. V. Buren, "New approach to a constant beamwidth
transducer," J. Acoust See. Am. 64(1), July 1978.
16. W. Marshall Leach, Jr., Introduction to electroacoustics and audio amplifier
design (Kendall/Hunt publishing company,2003).
17. J.G.Wilpon, L.R.Rabiner, C.H.Lee, E.R.Goldmn, "Automatic recognition of
keyword in unconstrained speech using hidden Markov models," IEEE Trans.
ASSP. Nov 1990.
18. H. Ney, "The Use of a One-Stage Dynamic Programming Algorithm for
Connected Word Recognition," IEEE Trans. Acoustics, Speech, Signal Proc.,
vol.32, no2, pp.263-271, April 1984.
19. Chin-Hui Lee, Frank K. Soong and Kuldip K. Paliwal. "Automatic Speech and
Speaker Recognition," Kluwer Academic Publishers. 1995.
20. numerical recipes in C: the art of scientific computing, 2nd Edition, 1993.
21. J. Bergqvist and F. Rudolf, "A silicon condenser microphone using bond and
etch-back technology," Sensors and Actuators A, 45, 115-124 (1994).
22. C. Kim, R.M. Stern, K. Eom, J. Lee, "Automatic selection of thresholds for signal
separation algorithm based on interaural delay," 2010.
連結至畢業學校之論文網頁點我開啟連結
註: 此連結為研究生畢業學校所提供,不一定有電子全文可供下載,若連結有誤,請點選上方之〝勘誤回報〞功能,我們會盡快修正,謝謝!
QRCODE
 
 
 
 
 
                                                                                                                                                                                                                                                                                                                                                                                                               
第一頁 上一頁 下一頁 最後一頁 top
1. 洪允平,1988 ,「消費性貸款」,中小企銀季刊,30,p.46-47。
2. 林思惟,2006,銀行推動BaselII聯徵中心提供之服務與協助聯徵中心「個人信用評分」之應用:風險區隔與風險數量化,金融風險管理季刊,2006/2,2(2),p.99-110。
3. 李正福、王克陸、劉大安,2008,考量總體經濟環境之信用評等移轉矩陣:信用循環指標法及信用投資組合法之實證比較,臺大管理論叢,19(1),(2008/12) ,p.241-268
4. 陳宛伶,2008,探討「房貸評分卡」於房貸業務之運用,彰銀資料月刊,2008/5/31,57(5),p.4-24。
5. 劉展宏、張金鶚,2001,購屋貸款提前清償行為之研究,住宅學報第十卷第一期第29頁—49頁。
6. 梁德馨、黃高鴻,2007,小額信用貸款違約風險評分評等模型之建構─依循新巴賽爾資本協定零售型暴險內部評等法之規範,風險管理學報9(2),2007/7,p. 1-25。
7. 林宜村,2008,看美國次級房貸事件之影響-談台灣房市與房貸,今日合庫,第34卷,第一期,p63-81。
8. 莊瑞珠、陳穆貞,2007,金融機構住宅房屋貸款信用評分系統之建構研究,住宅學報15((2),民國九十五年十二月 學術論著,p.65-90。
9. 鍾經樊、黃嘉龍、黃博怡、謝有隆,2006,台灣地區企業信用評分系統的建置、驗證和比較中央研究院經濟研究所經濟論文,34(4),2006,p.541-590。
10. 陳業寧、王衍智、許鴻英,2004,「台灣企業財務危機之預測:信用評分法與選擇權評價法孰優?」,風險管理學報第六卷第二期,2004年7月,p.155-179。
 
系統版面圖檔 系統版面圖檔