跳到主要內容

臺灣博碩士論文加值系統

(216.73.216.136) 您好!臺灣時間:2025/09/20 14:02
字體大小: 字級放大   字級縮小   預設字形  
回查詢結果 :::

詳目顯示

: 
twitterline
研究生:蔡秉昌
研究生(外文):Ping-Chang Tsai
論文名稱:利用麥克風陣列做戶外異常聲源之定位
論文名稱(外文):Using a Microphone Array to Detect the Locations of Unusual Sound Sources in an Outdoor Area
指導教授:石勝文石勝文引用關係
指導教授(外文):Sheng-Wen Shih
學位類別:碩士
校院名稱:國立暨南國際大學
系所名稱:資訊工程學系
學門:工程學門
學類:電資工程學類
論文種類:學術論文
論文出版年:2010
畢業學年度:98
語文別:中文
論文頁數:54
中文關鍵詞:聲源定位能量偵測延遲時間差波達方向
外文關鍵詞:Sound Sources LocalizationEnergy DetectionTime Delay of ArrivalDirection of Arrival
相關次數:
  • 被引用被引用:3
  • 點閱點閱:477
  • 評分評分:
  • 下載下載:77
  • 收藏至我的研究室書目清單書目收藏:1
本論文主要探討戶外場所的異常聲源定位研究。戶外聲源定位技術會因風向、環境雜訊以及溫度等因素影響其準確性。為了排除雜訊干擾以提昇定位準確率,且能拓展聲源定位技術用以偵測頻譜不完全重疊的多重聲源位置,本研究將利用兩訊號所含雜訊對其相位差之擾動,建立時間延遲量的機率模型,並根據此模型找出可能的聲源位置分佈。最後利用 mean-shift 演算法分群,過濾低權值的群後,得到最可能的聲源位置。在實驗上則利用四顆線性均勻分布的麥克風,來收集戶外聲源訊號。其單聲源訊號偵測的準確率在一般的情況下和目前最常使用且有高準確率的聲源定位方法Phase Transform (PHAT) 不相上下,但若針對戶外不同頻率的混音環境雜訊 (如蟲鳴鳥叫),此方法擁有比PHAT 較高的穩定度。另外在頻率差異性大的雙聲源訊號偵測上,也能有一定的準確率。
In this thesis, we study the unusual sound source localization problem in an outdoor area. In an outdoor environment, the accuracy of sound source localization will be influenced by the wind velocity, the air temperature, and background noises. This work aims to develop a sound source localization method which is accuracy and robust against noise and is able to localize multiple sound sources having non-overlapping spectrums in the frequency domain. The probability density function (PDF) of the time delay of arrival (TDOA) between two signals is derived based on the PDF of the phase angle between two signals. The PDF of the phase angle is derived from a noise model of the input signals. According to the derived probabilistic model, possible locations of the sound sources can be computed. The mean-shift algorithm is used to find clusters of possible locations. Too small clusters are discarded and the centers of the remaining clusters represent the estimated locations of the sound sources. To test the proposed method, a uniform linear microphone array consisting four microphones is constructed to collect sound signals in an outdoor area. The experimental results show that, in a strictly single sound source scenario, the accuracy of the proposed method is comparable to a very popular method known as the phase transform (PHAT) technique. Furthermore, when the background noises, such as the sounds from insects and/or birds, are not negligible, the proposed method outperforms the PHAT method. Additionally, the experimental results of estimating two simultaneous sound sources show that the proposed method also can achieve considerable stability for detecting/localizing multiple sound sources provided that the sound sources have non-overlapping spectrums.
致謝.................................................................................................................................. i
論文摘要........................................................................................................................... ii
Abstract ............................................................................................................................. iii
目錄................................................................................................................................. v
圖目錄............................................................................................................................. vi
表目錄............................................................................................................................. ix
第一章緒論...................................................................................................................... 1
1.1 研究動機與目的. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.2 文獻探討. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.2.1 聲音定位系統的類型. . . . . . . . . . . . . . . . . . . . . . . . 3
1.2.2 麥克風陣列. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
1.3 研究目標. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
1.4 論文架構. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
第二章聲音定位技術...................................................................................................... 9
2.1 系統流程. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
2.2 聲響偵測. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
2.3 TDOA 分析. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
2.4 最大概率TDOA . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
2.4.1 單聲源TDOA 概率. . . . . . . . . . . . . . . . . . . . . . . . . 14
2.4.2 多聲源TDOA 概率. . . . . . . . . . . . . . . . . . . . . . . . . 18
2.5 定位計算. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
2.5.1 TDOA 轉DOA . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
2.5.2 利用向量交點求出聲源位置. . . . . . . . . . . . . . . . . . . . 19
第三章實驗結果.............................................................................................................. 24
3.1 單一聲源定位. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
3.2 雙聲源定位. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
3.3 多聲源定位. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46
第四章結論與未來方向.................................................................................................. 50
4.1 結論. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50
4.2 未來方向. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50
參考文獻........................................................................................................................ 52
[1] 曾政傑, “基於多重訊號分類之聲源方位偵測,” Master’s thesis, 國立台灣科技大學資訊工程系, 2008.
[2] B. Kapralos, M. R. M. Jenkin, and E. Milios, “Audio-visual localization of multiple speakers in a video teleconferencing setting,” International Journal of Imaging Systems and Technology, vol. 13, pp. 95–105, 2002.
[3] M. Cristani, M. Bicego, and V. Murino, “Audio-visual event recognition in surveillance video sequences,” IEEE Transactions on Multimedia, vol. 9, no. 2, pp. 257–267, 2007.
[4] W. Zajdel, J. D. Krijnders, T. Andringa, and D. M. Gavrila, “CASSANDRA: audiovideo sensor fusion for aggression detection,” in Proceeding of IEEE Conference on Advanced Video and Signal Based Surveillance, pp. 200–205, 2007.
[5] J. Kuklyte, P. Kelly, C. O’Conaire, N. E. O’connor, and L. Q. Xu, “Anti-social behavior detection in audio-visual surveillance systems,” in In: PRAI*HBA - The Workshop on Pattern Recognition and Artificial Intelligence for Human Behaviour Analysis, 2009.
[6] V. T. Vu, F. Bremond, G. Davini, M. Thonnat, Q. C. Pham, N. Allezard, P. Sayd, J. L. Rouas, S. Ambellouis, and A. Flancquart, “Audio-video event recognition system for public transport security,” in The Institution of Engineering and Technology Conference on Crime and Security, pp. 414–419, 2006.
[7] E. Menegatti, E. Mumolo, M. Nolich, and E. Pagello, “A surveillance system based on audio and video sensory agents,” in Intelligent Autonomous Systems 8, pp. 335–343, 2004.
[8] S. Chen, Z. P. Sun, and B. Bridge, “Automatic traffic monitoring by intelligent sound detection,” in Proceedings of the IEEE Conference on Intelligent Transportation Systems, pp. 171–176, 1997.
[9] Y. Kung, R. E. Hudson, C. W. Reed, D. Chen, and F. Lorenzelli, “Blind beamforming on a randomly distributed sensor array system,” IEEE Journal on Selected Areas in Communications, vol. 16, no. 8, pp. 1555–1567, 1998.
[10] J. C. Chen, K. Yao, R. E. Hudson, T. L. Tung, C. W. Reed, and D. Chen, “Source localization of a wideband source using a randomly distributed beamforming sensor array,” in Proceedings of International Society of Information Fusion, pp. 11–18, 2001.
[11] J. C. Chen, Y. Kung, and R. E. Hudson, “Source localization and beamforming,” IEEE Signal Processing Magazine, vol. 19, no. 2, pp. 30–39, 2002.
[12] R. Schmidt, “Multiple emitter location and signal parameter estimation,” IEEE Transactions on Antennas and Propagation, vol. 34, no. 3, pp. 276–280, 1986.
[13] R. Roy and T. Kailath, “ESPRIT-estimation of signal parameters via rotational invariance techniques,” IEEE Transactions on Acoustics, Speech and Signal Processing, vol. 37, no. 7, pp. 984–995, 1989.
[14] A. Lombard, T. Rosenkranz, H. Buchner, and W. Kellermann, “Multidimensional localization of multiple sound sources using averaged directivity patterns of blind source separation systems,” in Proceeding of IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 233–236, 2009.
[15] R. Mukai, H. Sawada, S. Araki, and S. Makino, “Source separation and DOA estimation using small 3-d microphone array,” in Proceedings of the Joint Workshop on Hands-Free Speech Communication and Microphone Arrays, pp. 9–10, 2005.
[16] Y. Kung, J. C. Chen, and R. E. Hudson, “Maximum-likelihood acoustic source localization:experimental results,” in Proceeding of IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 3, pp. 2949–2952, 2002.
[17] C. Knapp and G. Carter, “The generalized correlation method for estimation of time delay,” IEEE Transactions on Acoustics, Speech and Signal Processing, vol. 24, no. 4, pp. 320–327, 1976.
[18] G. C. Carter, “Coherence and time delay estimation,” Proceedings of the IEEE, vol. 75, no. 2, pp. 236–255, 1987.
[19] Y. Zhang and W. H. Abdulla, “A comparative study of time-delay estimation techniques using microphone arrays,” 2005.
[20] M. Mizumachi, N. Ikoma, and K. Niyada, “DOA estimation based on cross-correlation by two-step particle filtering,” in European Signal Processing Conference, 2006.
[21] P. Smaragdis and P. Boufounos, “Position and trajectory learning for microphone arrays,” IEEE Transactions on Audio, Speech, and Language Processing, vol. 15, no. 1, pp. 358–368, 2007.
[22] P. Smaragdis and P. Boufounos, “Learning source trajectories using wrapped-phase hidden markov models,” in IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, pp. 114–117, 2005.
[23] 楊善祥, “聲源三維方位偵測之研究,” Master’s thesis, 國立台灣科技大學資訊工程系, 2009.
[24] K. K. Jung, H. S. Shin, S. H. Kang, and K. H. Eom, “Object tracking for security monitoring system using microphone array,” in International Conference on Control, Automation and Systems, pp. 2351–2354, 2007.
QRCODE
 
 
 
 
 
                                                                                                                                                                                                                                                                                                                                                                                                               
第一頁 上一頁 下一頁 最後一頁 top