Graduate student (English name): Yu-Cheng, Yan
Thesis title (English): Detection of Ambulance and Fire Truck Siren Sounds
Keywords (English): Mel-Frequency Cepstral Coefficients; Long Short-Term Memory RNN; Perceptron; Neural Network; Siren Sound Detection
This study investigates automatic methods for detecting ambulance and fire truck siren sounds. Emergency vehicles such as ambulances, fire trucks, and police cars use sirens to warn other road users and clear a path through traffic. However, because modern cars are well insulated against sound, drivers may not notice an approaching emergency vehicle, especially when the in-vehicle audio system is in use. As a consequence, emergency vehicles may be blocked or may even collide with other vehicles. To help drivers notice approaching emergency vehicles, this work proposes methods for automatically detecting siren sounds. At this initial stage of the study, we focus only on the siren sounds of ambulances and fire trucks in Taiwan. The detection task is formulated as a three-class identification problem: ambulance siren, fire truck siren, and ambient noise. We propose a recurrent neural network, trained with deep learning, that determines which class each 1-second sound recording belongs to, based on its mel-frequency cepstral coefficients. Our experiments show that the proposed method achieves an identification accuracy of 90% on simulated noisy sound data at -15 dB and 93% on real sound data recorded on a downtown street.
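As an illustration of how simulated noisy test data of this kind might be prepared, the following minimal sketch mixes a clean signal with noise at a chosen level. It assumes that "-15 dB" refers to the signal-to-noise ratio (SNR), which the abstract does not state explicitly. The function names, the 8000 Hz sample rate, the 700 Hz stand-in tone, and the Gaussian noise are all hypothetical choices for the example and are not taken from the thesis.

```python
import math
import random


def power(samples):
    """Mean squared amplitude of a sequence of samples."""
    return sum(x * x for x in samples) / len(samples)


def mix_at_snr(signal, noise, snr_db):
    """Scale `noise` so the signal-to-noise ratio equals `snr_db`, then add it.

    Returns the noisy mixture and the scaled noise (hypothetical helper).
    """
    if len(signal) != len(noise):
        raise ValueError("signal and noise must have the same length")
    # SNR_dB = 10 * log10(P_signal / P_noise)  =>  P_noise = P_signal / 10^(SNR_dB / 10)
    target_noise_power = power(signal) / (10 ** (snr_db / 10))
    gain = math.sqrt(target_noise_power / power(noise))
    scaled_noise = [gain * n for n in noise]
    mixture = [s + n for s, n in zip(signal, scaled_noise)]
    return mixture, scaled_noise


def measured_snr_db(signal, noise):
    """SNR in dB between a clean signal and the noise that was added to it."""
    return 10 * math.log10(power(signal) / power(noise))


# Assumed setup: a 1-second, 700 Hz tone stands in for a siren recording.
sample_rate = 8000
tone = [math.sin(2 * math.pi * 700 * t / sample_rate) for t in range(sample_rate)]

# Assumed noise model: white Gaussian noise with a fixed seed for repeatability.
rng = random.Random(0)
noise = [rng.gauss(0.0, 1.0) for _ in range(sample_rate)]

noisy, scaled_noise = mix_at_snr(tone, noise, -15.0)
snr = measured_snr_db(tone, scaled_noise)
```

At an SNR of -15 dB the noise power is roughly 32 times the signal power, so a detector evaluated under this assumption has to pick out a siren that is far weaker than the background.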
Abstract
Acknowledgments
Table of Contents
List of Tables
List of Figures
Chapter 1 Introduction
1.1 Overview
1.2 Related Work
1.3 Problem Definition
1.4 Thesis Organization
Chapter 2 Siren Signal Characteristics and Research Method
2.1 Siren Signal Characteristics
2.2 Research Method
Chapter 3 Feature Extraction
3.1 Feature Parameter Extraction
3.1.1 Pre-emphasis
3.1.2 Framing
3.1.3 Hamming Window
3.1.4 Fast Fourier Transform
3.1.5 Triangular Bandpass Filter
3.1.6 Discrete Cosine Transform
Chapter 4 Neural Networks
4.1 Introduction
4.2 Characteristics of Neural Networks [7]
4.3 Artificial Neuron
4.4 Activation Function
4.5 Multilayer Perceptron
4.6 Recurrent Neural Network
4.6.1 Conventional RNN
4.6.2 LSTM-RNN
4.7 Adaptive Learning Rate Algorithm: Adam
4.8 Softmax
4.9 One-Hot Vector
4.10 Cross-Entropy
4.11 Architectures Used
4.11.1 Architecture 1: Multilayer Perceptron
4.11.2 Architecture 2: LSTM-RNN
4.12 Neural Network Training and Testing Procedure
Chapter 5 Experiments
5.1 Experimental Data and Environment
5.2 Siren Signal Recognition Experiments
5.2.1 Experiment 1: Multilayer Perceptron
5.2.2 Experiment 2: LSTM-RNN
5.2.3 Experiment 3: External Test
Chapter 6 Conclusions and Future Work
6.1 Conclusions
6.2 Future Work
References
Appendix
