



Thesis title (English): Reduced Computation of Speech Coder Using a Voice Activity Detection Algorithm
Keywords (English): VAD algorithm; Reduction computational complexity; G.723.1

With the explosive growth of Internet use and multimedia technology, multimedia communication is now integrated into personal information devices. Because such devices have limited computational capability, there is a need for a coder with low computational complexity that can run on different hardware platforms and integrate the services of media sources. For an Internet or wireless speech communicator, heavy computation consumes more power, which raises the price of the device or shortens its battery life. To achieve real-time, continuous speech communication, reducing the computational complexity of the speech coder is therefore desirable in modern communication systems. In this thesis, we use a Voice Activity Detection (VAD) algorithm, which in our proposed method serves only to classify speech frames into two types: active frames and inactive frames.
We analyzed the characteristics of inactive speech signals in our experiments. The experimental results clearly show that the encoding parameters are uniformly distributed for inactive speech subframes. Therefore, if the current frame is an inactive speech frame, its code-excited signal is not encoded through a codebook search; instead, the encoding parameters for the codebook structure are assigned randomly. The overall simulation results indicate that the average Perceptual Evaluation of Speech Quality (PESQ) score degrades only slightly, by 0.023, and that the proposed method reduces total computational complexity by about 30% relative to the computational load of the original G.723.1 encoder, with perceptually negligible degradation.
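The decision rule described in the abstract can be sketched roughly as follows. This is a minimal illustration and not the thesis implementation. The energy-threshold detector `is_active` is a simple stand-in for the VAD algorithm the thesis uses. The function `search_fn`, the field names in `random_excitation_params`, the index ranges, and the `-40 dB` threshold are all hypothetical placeholders and do not reflect the G.723.1 bitstream layout. Only the frame length (240 samples, 30 ms at 8 kHz) and the four subframes per frame follow the standard.

```python
import math
import random

FRAME_LEN = 240          # one G.723.1 frame: 30 ms at 8 kHz
SUBFRAMES_PER_FRAME = 4  # 60 samples each


def is_active(frame, threshold_db=-40.0):
    """Stand-in VAD (assumption): a plain energy threshold, not the thesis's VAD."""
    mean_sq = sum(s * s for s in frame) / len(frame)
    return 10.0 * math.log10(mean_sq + 1e-12) > threshold_db


def random_excitation_params(rng):
    """Draw excitation parameters uniformly instead of searching the codebook.

    Field names and index ranges are illustrative placeholders.
    """
    return {
        "pulse_positions": [rng.randrange(8) for _ in range(4)],
        "pulse_signs": [rng.choice((-1, 1)) for _ in range(4)],
        "gain_index": rng.randrange(24),
    }


def encode(samples, search_fn, rng):
    """Return (per-frame parameter lists, number of frames that skipped the search).

    search_fn(frame, subframe_index) is a hypothetical hook standing for the
    costly closed-loop excitation search run on active frames.
    """
    encoded, skipped = [], 0
    for start in range(0, len(samples) - FRAME_LEN + 1, FRAME_LEN):
        frame = samples[start:start + FRAME_LEN]
        if is_active(frame):
            params = [search_fn(frame, sf) for sf in range(SUBFRAMES_PER_FRAME)]
        else:
            # Inactive frame: no search, parameters are assigned at random.
            params = [random_excitation_params(rng)
                      for _ in range(SUBFRAMES_PER_FRAME)]
            skipped += 1
        encoded.append(params)
    return encoded, skipped
```

In this sketch the saving comes entirely from the frames counted in `skipped`, since those frames never call `search_fn`. The actual reduction therefore depends on how much of the input is inactive and on how much of the encoder's cost the excitation search accounts for.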

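Abstract
Acknowledgements
Table of Contents
Chapter 1 Introduction
1.1 Background of Speech Coding Technology
1.2 ITU-T Speech Coding Standards
1.3 Research Objectives
1.4 Thesis Outline
Chapter 2 The G.723.1 Speech Coder
2.1 CELP Coding Architecture
2.2 ITU-T G.723.1 Speech Coder
2.3 Framer
2.4 High Pass Filter
2.5 Linear Predictive Coding Analysis (LPC Analysis)
2.6 Line Spectral Pair Quantization (LSP Quantizer)
2.7 Formant Perceptual Weighting Filter (FPWF)
2.8 Open-Loop Pitch Estimation (Pitch Estimator)
2.9 Closed-Loop Adaptive Pitch Predictor (Pitch Predictor)
2.10 Harmonic Noise Shaping Filter (Harmonic Noise Shaping)
2.11 Algebraic Code-Excited Linear Prediction (ACELP)
2.12 Voice Activity Detection (VAD)
2.13 Speech Quality Evaluation (PESQ)
Chapter 3 Analysis of Encoding Parameter Characteristics of Inactive Speech Signals
3.1 Analysis of Encoding Parameter Characteristics of the Closed-Loop Adaptive Pitch Predictor
3.1.1 Analysis of Fifth-Order Closed-Loop Adaptive Pitch Lag Characteristics
3.1.2 Analysis of Fifth-Order Closed-Loop Adaptive Pitch Gain Characteristics
3.2 Analysis of ACELP Codebook Parameter Characteristics
3.2.1 Analysis of ACELP Excitation Pulse Position and Sign Characteristics
3.2.2 Analysis of ACELP Excitation Pulse Gain Characteristics
Chapter 4 Evaluation Experiments on Computational Complexity and Speech Quality
4.1 Evaluation of Speech Quality with Random Closed-Loop Pitch Parameters
4.2 Evaluation of Speech Quality with Random ACELP Code-Excitation Parameters
4.3 Evaluation of Speech Quality with Random Encoding Parameters for Inactive Speech Signals
Chapter 5 Conclusion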
Computational Complexity Reduction of G.723.1 Coder Using a Voice Activity Detection Algorithm
Efficient Reduction Computational Complexity For Speech Coding


