National Digital Library of Theses and Dissertations in Taiwan

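Author: 林映伶
Author (English): Yin-Ling Lin
Title (Chinese): MPEG-2/4低複雜度先進音訊編碼演算法最佳化及在StrongARM平台上之實現
Title (English): MPEG-2/4 Low Complexity AAC Encoder Optimization and Implementation on a StrongARM Platform
Advisor: 吳炳飛
Advisor (English): Bing-Fei Wu
Degree: Master's
Institution: National Chiao Tung University (國立交通大學)
Department: Department of Electrical and Control Engineering
Discipline: Engineering
Field: Electrical Engineering and Computer Science
Thesis type: Academic thesis
Year of publication: 2005
Academic year of graduation: 93 (ROC calendar)
Language: English
Number of pages: 109
Keywords (Chinese): 音訊編碼 (audio coding)
Keywords (English): AAC; MPEG Audio; Low complexity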
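Abstract (translated from the Chinese): This thesis proposes an optimized algorithm for AAC encoding and an application that adds a data embedding algorithm to the AAC encoding system. Both systems are implemented on a 206 MHz, 32-bit fixed-point processor, the StrongARM SA-1110. Experimental results show that the proposed architecture encodes at a speed of at least 1x on the experimental platform. In the AAC encoder optimization, we remove the computationally expensive switching between long and short windows, simplify the control flow of TNS and M/S stereo coding, simplify the computation of mathematical functions, use a faster quantization module, and implement the MDCT with a fast algorithm. To compensate for the error introduced by the fixed-point conversion, we add bandwidth control and an MDCT computation with dynamic precision. Finally, to further extend the functionality of AAC files, a data embedding application is added to the AAC encoding system.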
In this thesis, we present an optimized AAC encoding scheme and propose a data embedding method integrated into the AAC encoding system. Both are realized on a 32-bit fixed-point processor, the StrongARM SA-1110. Experimental results show that an encoding speed of at least 1x is achieved. For the AAC encoding algorithm, we propose several optimizations: removal of block switching, a fast MDCT, simplified TNS, simplified M/S stereo coding, optimized mathematical functions, and fast quantization. To compensate for the error caused by fixed-point conversion, bandwidth control and an MDCT with dynamic data precision are applied. Finally, a data embedding method is implemented to further increase the utility of AAC files.
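The abstract does not say how the dynamic-precision MDCT works, so the following is only a minimal sketch of one common way to recover precision in fixed-point transforms, not the thesis's method. It uses block scaling: each input block is left-shifted by as many bits as a 32-bit accumulator allows before the integer MDCT, and the shift is undone afterwards. The function names, the Q15 cosine scaling, the truncating multiply, and the direct O(N^2) transform (standing in for the fast algorithm) are all assumptions made for illustration. Python is used for readability, although the thesis targets fixed-point C.

```python
import math

Q15_ONE = 1 << 15  # assumed Q15 scaling for the cosine table


def mdct_float(block):
    """Textbook MDCT of a 2N-sample block (direct O(N^2) form), N outputs."""
    n = len(block) // 2
    return [
        sum(v * math.cos(math.pi / n * (i + 0.5 + n / 2) * (k + 0.5))
            for i, v in enumerate(block))
        for k in range(n)
    ]


def headroom_shift(block, acc_bits=31):
    """Largest left shift that keeps a sum of len(block) terms within acc_bits."""
    peak = max(abs(v) for v in block)
    if peak == 0:
        return 0
    limit = (1 << acc_bits) // len(block)
    shift = 0
    while (peak << (shift + 1)) < limit:
        shift += 1
    return shift


def mdct_fixed(block, shift):
    """Integer MDCT with Q15 cosines; the input is pre-scaled by 2**shift."""
    n = len(block) // 2
    out = []
    for k in range(n):
        acc = 0
        for i, v in enumerate(block):
            c = round(math.cos(math.pi / n * (i + 0.5 + n / 2) * (k + 0.5)) * Q15_ONE)
            # Truncating multiply; the product is assumed to be formed in a
            # wider intermediate before being shifted back down.
            acc += ((v << shift) * c) >> 15
        out.append(acc / (1 << shift))  # undo the block scaling for comparison
    return out


# A quiet 16-sample test block (peak 100 out of a 16-bit range).
quiet = [0, 92, 71, -38, -100, -38, 71, 92, 0, -92, -71, 38, 100, 38, -71, -92]
shift = headroom_shift(quiet)
reference = mdct_float(quiet)
err_fixed = max(abs(a - b) for a, b in zip(mdct_fixed(quiet, 0), reference))
err_dynamic = max(abs(a - b) for a, b in zip(mdct_fixed(quiet, shift), reference))
print(shift, err_fixed, err_dynamic)
```

For quiet blocks the truncation error of each multiply is divided by 2**shift when the scaling is undone, which is why the block-scaled result tracks the floating-point reference more closely than the unscaled one. A real encoder would carry the shift along as a block exponent rather than converting back to floating point.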
ABSTRACT (CHINESE)
ABSTRACT (ENGLISH)
ACKNOWLEDGEMENTS
CONTENTS
LIST OF TABLES
LIST OF FIGURES
CHAPTER 1. Introduction
1.1 Background
1.2 Motivation
1.3 Innovation
1.4 Content Organization
CHAPTER 2. Psychoacoustic Model
2.1 The Absolute Threshold of Hearing
2.2 Critical Band
2.3 Masking Effect
2.3.1 Simultaneous Masking
2.3.2 Temporal Masking
2.4 Psychoacoustic Model
CHAPTER 3. MPEG-2 AAC Algorithm
3.1 Overview
3.2 Filter Bank and Block Switching
3.2.1 MDCT
3.2.2 Window Shape
3.2.3 Block Switching
3.3 Temporal Noise Shaping
3.3.1 Pre-echo Phenomenon
3.3.2 TNS Processing
3.4 M/S Stereo Coding
3.4.1 Binaural Masking Level Difference
3.4.2 M/S Stereo Threshold
3.4.3 L/R and M/S Switching
3.5 Intensity Stereo Coding
3.6 Prediction
3.6.1 Predictor Structure
3.6.2 Predictor Control
3.7 Quantization
3.7.1 Nonuniform Quantization Function
3.7.2 Scalefactor Band
3.7.3 Iteration Process
3.8 Noiseless Coding
3.8.1 Grouping and Interleaving
3.8.2 Spectral Clipping
3.8.3 Huffman Coding
3.8.4 Sectioning
3.9 Gain Control
3.9.1 Polyphase Quadrature Filter
3.9.2 Gain Detector
3.9.3 Gain Modifier
3.10 Bitstream Format
CHAPTER 4. MPEG-2/4 LC AAC Encoder Optimization
4.1 Complexity Analysis
4.2 Removal of Block Switching
4.3 Fast MDCT
4.4 Simplified TNS
4.5 Simplified M/S Stereo Coding
4.6 Quantization Optimization
4.6.1 Scalefactor Prediction
4.6.2 Simplified QuantizeBand()
4.7 Math Function
4.7.1 TNS
4.7.2 Quantization
CHAPTER 5. Implementation on a StrongARM Processor
5.1 Implementation Flow
5.2 Fixed-point C Code Implementation
5.3 Modify Coding Style
CHAPTER 6. Implementation of Data Embedded Method
6.1 The Properties of Data Embedded Method
6.2 Implementation of Data Embedded Encoder
6.2.1 Embedding Data into High Frequency Range
6.3 Implementation of Data Embedded Decoder
CHAPTER 7. Experimental Results
7.1 MPEG-2/4 LC AAC Encoder
7.1.1 Resource Distribution
7.1.2 Resource Requirement Improvement
7.1.3 Encoding Speed
7.1.4 Quality Evaluation
7.2 Data Embedded Method
7.2.1 Resource Distribution
7.2.2 Encoding Speed and File Size
7.2.3 Embedded Data Size
7.2.4 Quality Evaluation
CHAPTER 8. Conclusions and Future Works
8.1 Conclusions
8.2 Future Works
REFERENCE
APPENDIX A. Advantech PCM-7130 SBC
APPENDIX B. Data Embedded Codec
B.1 Package File
B.2 Data Stream Analyzer
B.3 Lyrics Analyzer
APPENDIX C. EAQUAL
The proposed source code is based on a modified version of the FAAC (Freeware Advanced Audio Coder) implementation, http://www.audiocoding.com.