跳到主要內容

臺灣博碩士論文加值系統

(44.201.92.114) 您好!臺灣時間:2023/03/28 04:48
字體大小: 字級放大   字級縮小   預設字形  
回查詢結果 :::

詳目顯示

我願授權國圖
: 
twitterline
研究生:侯思瑋
研究生(外文):Szu wei Hou
論文名稱:MPEG-4音訊切片算術編碼之效能分析與改進
論文名稱(外文):Performance Analysis and Improvement on MPEG-4 Bit-Sliced Arithmetic Coding for Audio
指導教授:杭學鳴杭學鳴引用關係
指導教授(外文):Hsueh-Ming Hang
學位類別:碩士
校院名稱:國立交通大學
系所名稱:電子工程系
學門:工程學門
學類:電資工程學類
論文種類:學術論文
論文出版年:2003
畢業學年度:91
語文別:英文
論文頁數:74
中文關鍵詞:音訊算術編碼切片可調式
外文關鍵詞:BSACBit-Sliecd Arithmetic CodingMPEG-4
相關次數:
  • 被引用被引用:0
  • 點閱點閱:240
  • 評分評分:
  • 下載下載:0
  • 收藏至我的研究室書目清單書目收藏:0
MPEG-4是由ISO/IEC MPEG所制訂的一套很有效率的多媒體壓縮編碼標準。MPEG-4 第二版的音訊壓縮標準提供了一些新的工具來擴充其功能。其中一個工具稱為切片式算數編碼(Bit-Sliced Arithmetic Coding, BSAC)工具,這個工具提供了編碼率精細可調式的音訊編碼功能,每個可調間距大約為1 kbits/s/ch。這功能對一些頻寬容易變動的通訊系統,例如網際網路或行動通訊來說,是非常有用的。
在本篇論文當中,我們首先研究切片式算數編碼的音質效能及其對於傳輸錯誤的敏感度。接著,我們提出兩種方法試圖改善切片式算數編碼的編碼效率。比較切片式算數編碼和進階音訊編碼(Advanced Audio Coding, AAC)的音質效能之後,我們對實驗結果進行分析,並提出造成兩者效能差異的可能原因。因為算數編碼是一種對傳輸錯誤很敏感的編碼方式,所以我們也研究了切片式算數編碼中的錯誤傳遞問題。
在改善編碼效率方面,我們研究了在切片式算數編碼過程中中會用到的機率模型。我們也設計並測試經由實際聲音訊號產生的機率模型。另一個改善編碼效能的方法是改變每個可調層分配到的位元數。主要觀念在於分配更多的位元數給較低頻的可調層。這個方法將可以看到比較明顯的效能改善。

The MPEG-4 standard defined by ISO/IEC MPEG is a very efficient coding standard for multimedia data. MPEG-4 version 2 provides several new tools for audio coding. One of them is the so-called BSAC (Bit-Sliced Arithmetic Coding) tool, which provides scalable coding with fine granularity. The scalability step is about 1 kbits/s/ch, which would be useful for communication systems with fluctuating bandwidth, such as Internet and mobile communication.
In this thesis, we first investigate the quality and transmission error sensitivity of BSAC. Then, we propose two modifications in attempt to improve the coding efficiency. The coded audio quality of BSAC is compared with that of AAC (Advanced Audio Coding). And then we analyze the experimental results and identify the cause of its performance loss. It is well-known that arithmetic coding is sensitive to transmission errors. We thus also investigate the error-propagation problem of BSAC.
To improve the coding efficiency, we study the probability models used in the arithmetic coding process of BSAC. Probability models trained from real audio data have been designed and tested although they do not offer significant improvement. Another attempt is changing the bits allocated to each layer. The idea is that the lower frequency bands should receive more bits in coding. The coding gain using this strategy turns out to be much more significant.

中文摘要 i
Abstract ii
致 謝 iii
List of Figures iii
List of Tables v
Chapter 1 Introduction 1
Chapter 2 MPEG-2/4 Advanced Audio Coding and Scalable Audio 3
2.1 MPEG-2 AAC 3
2.1.1 Tools description of MPEG-2 AAC 5
2.1.1.1 Gain Control Tool 5
2.1.1.2 Filterbank Tool 5
2.1.1.3 Temporal Noise Shaping (TNS) Tool 6
2.1.1.4 Intensity/Coupling Tool 7
2.1.1.5 Prediction Tool 8
2.1.1.6 Middle/Side (M/S) Tool 8
2.1.1.7 Scalefactor Tool 8
2.1.1.8 Quantization Tool 9
2.1.1.9 Noiseless Coding Tool 9
2.2 MPEG-4 AAC 9
2.2.1 Long Term Prediction (LTP) Tool 9
2.2.2 Perceptual Noise Substitution (PNS) tool 11
2.2.3 Transform-domain Weighted Interlaced Vector Quantization (TwinVQ) tool 12
2.3 MPEG Scalable Audio Coding 13
Chapter 3 Bit-Sliced Arithmetic Coding (BSAC) 16
3.1 Introduction 16
3.2 Architecture and algorithm of audio coding using BSAC 17
3.2.1 Architecture of BSAC 17
3.2.2 BSAC Operations 18
3.2.2.1 Bit Slicing of the quantized spectral coefficients 19
3.2.2.2 Probability Determination 19
3.2.2.3 Grouping and Interleaving 24
3.2.2.4 Coding the Scalefactors 25
3.2.2.5 Layer Segmentation 26
3.2.2.6 Arithmetic Coding 27
3.2.2.7 Overall Encoding Procedure 27
Chapter 4 Performance Analysis of BSAC 30
4.1 Audio Quality Measurement 30
4.2 Quality Analysis of BSAC 31
4.3 Performance Attenuation due to Error 34
Chapter 5 Performance Improvement of BSAC 38
5.1 Arithmetic Models Selection 38
5.1.1 Introduction 38
5.1.2 Models selection outside the rate-distortion loop 39
5.1.3 Experimental Results 40
5.1.4 Models selection inside the rate-distortion loop 41
5.1.5 Experiment Result 41
5.2 Create New Arithmetic Models 45
5.2.1 Motivation 45
5.2.2 Method 45
5.2.3 Experimental Results 46
5.3 Bit Allocation to each layer 50
5.3.1 Motivation 50
5.3.2 Proposed Algorithm 51
5.3.3 Experimental Results 55
Chapter 6 Conclusions and Future Work 63
Bibliography 65

[1] Heiko Purnhagen, “An Overview of MPEG-4 Version 2 Audio”, AES 17th International Conference on High Quality Audio Coding, Sep, 1999.
[2] ISO/IEC JTC/SC29/WG11 MPEG, International Standard ISO/IEC 13818-7 “Advanced Audio Coding”, 1997
[3] Ted Painter and Andreas Spainias, “Perceptual Coding of Digital Audio”, Proceedings of the IEEE, Vol. 88, No. 4, April 2000
[4] ISO/IEC JTC/SC29/WG11 MPEG, International Standard ISO/IEC 14496-3 “Coding of Audio-Visual Objects: Audio”,1999
[5] ISO/IEC JTC/SC29/WG11 MPEG, International Standard 14496-3 Amd1 “Coding of Audio-Visual Objects: Audio”,2000
[6] S. H. Park et al., “Multi-Layer Bit-Sliced Bit Rate Scalable MPEG-4 Audio Coder”, presented at the 103th Convention of the AES, New York, Sep. 1997(preprint 4520)
[7] Don-Hyung Kim, Jung-Hoe Kim, and Sang Wook Kim, “Scalable Lossless Audio Coding Based on MPEG-4 BSAC”, presented at the 113th Convention of the AES, Los Angels, Oct. 2002(preprint 5679)
[8] ISO/IEC JTC/SC29/WG11 MPEG, Final Draft International Standard ISO/IEC 14496-5, 2001
[9] http://www.scalatech.co.uk/technology.htm
[10] Charles D. Creusere, “An Analysis of Perceptual Artifacts in MPEG Scalable Audio Coding”, Proceedings of the Data Compression Conference, IEEE, 2002
[11] John G. Beerends, Jan A. Stemerdink, “A Perceptual Audio Quality Measurement Based on a Psychoacoustic Sound Representation”, AES Vol. 40 No.12
[12] A. Lerchs, “EAQUAL software”, Version 0.1.3 alpha, http://www.mp3-tech.org/
[13] Dai Yang, Hongmei Ai, Chris Kyriakakis and C. C. Jay Kuo, “Embedded High-Quality Multichannel Audio Coding”, Conference on Media Processors, Part of the Symposium on Electronic Imaging 2001, San Jose, CA, January 2001.
[14] Dai. Yang, Hongmei. Ai, C. Kyriakakis, and C. C. Jay Kuo “Design of progressive syntax-rich multichannel audio codec”, Proc. SPIE, pages 121-132, San Jose, CA, January 2002.

QRCODE
 
 
 
 
 
                                                                                                                                                                                                                                                                                                                                                                                                               
第一頁 上一頁 下一頁 最後一頁 top
無相關論文