National Digital Library of Theses and Dissertations in Taiwan

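Graduate student: Tse-Tsung Shih (施澤聰)
Thesis title: MPEG-4 Part.10 AVC Workload Characterization
Advisor: Chia-Lin Yang (楊佳玲)
Degree: Master's
Institution: National Taiwan University
Department: Graduate Institute of Computer Science and Information Engineering
Discipline: Engineering
Field: Electrical and Computer Engineering
Thesis type: Academic thesis
Year of publication: 2004
Academic year of graduation: 92 (ROC calendar)
Language: English
Number of pages: 50
Keywords (Chinese, translated): workload characterization; video compression
Keywords (English): AVC; Workload Characterization; CABAC; H.264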
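Multimedia applications have become an important workload on modern computer systems. The latest-generation video compression standard, H.264/AVC, adopts many coding tools that improve compression efficiency and picture quality but also add considerable implementation complexity. The added computation and memory requirements make it challenging for general-purpose processors to play back video in real time.
This study identifies the performance bottlenecks of running an H.264 decoder on a modern general-purpose processor. Understanding the decoder's characteristics lets us tune both the hardware processor architecture and the software implementation for better performance. The analysis focuses on the instruction-level parallelism inherent in the H.264 decoder, memory performance, and the predictability of program control flow. In addition, I examine which application features (video content, resolution, bitrate) and newly added coding tools (multiple reference frames, CABAC) directly affect the performance of a superscalar processor. I use a software-simulation-based approach to characterize the workload, which allows thorough exploration of the design space and flexible evaluation of different architectural enhancements. The main findings are: 1) the H.264 decoder does exhibit significant instruction-level parallelism; 2) H.264 performance is not memory-bound, because data is reused at block granularity and stays resident in the data cache; 3) H.264 has poor branch prediction owing to nested loops and content-dependent branches, and loop unrolling and an absolute-value instruction can remove much of the time spent stalled on mispredictions. As for application features, video content and resolution have only a small effect on the cache, while a higher bitrate increases the execution time of entropy decoding. The newly added multiple-reference-frame tool (multi-ref frame) has no direct effect on the data cache, because data referenced from a previous frame cannot stay resident in the data cache for reuse. CABAC has worse control-flow predictability than CAVLC, because it must process the bitstream bit by bit and each such branch has a one-in-two chance of being mispredicted.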
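The contrast between loop branches and bit-dependent branches can be illustrated with a small simulation. This is a minimal sketch under stated assumptions, not the simulator or predictor configuration used in the thesis: it assumes a single 2-bit saturating counter per branch, a 16-iteration loop, and uniformly random bits standing in for bitstream-dependent decisions. All function names are hypothetical.

```python
import random


def mispredict_rate(outcomes):
    """Fraction of branches mispredicted by one 2-bit saturating counter."""
    counter = 0  # states 0-1 predict "not taken", states 2-3 predict "taken"
    misses = 0
    total = 0
    for taken in outcomes:
        predicted_taken = counter >= 2
        if predicted_taken != taken:
            misses += 1
        # Move the counter toward the actual outcome, saturating at 0 and 3.
        counter = min(counter + 1, 3) if taken else max(counter - 1, 0)
        total += 1
    return misses / total


def loop_branch(trips, iterations):
    """Backward branch of a counted loop: taken on every pass except the exit."""
    for _ in range(trips):
        for i in range(iterations):
            yield i < iterations - 1


def bit_branch(count, seed=0):
    """Branch whose direction follows one (assumed uniformly random) input bit."""
    rng = random.Random(seed)
    for _ in range(count):
        yield rng.getrandbits(1) == 1


if __name__ == "__main__":
    print("loop branch misprediction rate:", mispredict_rate(loop_branch(1000, 16)))
    print("bit branch misprediction rate: ", mispredict_rate(bit_branch(16000)))
```

Under these assumptions the loop branch is mispredicted roughly once per loop exit, while the bit-driven branch stays close to one misprediction in two, whichever way the counter leans.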
Multimedia applications have become important workloads for modern computer systems. The latest video coding standard, H.264/AVC, adopts many coding tools that improve coding efficiency and visual quality but also add considerable implementation complexity. The increased computation and storage requirements make real-time video playback on general-purpose processors (GPPs) challenging. In this thesis, I study and analyze the performance of a software implementation of an H.264/AVC decoder on GPPs. The study identifies the performance bottlenecks of running the H.264 decoder on a modern GPP; understanding the decoder's characteristics allows us to tune both the hardware processor architecture and the software implementation for performance. I analyze three important program characteristics: the intrinsic available instruction-level parallelism (ILP), program locality, and control flow predictability. Furthermore, I investigate which application features (sequence content, resolution, bitrate) and newly added coding tools (multiple reference frames, CABAC) have a direct impact on performance. I adopt a simulation-based approach to workload characterization, which allows thorough exploration of the design space and evaluation of different architectural enhancements. The important findings of this study include: 1) The H.264 decoder does present significant instruction-level parallelism. 2) H.264 is computation-bound rather than memory-bound, because block-level data reuse can be captured by the data cache. 3) H.264 has poor branch predictability due to nested loops and content-dependent branches; loop unrolling and an absolute-value instruction can reduce branch stall time significantly. 4) For application features, video content with low motion and smaller resolution increases the opportunity for inter-frame prediction, thereby increasing cache miss rates; higher bitrate increases the execution time of entropy coding. The newly added multiple-reference-frame tool does not have a direct impact on cache performance, since inter-frame reuse cannot be captured in the data cache. CABAC has lower control flow predictability than CAVLC due to bit-wise access to the bitstream.
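Finding 3 names loop unrolling and an absolute-value operation as ways to cut branch stalls. The sketch below shows the shape of both transformations on a sum of absolute differences (SAD). SAD is an assumed example kernel, not one taken from the thesis, and all function names are hypothetical. The mask trick assumes every difference fits in a 32-bit signed integer. Python does not expose hardware branches, so this illustrates the code transformation only, not a measured speedup.

```python
def abs_branchless(x):
    """Absolute value without a conditional; assumes -2**31 <= x < 2**31."""
    mask = x >> 31            # -1 for negative x, 0 otherwise (arithmetic shift)
    return (x ^ mask) - mask  # for negative x: ~x + 1 == -x; otherwise x


def sad_rolled(a, b):
    """Baseline: one loop branch and one data-dependent branch per sample."""
    total = 0
    for i in range(len(a)):
        d = a[i] - b[i]
        if d < 0:  # content-dependent branch
            d = -d
        total += d
    return total


def sad_unrolled4(a, b):
    """Unrolled by 4 with branch-free absolute values.

    Assumes len(a) == len(b) and that the length is a multiple of 4.
    """
    total = 0
    for i in range(0, len(a), 4):  # one loop branch per four samples
        total += (abs_branchless(a[i] - b[i])
                  + abs_branchless(a[i + 1] - b[i + 1])
                  + abs_branchless(a[i + 2] - b[i + 2])
                  + abs_branchless(a[i + 3] - b[i + 3]))
    return total


if __name__ == "__main__":
    cur = [10, 200, 35, 0, 7, 7, 255, 1]
    ref = [12, 180, 35, 9, 0, 8, 0, 3]
    print(sad_rolled(cur, ref), sad_unrolled4(cur, ref))
```

The unrolled version evaluates the loop condition once per four samples and contains no sign test, which is the effect the finding attributes to unrolling and an absolute-value instruction.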
List of Tables
List of Figures
1 Introduction
2 Related Work
2.1 Multimedia Application Characteristics
2.2 Video Application Characteristic
2.3 Media Architectural Enhancements
3 H.264/AVC Overview
3.1 Motion Compensation
3.1.1 Intra Prediction
3.1.2 Inter Prediction
3.2 Integer Transform
3.3 Entropy Coding
3.4 Deblocking Filter
4 Tools and Methodology
5 Workload Characterization
5.1 ILP Analysis
5.2 Memory System Characteristic
5.3 Control Flow Behavior
6 Conclusion
Bibliography