臺灣博碩士論文加值系統

English |FB 專頁 |Mobile

免費會員登入| 註冊

功能切換導覽列

(216.73.216.17) 您好！臺灣時間：2025/09/03 07:53

字體大小：

:::

詳目顯示

第 1 筆 / 共 1 筆

/1頁

論文基本資料
摘要
外文摘要
目次
參考文獻
紙本論文
QR Code

本論文永久網址:

研究生:

施澤聰

研究生(外文):

Tse-Tsung Shih

論文名稱:

MPEG-4Part.10AVC之工作量特性分析

論文名稱(外文):

MPEG-4 Part.10 AVC Workload Characterization

指導教授:

楊佳玲

指導教授(外文):

Chia-Lin Yang

學位類別:

碩士

校院名稱:

國立臺灣大學

系所名稱:

資訊工程學研究所

學門:

工程學門

學類:

電資工程學類

論文種類:

學術論文

論文出版年:

2004

畢業學年度:

語文別:

英文

論文頁數:

中文關鍵詞:

工作量特性分析、影像壓縮

外文關鍵詞:

AVC、Workload Characterization、CABAC、H.264

相關次數:

被引用:0
點閱:271
評分:
下載:0
書目收藏:0

多媒體應用程式在現代的電腦系統上已成為很重要的工作量(Workload)。最近一代的的影像壓縮標準H.264/AVC 採用了許多的壓縮工具，能夠讓改善壓縮效率以及影像品質，不過卻也增加了很多實作上的複雜度。這些增加的運算量以及對儲存記憶體的需求，對一般性處理器是否能夠及時地播放影片造成很大的挑戰。
從這一份研究，我們可以發現在現代一般性處理器上執行H.264解碼器在效能上的瓶頸，瞭解H.264解碼器的特性可以讓我們調整硬體處理器的架構和軟體程式的實作去求得更好的效能。我們的分析重點放在H.264解碼器原本存在的指令間的平行度，記憶體效能，以及程式控制流程的可預測性。除此之外，我還進一步去研究哪些程式特徵(影像內容，大小，位元比例)及新增加的編碼工具(多畫框區塊參考方式，CABAC)會對超純量架構處理器的效能有直接的影響。在研究的過程中，我是採用以軟體模擬為基礎的方法來分析這個工作量的特性，它可以讓我們可以徹底地探索設計空間以及彈性地去評估各種不同架構上的加強。一些重要的發現包括1) H.264解碼器的確有顯著的指令間的平行度 2) H.264 效能不是受記憶體所限制，因為資料重複使用的單位是方塊大小而且可以留存在資料快取記憶體 3)H.264有很差的分支指令預測來自於多層迴圈以及依內容決定的分支指令，展開迴圈及絕對值指令可以減少很多因預測錯誤而等候的時間。對於程式特徵，影像內容和大小只對快取記憶體有很小的影響，較高的位元比例會增加熵函數解碼的執行時間。新增加的多畫框區塊參考方式(multi-ref frame) 並不會對資料快取記憶體有直接的影響，因為參考到前一張畫框的資料並不能留存在資料快取記憶體供重複使用。CABAC 比CAVLC 更差的程式流程可預測性，這是因為需要對位元串流做位元處理而有二分之一的機會將預測錯誤。

Multimedia applications have become important workloads for modern computer systems. The latest video coding standard H.264/AVC adopts lots of coding tools, which can improve the coding efficiency and visual quality but also add the implementation complexity a lot. The increasing computation and storage requirements pose challenges to achieve real-time video playback on general-purpose processors (GPPs). In this thesis, I study and analyze the performance of a software implementation of H.264/AVC decoder on GPPs. Through this study, we can find out the performance bottleneck of running the H.264 decoder on a modern GPP. Understanding the characteristics of H.264 decoder allows us to tune hardware processor architecture and software program implementation for performance. I analyze three important program characteristics: the intrinsic available ILP, program locality and control flow predictability. Furthermore, I investigate what application features (sequence content, resolution, bitrate) and new added coding tools (multi-ref frames, CABAC) have direct impact on performance. In this study, I adopt the simulation-based approach to perform workload characterization. It allows us to explore the design space thoroughly and evaluate different architectural enhancements. The important findings of this study includes 1) H.264 decoder does present significant instruction level parallelism. 2) H.264 is computation-bound not memory-bound because block-level data reuse can be captured by data cache. 3) H.264 has poor branch predictability due to nested loops and content dependent branch. Loop unrolling and absolute instruction can reduce branch stall time significantly. 4) For application features, video contents with low motion and smaller resolution increase the inter frame prediction opportunity thereby increasing cache miss rates. Higher bitrate increases execution time of entropy coding. New added multi-ref frame does not have direct impact on cache performance since inter-frame reuse cannot be captured in data cache. CABAC has lower control flow predictability than CAVLC due to bit-wise access to bitstream.

List of Tables iv
List of Figures v
1 Introduction 1
2 Related Work 5
2.1 Multimedia Application Characteristics 5
2.2 Video application Characteristic 7
2.3 Media Architectural Enhancements 8
3 H.264/AVC Overview 11
3.1 Motion Compensation 14
3.1.1 Intra Prediction 15
3.1.2 Inter Prediction 16
3.2 Integer Transform 17
3.3 Entropy Coding 18
3.4 Deblocking Filter 19
4 Tools and Methodology 21
5 Workload Characterization 23
5.1 ILP Analysis 23
5.2 Memory System Characteristic 27
5.3 Control Flow Behavior 35
6 Conclusion 46
Bibliography 48

[1]“Draft ITU-T Recommendation H.264 and Draft ISO/IEC 14 496-10 AVC,” in Joint Video Team of ISO/IEC JTC1/SC29/WG11 & ITU-T SG16/Q.6 Doc. JVT-G050, T. Wieg, Ed., Pattaya, Thailand, Mar. 2003.
[2]T. Wiegand, G. J. Sullivan, G. Bjontegaard, and A. Luthra, “Overview of the H.264/AVC Video Coding Standard,” IEEE Trans. Circuits Syst. Video Technol., vol. 13, pp. 560–576, July 2003.
[3]K. Diefendorff and P.K. Dubey, “How multimedia workloads will change processor design,” IEEE Computer Magazine, vol. 30, no. 9, pp. 43-45, Sep. 1997.
[4]J. Fritts, “Architecture and compiler design issues in programmable media processors,” Ph.D. Thesis, Dept. of Electrical Engineering, Princeton University, 2000.
[5]H. Liao and A. Wolfe, “Available parallelism in video applications,” Proc. IEEE/ACM Int. Sym. on Micro-architecture, pp. 321-329, Dec. 1997.
[6]S. Sohoni, Z. Xu, R. Min, and Y. Hu, “A study of memory system performance of multimedia applications,” Proc. ACM Sigmetrics, pp. 206-215, Jun. 2001.
[7]N. Slingerland and A. J. Smith, “Cache performance for multimedia applications,” Proc. ACM Intl. Conf. on Supercomputing, pp. 204-217, Jun. 2001.
[8]V. Lappalainen, Antti Hallapuro, and Timo D.Hamalainen, “Complexity of Optimized H.26L Video Decoder Implementation,” IEEE Trans. Circuits Syst. Video Technol., vol. 13, July 2003.’
[9]Sally A. McKee, Zhen Fang, and Mateo Valero, “An MPEG-4 Performance Study for non-SIMD, General Purpose Architectures,” ISPASS-2003.
[10]Matthew J. HOLLIMAN, Eric Q. LI, and Yen-Kuang CHEN, “MPEG Decoding Workload Characterization,”
[11]Nathan Slingerland and Alan Jay Smith, “Measuring the Performance of Multimedia Instruction Sets,” IEEE Tran. on computers, Vol. 51, NO. 11, Nov 2002
[12]Deepu Talla, Lizy K. John and Doug Burger, “Bottlenecks in Multimedia Processing with SIMD Style Extensions and Architectural Enhancements,” IEEE Trans. on Computers, Vol. 52, NO. 8, Aug 2003
[13]Zhigang Jin, Nelson L. Passos, and Virgil Andronache, “Branch Prediction of Conditional Nested Loops through an Address Queue,” PDCS-2001
[14]D. Burger and T.M. Austin, “The SimpleScalar Tool Set, version 2.0,” Technical Report 1342, Computer Science Dept., Univ. of Wisconsin-Madison, 1997.
[15]http://www.simplescalar.com
[16]Joint Model Version 7.3 (AVC/H.264 Reference software). Available via http://bs.hhi.de/~suehring/tml/
[17]Markus Flierl and Bernd Girod, “Generalized B Pictures and the Draft H.264/AVC Video-Compression Standard” IEEE Trans. Circuits Syst. Video Technol., July 2003.
[18]D. Marpe, H. Schwarz, and T. Wiegand, “Context-adaptive binary arithmetic coding in the H.264/AVC video compression standard,” IEEE Trans. Circuits Syst. Video Technol., vol. 13, pp. 620–636, July 2003.
[19]H. Malvar, A. Hallapuro, M. Karczewicz, and L. Kerofsky, “Low-Complexity transform and quantization in H.264/AVC,” IEEE Trans. Circuits Syst. Video Technol., vol. 13, pp. 598–603, July 2003.
[20]P. List, A. Joch, J. Lainema, G. Bjøntegaard, and M. Karczewicz, “Adaptive deblocking filter,” IEEE Trans. Circuits Syst. Video Technol., vol. 13, pp. 614–619, July 2003.

國圖紙本論文

推文
網路書籤
推薦
評分
引用網址
轉寄

top

相關論文
相關期刊
熱門點閱論文

1.	針對H.264/AVC去方塊濾波器及框內編碼之演算法和架構設計
2.	MPEG-4AVC/H.264之適應性二元算術編碼及運動補償架構設計
3.	H.264視訊編碼中P畫面的快速決定編碼模式演算法
4.	視訊場景轉換之偵測演算法與H.264畫面內編碼模式之研究
5.	具有低運算量與壓縮率的H.264/AVC數位視訊加密之研究
6.	適用於H.264/AVC熵解碼器和反離散餘弦轉換之可重組化架構設計
7.	適用於H.264之內文可調適性算術編碼器架構設計與實現
8.	H.264之低功率適應性二進制算術編碼硬體電路設計
9.	適用於H.264/AVC之高速CABAC殘值解碼器架構
10.	可變區塊大小移動估測之多模終止技術
11.	應用於H.264/AVC視訊標準之去區塊瀘波器的設計
12.	適用於H.264/AVC分數像素移動估測之快速演算法與設計
13.	用於H.264視訊解碼器之記憶體控制器與熵解碼器之設計
14.	應用於H.264/AVC1080HD的高產量背景適應性二元算術編解碼器
15.	H.264CABAC編碼器之全硬體設計

無相關期刊

1.	從癌症病人及家屬觀點探討病情告知之經驗
2.	二維水深平均束縮渠道流場之解析
3.	正常顳顎關節之開閉口運動分析
4.	扇形渦狀相位調變式超音波換能器結合機械掃描系統於超音波熱手術之探討
5.	層狀寬頻表面聲波濾波器在矽基二維聲子晶體頻溝量測之應用
6.	接觸時間與拉離加載率對結合的抗體與抗原之間解離力的影響研究
7.	飼料中添加不同類胡蘿蔔素對赤鰭笛鯛體色之影響
8.	未校正立體像機之自動化影像對正技術
9.	量子點雷射受激態及溫度效應之模擬與分析
10.	台灣農業部門勞動力流動之研究
11.	個人資料使用授權之設計
12.	以MobileIP為基礎之動態路由設計
13.	省能源高效率無線感測器網路:以跨層次設計方法之探討
14.	以MPEG-4為核心之多媒體群體合作應用架構
15.	過渡段軌道之動態反應

簡易查詢 | 進階查詢 | 熱門排行 | 我的研究室