跳到主要內容

臺灣博碩士論文加值系統

(44.220.255.141) 您好!臺灣時間:2024/11/14 05:11
字體大小: 字級放大   字級縮小   預設字形  
回查詢結果 :::

詳目顯示

: 
twitterline
研究生:許凱翔
研究生(外文):Kai-Hsiang Hsu
論文名稱:利用逐次增加複雜度準則之快速階層H.264編碼模式決定機制
論文名稱(外文):Fast Hierarchical Coding Mode Decision Using Progressive Increasing Complexity Criterions for H.264
指導教授:章定遠
指導教授(外文):Din-Yuen Chan
學位類別:碩士
校院名稱:國立嘉義大學
系所名稱:資訊工程學系研究所
學門:工程學門
學類:電資工程學類
論文種類:學術論文
論文出版年:2004
畢業學年度:94
語文別:英文
論文頁數:64
中文關鍵詞:H.264模式決策動作估計動作向量
外文關鍵詞:H.264Mode decisionMotion estimationMotion vector
相關次數:
  • 被引用被引用:0
  • 點閱點閱:203
  • 評分評分:
  • 下載下載:4
  • 收藏至我的研究室書目清單書目收藏:0
比起先前所有的編碼標準,H.264提供了許多的機制作重大改進。其中一個改良機制,就是模式決策機制(mode decision)。模式決策機制提供了不同的區塊大小(16×16, 16×8, 8×16, 8×8, 8×4, 4×8, 4×4),來適用於不同的影像特性。可以在節省位元數的情形下得到較佳的影像品質。相對的模式決策機制也造成了編碼時複雜度大量上升。所以本研究的目標就是在維持壓縮品質下,有效減少模式決策複雜度。首先,利用16×16區塊之動作估計(Motion Estimation)時所得之SAD(Sum of absolute difference)值作判斷。若SAD之差值大於定義的臨界值,就將此模式視為決定模式。否則,利用Rate-SADdiff model之對應區域來決定是否對較小的區塊模式作判斷。若需要對8×8區塊模式作判斷,則先對8×8區塊做動作估計。再利用8×8區塊所得的SAD值決定候選模式。最後利用Lagrangian cost在候選模式中決定出最佳的模式。
本研究利用三個由簡而繁之模式判斷法則,讓較複雜之判斷法則的使用機率降低,而不明顯影響 PSNR之表現。
Comparing to the previous coding standard, H.264 provides many schemes for improving the encoding result. One of the improving scheme is mode decision. Mode decision provides different block sizes (16×16, 16×8, 8×16, 8×8, 8×4, 4×8, 4×4) to suit different video property. By the way, H.264 standard can save coding bits and gains a better video quality at the same time. On the contrary, the mode decision scheme makes coding complexity increase by times. The goal of this study is to maintain the compression quality and efficiently reduce the mode decision complexity.
First, exploiting the SAD (Sum of absolute difference) obtained from 16×16 block size ME (Motion estimation) to determine. If the SAD difference greater than defined threshold, such mode is viewed as decision mode. Otherwise, exploit the uncertainty region of Rate-SADdiff model to determine whether the smaller block size mode should be considered. If it is need to consider 8×8 block size, then 8×8 block size ME is done before. And exploit the SAD obtained from 8×8 block size ME to determine the candidate mode. Finally, use Lagrangian cost function to choose the best mode from the candidate modes.
In the study, we use three mode decision criteria to go from the simple to the complex let the complex determine criteria with the lower probability without influence PSNR.
摘要 i
Abstract i
CONTENTS iii
List of Figures v
List of Tables vii
Chapter 1 Introduction 1
1.1 Digital Video Compression Standards 1
1.2 Motivation 5
1.3 Thesis Organization 6
Chapter 2 Overview of H.264 Mode Decision 7
2.1 Mode Decision of H.264 7
2.2 Inter Prediction 8
2.1.1 Fractional Pixel Search 11
2.1.2 Quarter Luminance Sample Interpolation 12
2.1.3 Motion Vector Prediction 14
2.1.3.1 Median Prediction 14
2.1.3.2 Directional Segmentation Prediction 16
2.3 Intra Prediction 17
2.3.1 4×4 Luminance Prediction Modes 18
2.3.2 16×16 Luminance Prediction Modes 20
2.3.3 8×8 Chrominance Prediction Modes 20
2.4 Rate Distortion Optimization Schemes 21
2.4.1 Lagrangian Cost 22
2.4.2 Determining and 23
2.4.3 SAD, Hadamard Transform, SATD, SA(T)D 23
CHAPTER 3 A Hierarchical Benefit-Balancing Coding Mode Decision Mechanism for H.264 26
3.1 Introduction 26
3.2 Hierarchical Strategy Utilize The Decision Criterions 31
3.3 Uncertainty Range Decision 34
3.4 Depiction of processes for HB-CDM tree nodes 37
Chapter4 Experiment Results 41
4.1 Parameter Setting 41
4.2 Experiment Results 42
Chapter 5 Conclusions 50
5.1 Conclusions 50
5.2 Future Works 51
References 52
[1]J. Ostermann, J. Bormans, P. List, D. Marpe, M. Narroschke, F. Pereira, T. Stockhammer, and T. Wedi “Video coding with H.264/AVC:Tools, Performance, and Complexity,” IEEE Circuits and Systems Magazine, pp.7-28, 2004.
[2]T. Wiegand, G. J. Sullivan, G. Bjøntegaard, and A. Luthra, “Overview of the H.264/AVC Video Coding Standard, ” IEEE Trans. on Circuits and Systems for Video Technology, vol. 13, no.7, pp.560-576, July. 2003.
[3]H. S. Malvar, A. Hallapuro, M. Karczewicz, and L. Kerofsky “Low-Complexity Transform and Quantization in H.264/AVC, ” IEEE Trans. on Circuits and Systems for Video Technology, vol. 13, no.7, pp.598-603, July. 2003.
[4]I. Rhee, G. R. Martin, S. Muthukrishnan, and R. A. Packwood, “Quadtree-Structured Variable-Size Block-Matching Motion Estimation with Minimal Error,” IEEE Trans. on Circuits and Systems for Video Technology, vol. 10, no.1, pp.42-50, February. 2000.
[5]J. F. Yang, S. H. Chang, and C. Y. Chen, “Computation Reduction for Motion Search in Low Rate Video Coders,” IEEE Trans. on Circuits and Systems for Video Technology, vol. 12, no.10, pp.948-951, October. 2002.
[6]T. Wiegand and B. Girod, “Lagrange Multiplier Selection in Hybrid Video Coder Control,” IEEE Image Processing International Conference, vol.2, pp.837-840, September. 2003.
[7]H. kim, N. Kamaci, and Y. Altunbasak, “Low-Complexity Rate-Distortion Optimal Macroblock Mode Selection and Motion Estimation for MPEG-Like Video Coders,” IEEE Trans. on Circuits and Systems for Video Technology, vol. 3, pp.542-545, October. 2001.
[8]A. M. Peacock, D. Renshaw, and J. Hannah, “Motion direction estimates from differenced DCT images,” Electronics Letters, vol. 37, no.3, pp.163-164, Febuary. 2001.
[9]H. S. Malvar, A. Hallapuro, M. Karczewicz, and L. Kerofsky, “Low-Complexity Transform and Quantization in H.264/AVC,” IEEE Trans. on Circuits and Systems for Video Technology, vol. 13, no.7, pp.598-603, July. 2003.
[10]F. Pan, X. Lin, S. Rahardja, K. P. Lim, Z. G. Li, D. Wu, and S. Wu, “Fast Mode Decision Algorithm for Intraprediction in H.264/AVC Video Coding,” IEEE Trans. on Circuits and Systems for Video Technology, vol. 15, no.7, pp.813-822, July. 2005.
[11]B. Meng, O. C. Au, C. W. Wong, and H. K. Lam, “Efficient Intra-Prediction Algorithm in H.264,” IEEE Image Processing International Conference, vol.2, pp.837-840, September. 2003.
[12]D. Wu, S. Wu, K. P. Lim, F. Pan, Z. G. Li, and X. Lin, “Block INTER mode decision for fast encoding of H.264,” IEEE Acoustics, Speech, and Signal Processing, International Conference, vol.3, pp.181-184, May. 2004.
[13]D. Zhu, Q. Dai, and R. Ding, “Fast inter prediction mode decision for H.264,” IEEE Multimedia and Expo, International Conference, vol.2, pp.1123-1126, June. 2004.
[14]C. H. Kuo, M. S. Kuo, and C. J. Kuo, “Fast inter-prediction mode decision and motion search for H.264,” IEEE Multimedia and Expo, International Conference, vol.1, pp.663-666, June. 2004.
[15]X. Jing, and L. P. Chau, “Fast approach for H.264 inter mode decision,” Electronic Letters, vol.40, pp.1050-1052, Auguest. 2004.
[16]Q. Dai, D. Zhu, and R. Ding, “Fast Mode Decision for INTER Prediction,” IEEE Image Processing International Conference, vol.1, pp.119-122, October. 2004.
[17]B. Jeon, “Fast mode decision for H.264,” in Joint Video Team (JVT) of ISO/IEC MPEG & ITU-T VCEG, December, 2003 Doc. JVT-J033.
[18]F. Pan, X. Lin, R. Susanto, K. P. Lim, Z. G. Li, G. N. Feng, D. J. Wu, and S. Wu, “Fast Mode Decision for Intra Prediction,” in Joint Video Team (JVT) of ISO/IEC MPEG & ITU-T VCEG, March, 2003 Doc. JVT-G013.
[19]K. P. Lim, S. Wu, D. J. Wu, S. Rahardja, X. Lin, F. Pan, and Z. G. Li, “Fast INTER Mode Selection,” in Joint Video Team (JVT) of ISO/IEC MPEG & ITU-T VCEG, September, 2003, Doc. JVT-I020.
QRCODE
 
 
 
 
 
                                                                                                                                                                                                                                                                                                                                                                                                               
第一頁 上一頁 下一頁 最後一頁 top
無相關期刊