跳到主要內容

臺灣博碩士論文加值系統

(18.97.14.81) 您好!臺灣時間:2025/02/13 08:44
字體大小: 字級放大   字級縮小   預設字形  
回查詢結果 :::

詳目顯示

我願授權國圖
: 
twitterline
研究生:盧彥彰
研究生(外文):Yen-Chang Lu
論文名稱:基於近似繪圖法實現之適用於手持裝置之低功率三維繪圖系統
論文名稱(外文):Approximate Rendering Based Low Power GPU for Mobile Devices
指導教授:簡韶逸
指導教授(外文):Shao-Yi Chien
口試委員:陳炳宇楊佳玲范倫達
口試日期:2010-10-18
學位類別:碩士
校院名稱:國立臺灣大學
系所名稱:電子工程學研究所
學門:工程學門
學類:電資工程學類
論文種類:學術論文
論文出版年:2011
畢業學年度:99
語文別:英文
論文頁數:94
中文關鍵詞:三維繪圖系統低功率近似繪圖手持裝置晶片系統
外文關鍵詞:GPUlow power3D renderingmobile devicesSoC
相關次數:
  • 被引用被引用:0
  • 點閱點閱:336
  • 評分評分:
  • 下載下載:0
  • 收藏至我的研究室書目清單書目收藏:0
近年消費性電子產品在多媒體應用的發展非常蓬勃,其中個人手持式電子裝置如智慧型手機,除了基本的電話、簡訊通訊外,還支援各種的多媒體應用服務,如:數位相機、網路服務、MP3隨身聽、電玩遊戲…等,並大大地影響人們的生活。而從我們日常使用類似產品的經驗中不難發覺,螢幕已成為一個難以取代的使用者介面,也因此人們對於其裝置的「視覺效果」有了越來越高的需求;在手持式裝置上配備三維繪圖處理器(GPUs)的情況也越來越普遍。然為因應使用者對畫面品質的高要求,三維繪圖處理器所需要處理的運算也越來越複雜,這同時意味著更多的能源損耗;對資源極有限的手持式裝置來說,不啻為一大挑戰:如何在顧及畫面品質的前提下,以最有效地方式描繪畫面。更甚者,在電源短缺的時候,如何能提供使用者一個省電的模式,來完成手邊正進行中的重要事項,也是一大考驗。
在這篇論文中,我們提出了一個可調整解析度的繪圖流程(Graphic pipeline),利用近似繪圖的方式,減少需要運算的畫素(Pixel)量,進而達到節省電量的目的。而為能確保繪圖品質的減損維持在可接受的範圍內,我們也提出了一個估計運算誤差的誤差函式(Error function),而推導此誤差函式的方法可以適用於各種繪圖場景,甚至是其他更通用性的物理計算;如此一來,三維繪圖處理器便能更有效率地描繪出符合使用者期待品質的畫面,而在能源的控制上也能達到更細緻的程度。
結合以上的技術,我們修改了一個現有的適用於行動式多媒體裝置的低功率三維繪圖處理器,來測試我們提出的繪圖流程,並將之實現成一個系統晶片平台,原型晶片利用台積電65nm 技術製成,面積為4x4 mm2,其工作頻率為200MHz,最大消耗功率為128 mW。


Due to the fast growth of the market for personal mobile devices and the improvement of process technology, more and more multi-media applications are embedded on our handheld devices. Since most of these devices use the screen as its main user interface, high visual quality becomes one of the most important factors when evaluating the user experience. To achieve this requirement, GPU becomes a great computing power for a mobile device. However, the mobile devices may not have enough power due to the nature of sparse resource. For example, the only power supply for mobile phone is the battery. Therefore, it is a big challenge to maintain visual quality under a low battery situation.
In this thesis, we proposed a low power graphics pipeline based on approximate rendering technique with error control scheme. The approximate rendering technique is targeting on reducing the number of rendered pixel and hence to reduce the expensive computation power. The proposed technique can reach a reduction rate of 50% for medium resolution mode and 80% for low resolution mode in most cases. Furthermore, the error control scheme can estimate the error caused by the approximation so the system can render objects adaptively. Consequently the error control scheme, we can render image with fewer pixels but still sustain acceptable quality. Finally, with the proposed techniques above, a multi-core GPU system for mobile devices is implemented. The prototype chip is fabricated in TSMC 65nm technology, and the chip size is 4.0o4.0mm2. The designed working requency is 200MHz, and the worst case power consumption is 128mW.

1 Introduction 1
1.1 Introduction to Graphic Pipeline . . . . . . . . . . . . . . . . . . 3
1.2 Limitations for Embedded GPU on Mobile Device . . . . . . . . 4
1.3 Analysis of The Power Consumption for Graphic Pipeline . . . . 6
1.3.1 Level of Detail . . . . . . . . . . . . . . . . . . . . . . . 7
1.3.2 Lighting Model . . . . . . . . . . . . . . . . . . . . . . . 8
1.3.3 Texture Model . . . . . . . . . . . . . . . . . . . . . . . 9
1.3.4 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . 9
1.4 Motivation and Target . . . . . . . . . . . . . . . . . . . . . . . . 10
1.5 Thesis Organization . . . . . . . . . . . . . . . . . . . . . . . . . 11
2 PreviousWorks 13
2.1 CORDIC Vector Interpolator . . . . . . . . . . . . . . . . . . . . 15
2.2 Geometry-aware Framebuffer Level of Detail . . . . . . . . . . . 17
2.3 Spatio-Temporal Upsampling on GPU . . . . . . . . . . . . . . . 19
3 Proposed Approximate Rendering Pipeline 23
3.1 Configurable Resolution Sampling . . . . . . . . . . . . . . . . . 24
3.1.1 Structure Overview . . . . . . . . . . . . . . . . . . . . . 24
3.1.2 Configurable Resolution Sampling . . . . . . . . . . . . . 25
3.1.3 Repacking Pixels to Sampling Pattern . . . . . . . . . . . 26
3.2 Error Estimation . . . . . . . . . . . . . . . . . . . . . . . . . . . 28
3.2.1 Taylor Expansion Series . . . . . . . . . . . . . . . . . . 29
3.2.2 Error Function . . . . . . . . . . . . . . . . . . . . . . . 29
3.2.3 Simple Example of Fast Lighting . . . . . . . . . . . . . 30
3.2.4 Complicated Lighting: Phong Shading . . . . . . . . . . . 32
3.2.5 Computation Overhead . . . . . . . . . . . . . . . . . . . 35
3.2.6 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . 36
3.3 Proposed Adaptive Resolution Rendering Architecture . . . . . . 36
3.4 Other Applications . . . . . . . . . . . . . . . . . . . . . . . . . 38
3.4.1 Filtering . . . . . . . . . . . . . . . . . . . . . . . . . . . 38
3.4.2 Frame Rate Control Mode . . . . . . . . . . . . . . . . . 39
4 Architecture Design of Proposed Approximate Rendering Graphics
Pipeline 43
4.1 Architecture Overview . . . . . . . . . . . . . . . . . . . . . . . 43
4.2 Tile-based Architecture . . . . . . . . . . . . . . . . . . . . . . . 46
4.3 Repakcing and Sampling Table . . . . . . . . . . . . . . . . . . . 48
4.3.1 Repacking Unit . . . . . . . . . . . . . . . . . . . . . . . 48
4.3.2 Sampling Table . . . . . . . . . . . . . . . . . . . . . . . 50
4.3.3 Upsample Unit . . . . . . . . . . . . . . . . . . . . . . . 52
4.4 Error Estimation Unit . . . . . . . . . . . . . . . . . . . . . . . . 54
5 Experimental Results 57
5.1 Benchmark Images and Lighting Effects . . . . . . . . . . . . . . 59
5.2 Pixel Reduction Rate . . . . . . . . . . . . . . . . . . . . . . . . 60
5.3 Visual Quality . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62
5.3.1 Error Control . . . . . . . . . . . . . . . . . . . . . . . . 62
5.3.2 Comparing to Previous Work . . . . . . . . . . . . . . . . 76
5.3.3 Error Estimation with Different Precision . . . . . . . . . 79
Implementation of Low Power Graphics Processing Units for Mobile
Multimedia Applications 81
6.1 Architecture Overview . . . . . . . . . . . . . . . . . . . . . . . 82
6.2 Design Flow . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 82
6.3 Chip Layout and Specification . . . . . . . . . . . . . . . . . . . 85
6.4 Comparison . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 85
7 Conclusion 89
Reference 91

[1] Brian Wallace, “Ati radeon hd 2900 xt video card review,”
http://www.legitreviews.com/article/503/3/, May 2007.
[2] Kayvon Fatahalian, Edward Luong, Solomon Boulos, Kurt Akeley, William
R. Mark, and Pat Hanrahan, “Data-parallel rasterization of micropolygons
with defocus and motion blur,” in Proceedings of the Conference on High
Performance Graphics 2009, New York, NY, USA, 2009, HPG ’09, pp. 59–
68, ACM.
[3] Masatoshi Kameyama, Yoshiyuki Kato, Hitoshi Fujimoto, Hiroyasu
Negishi, Yukio Kodama, Yoshitsugu Inoue, and Hiroyuki Kawai, “3d graph-
ics lsi core for mobile phone ”z3d”,” in HWWS ’03: Proceedings of the ACM
SIGGRAPH/EUROGRAPHICS conference on Graphics hardware, Aire-la-
Ville, Switzerland, Switzerland, 2003, pp. 60–67, Eurographics Association.
[4] You-Ming Tsao, Chih-Hao Sun, Yu-Cheng Lin, Ka-Hang Lok, Chia-Jung
Hsu, Shao-Yi Chien, and Liang-Gee Chen, “A 26mw 6.4gflops multi-core
stream processor for mobile multimedia applications,” jun. 2008, pp. 24 –25.
[5] Magnus Ekman, Fredrik Warg, and Jim Nilsson, “An in-depth look at com-
puter performance growth,” SIGARCH Comput. Archit. News, vol. 33, no. 1,
pp. 144–147, 2005.
[6] Bren Mochocki, Kanishka Lahiri, and Srihari Cadambi, “Power analysis
of mobile 3d graphics,” in DATE ’06: Proceedings of the conference on Design, automation and test in Europe, 3001 Leuven, Belgium, Belgium,
2006, pp. 502–507, European Design and Automation Association.
[7] Diego Nehab, Pedro V. Sander, Jason Lawrence, Natalya Tatarchuk, and
John R. Isidoro, “Accelerating real-time shading with reverse reprojection
caching,” in GH ’07: Proceedings of the 22nd ACM SIGGRAPH/
EUROGRAPHICS symposium on Graphics hardware, Aire-la-
Ville, Switzerland, Switzerland, 2007, pp. 25–35, Eurographics Association.
[8] Pitchaya Sitthi-amorn, Jason Lawrence, Lei Yang, Pedro V. Sander, Diego
Nehab, and Jiahe Xi, “Automated reprojection-based pixel shader optimization,”
in SIGGRAPH Asia ’08: ACM SIGGRAPH Asia 2008 papers, New
York, NY, USA, 2008, pp. 1–11, ACM.
[9] Kyusik Chung, Chang-Hyo Yu, Donghyun Kim, and Lee-Sup Kim,
“Tessellation-enabled shader for a bandwidth-limited 3d graphics engine,”
sep. 2008, pp. 367 –370.
[10] Kayvon Fatahalian, Edward Luong, Solomon Boulos, Kurt Akeley, William
R. Mark, and Pat Hanrahan, “Data-parallel rasterization of micropolygons
with defocus and motion blur,” in HPG ’09: Proceedings of the Conference
on High Performance Graphics 2009, New York, NY, USA, 2009, pp. 59–
68, ACM.
[11] Cliff Woolley, David Luebke, Benjamin Watson, and Abhinav Dayal, “Interruptible
rendering,” in I3D ’03: Proceedings of the 2003 symposium on
Interactive 3D graphics, New York, NY, USA, 2003, pp. 143–151, ACM.
[12] Pedro V. Sander, Diego Nehab, and Joshua Barczak, “Fast triangle reordering
for vertex locality and reduced overdraw,” ACM Trans. Graph., vol. 26,
no. 3, pp. 89, 2007.
[13] Jon Hasselgren and Thomas Akenine-M¨oller, “Pcu: the programmable
culling unit,” ACM Trans. Graph., vol. 26, no. 3, pp. 92, 2007.
[14] Timo Aila, Ville Miettinen, and Petri Nordlund, “Delay streams for graphics
hardware,” ACM Trans. Graph., vol. 22, no. 3, pp. 792–800, 2003.
[15] Byeong-Gyu Nam, Jeabin Lee, Kwanho Kim, Seung Jin Lee, and Hoi-Jun
Yoo, “A 52.4mw 3d graphics processor with 141mvertices/s vertex shader
and 3 power domains of dynamic voltage and frequency scaling,” 2007, pp.
278 –603.
[16] Bren C. Mochocki, Kanishka Lahiri, Srihari Cadambi, and X. Sharon Hu,
“Signature-based workload estimation for mobile 3d graphics,” in DAC ’06:
Proceedings of the 43rd annual Design Automation Conference, New York,
NY, USA, 2006, pp. 592–597, ACM.
[17] Yan Gu and Samarjit Chakraborty, “Control theory-based dvs for interactive
3d games,” in DAC ’08: Proceedings of the 45th annual Design Automation
Conference, New York, NY, USA, 2008, pp. 740–745, ACM.
[18] Jack E. Volder, “The cordic trigonometric computing technique,” Electronic
Computers, IRE Transactions on, vol. EC-8, no. 3, pp. 330 –334, sep. 1959.
[19] Jeongseon Euh, Jeevan Chittamuru, and Wayne Burleson, “Power-aware 3d
computer graphics rendering,” J. VLSI Signal Process. Syst., vol. 39, no. 1-2,
pp. 15–33, 2005.
[20] John S. Montrym, Daniel R. Baum, David L. Dignam, and Christopher J.
Migdal, “Infinitereality: A real-time graphics system,” in Proceedings of
ACM SIGGRAPH, 1997, pp. 293–302.
[21] Lei Yang, Pedro V. Sander, and Jason Lawrence, “Geometry-aware framebuffer
level of detail,” Comput. Graph. Forum (Proc. of Eurographics Symposium
on Rendering 2008), pp. 1183–1188, 2008.
[22] MANDUCHI R. TOMASI C., “Bilateral filteringfor gray and color images,”
1998.
[23] Robert Herzog, Elmar Eisemann, Karol Myszkowski, and H.-P. Seidel,
“Spatio-temporal upsampling on the gpu,” 2010.
[24] Chih-Hao Sun, Ka-Hang Lok, You-Ming Tsao, Chia-Ming Chang, and
Shao-Yi Chien, “Cfu: multi-purpose configurable filtering unit for mobile
multimedia applications on graphics hardware,” in HPG ’09: Proceedings of
the Conference on High Performance Graphics 2009, New York, NY, USA,
2009, pp. 29–36, ACM.
[25] Emile Hsieh, Vladimir Pentkovski, and Thomas Piazza, “Zr: A 3d api transparent
technology for chunk rendering,” Microarchitecture, IEEE/ACM International
Symposium on, vol. 0, pp. 284, 2001.
[26] Jeong-Ho Woo, Ju-Ho Sohn, Hyejung Kim, Jongcheol Jeong, Euljoo Jeong,
Suk Joong Lee, and Hoi-Jun Yoo, “A 152mw mobile multimedia soc with
fully programmable 3d graphics and mpeg4/h.264/jpeg,” jun. 2007, pp. 220
–221.
[27] Jae-Sung Yoon, Donghyun Kim, Chang-Hyo Yu, and Lee-Sup Kim, “A 3d
graphics processor with fast 4d vector inner product units and power aware
texture cache,” sep. 2008, pp. 539 –542.

QRCODE
 
 
 
 
 
                                                                                                                                                                                                                                                                                                                                                                                                               
第一頁 上一頁 下一頁 最後一頁 top