跳到主要內容

臺灣博碩士論文加值系統

(18.97.14.87) 您好!臺灣時間:2025/01/14 04:26
字體大小: 字級放大   字級縮小   預設字形  
回查詢結果 :::

詳目顯示

: 
twitterline
研究生:沈日昇
研究生(外文):Jih-sheng Shen
論文名稱:應用於低(工力)率嵌入式系統之十字型匯流排溝通架構
論文名稱(外文):An On-Chip Crossroad Communication Architecture for Low Power Embedded Systems
指導教授:陳添福陳添福引用關係
指導教授(外文):Tien-fu Chen
學位類別:碩士
校院名稱:國立中正大學
系所名稱:資訊工程所
學門:工程學門
學類:電資工程學類
論文種類:學術論文
論文出版年:2005
畢業學年度:93
語文別:英文
論文頁數:65
中文關鍵詞:匯流排架構晶片系統
外文關鍵詞:busNOCsystem on chipcommunication architecture
相關次數:
  • 被引用被引用:0
  • 點閱點閱:680
  • 評分評分:
  • 下載下載:13
  • 收藏至我的研究室書目清單書目收藏:0
隨著晶片所能容納的元件越來越多,因為本身的溝通架構(communication
architecture)所消耗掉的(工力)率;已經佔了整體所能消耗的(工力)率裡相當大的比率。在晶
片系統裡有著各式各樣不同的組成元件,而他們在(工力)能及溝通的需求上都不相
同,所以,設計一個溝通拓撲(communication topology)時,就應該針對這些元件
不同的溝通情況以及工作負載來做最佳化的調整設計。在本篇論文裡,我提出一
個創新的連結架構,這個架構可以利用十字型轉換開關動態的建構出兩個元件中
一個專屬的溝通路徑。我將會介紹針對不同的應用,根據profile的一些溝通特
徵,如何去建構出匯流排溝通架構的設計策略。根據應用的溝通情況,我設計了
兩種匯流排操作模式。第一種模式屬於動態執行仲裁(arbitration)的模式,這就是
一般當各個元件組合到匯流排上時,十字型轉換開關將會根據每個元件所發出的
request加以判斷,並決定由哪一個元件得以傳輸資料。第二種模式則是預先決定
十字型轉換開關所開啟的方向(路徑),透過這種預先決定的方式,固定了十字型
轉換開關裡仲裁的動作,也就是說,不需再經過仲裁的動作,藉此省下執行仲裁
動作所消耗的(工力)率;。我使用MPEG4 decoder和JPEG的工作負載來做實驗,而實驗
結果顯示出,如果我們可以知道嵌入式軟體的應用,我們將可以善加控制十字型
轉換開關以及產生適當的匯流排的架構以得到較佳的效能及較低的(工力)率消耗。
As the number of cores on a chip increases, power consumed by the communication structures takes significant portion of the overall power-budget. The individual components of the SoCs will be heterogeneous in nature with widely varying functionality and communication requirements. The communication topology should optimally match communication workflows among these components. In this paper, we first propose a novel interconnect
architecture, which uses crossroad switches to dynamically construct a dedicated communication path between any two cores. We then present a design methodology for constructing network on chip (NoC) for application-specific computer systems with profiled communication characteristics. We design two application-specific bus operation schemes. The first scheme is dynamically-controlled arbitration, where a core placement tool automatically
maps the cores to communication topologies such that the bus is shared and controlled at run time and the total communication energy can be minimized. The second scheme is the pre-determined control assignment for switches. Each switch may operate in a ”lease line” mode, which can dynamically offer a dedicated path between two high-communicative cores for a specific period according to the application characteristics. We take the MPEG4 decoder
and JPEG as our case studies, and experimental results show the power consumptions can be saved if we organize cores carefully and dynamically control NoC switches when the behavior of the embedded software is well-known.
1 Introduction 1
1.1 INTRODUCTION . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
2 Background and Related Work 7
2.1 Related Work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
3 Crossroad Bus Architecture 18
3.1 Overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
3.2 The Proposed Architecture . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
3.3 Segmenting a bus for power optimization . . . . . . . . . . . . . . . . . . . . 21
3.4 Bus Control and Arbitration . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
3.4.1 Self-routing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
3.4.2 Analysis of delay . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28
3.5 Features . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
3.5.1 Fully Configurable . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
3.5.2 Power Optimization by Localization . . . . . . . . . . . . . . . . . . . 33
3.5.3 Better Communication Parallelism . . . . . . . . . . . . . . . . . . . 33
4 Strategies on Crossroad Bus Architecture 35
4.1 Tailoring Crossroad Bus Architecture to Application-Specific SoC . . . . . . 35
4.2 Dynamic Switch Mode Assignment for Temporary Lease Path . . . . . . . . 37
4.2.1 Pre-determined Mode of the Switch . . . . . . . . . . . . . . . . . . . 37
4.2.2 Mode Selection for Each Switch . . . . . . . . . . . . . . . . . . . . . 39
5 Experiment and Performance Evaluation ................................42
5.1 Evaluation Flow . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42
5.2 Evaluation Results of Bus Architectures . . . . . . . . . . . . . . . . . . . . 45
5.3 Effects of Placement Policies . . . . . . . . . . . . . . . . . . . . . . . . . . . 47
5.4 Effects of the Bus Configurable Controller and Pre-determined Mode Selection ................................................................54
6 Conclusion and Future Work 61
[1] Neal K. Bambha and Shuvra S. Bhattacharyya. Joint application mapping/interconnect
synthesis techniques for embedded chip-scale multiprocessors. IEEE Transaction on
Parallel and Distributed Systems, 16(2):99–112, February 2005.
[2] Nilanjan Banerjee, Praveen Vellanki, and Karam S. Chatha. A power and performance
model for network-on-chip architectures. In Proceedings of the Design, Automation and
Test in Europe Conference and Exhibition, 2004.
[3] A. Bellaouar, I. Abu-Khater, and M. I. Elmastry. An ultra-low-power cmos on-chip
interconnect architecture. In Symposium on Low Power Electronics. Digest of Technical
Papers, pages 52–53, October 1995.
[4] L. Benini and G. De Micheli. Networks on chips: A new soc paradigm. Computer, pages
70–78, January 2002.
[5] D. Bertozzi, A. Jalabert, Srinivasan Murali, R. Tamhankar, S. Stergiou, L. Benini,
and G. De Micheli. Noc synthesis flow for customized domain specific multiprocessor
systems-on-chip. IEEE Transaction on Parallel and Distributed Systems, 16(2):113–129,
February 2005.
[6] Doug Burger and Todd M. Austin. The simplescalar tool set, version 2.0.
www.simplescalar.com, June 1997.
[7] J. Y. Chen, W. B. Jone, J. S. Wang, H. I. Lu, and T. F. Chen. Segmented bus design
for low-power systems. IEEE Transactions on VLSI Systems, 7(1):25–29, March 1999.
[8] W.J. Dally and B. Towles. Route packets, not wires: On-chip interconnection networks.
In Proceedings of Design and Automation Conference DAC 2001, pages 684–689, June
2001.
[9] E.B. Van der Tol and E.G.T. Jaspers. Mapping of mepg-4 decoding on a flexible architecture
platform. In SPIE2002, pages 1–13, January 2002.
[10] K. Goossens, J. Dielissen, O.P. Gangwal, S.G. Pestana, A. Radulescu, and E Rijpkema.
A design flow for application-specific networks on chip with guaranteed performance to
accelerate soc design and verification. In Proceedings of the Design, Automation and
Test in Europe Conference and Exhibition, pages 1182–1187, 2005.
[11] Samuelsson Henrik and Kumar Shashi. Ring road noc architecture. In Proceedings of
Norchip Conference, November 2004.
[12] Cheng-Ta Hsieh and Massoud Pedram. Architectural energy optimization by bus splitting.
IEEE Transactions on Computer-Aided Design on Integrated Circuits and Systems,
21(4):408–414, April 2002.
[13] J. Hu and R. Marculescu. Energy-aware mapping for tile-based noc architectures under
performance constraints. In Proceedings of Asia and South Pacific Design Automation
Conference, pages 233–23, January 2003.
[14] J. Hu and R. Marculescu. Exploiting the routing flexibility for energy/performance
aware mapping of regular noc architectures. In Proceedings of DATE Conference, March
2003.
[15] S. Ravi J. Chang and A. Raghunathan. Flexbar: a crossbar switching fabric with
improved performance and utilization. Proc. IEEE CICC, pages 405–408, 2002.
[16] E.G.T. Jaspers and P.H.N. de With. Chip-set for video display of multimedia information.
IEEE Transaction on Consumer Electronics, 45(3):707–716, August 1999.
[17] W. B. Jone, J. S. Wang, H. I. Lu, I. P. Hsu, and J.Y. Chen. Design theory and
implementation for low-power segmented bus systems. ACM Transactions on Design
Automation of Electronic Systems, 8(1):38–54, January 2003.
[18] Tang Lei and Kumar S. Optimizing network on chip architecture size for applications.
In 5th International Conference on ASIC, October 2003.
[19] P. Lettieri and M. B. Srivastava. A qos-aware, energy-efficient wireless node architecture.
IEEE International Workshop on Mobile Multimedia Communications, pages 252–261,
November 1999.
[20] R. Mehra, L. M. Guerra, and J. M. Rabaey. A partitioning scheme for optimizing
interconnect power. IEEE Journal of Solid-State Circuit, 22:433–443, March 1997.
[21] S. Murali and G. De Micheli. Bandwidth constrained mapping of cores onto noc architectures.
In Proceedings of Conference DATE, 2004.
[22] S. Murali and G. De Micheli. Sunmap: A tool for automatic topology selection and
generation for nocs. In Proceedings of Design Automation Conference, 2004.
[23] Juha Plosila, Tiberiu Seceleanu, and Pasi Liljeberg. Implementation of a self-timed
segmented bus. IEEE Journals on Design and Test of Computers, 20(6):44–50, 2003.
[24] Vijay Raghunathan, Mani B. Srivastava, and Rajesh K. Gupta. A survey of techniques
for energy efficient on-chip communication. In Proc. ACM/IEEE Design Automation
Conference, pages 900–905, June 2003.
[25] Silicore Corporation. WISHBONE System-On-Chip Interconnection Architecture for
Portable IP Cores, 2001.
[26] T. Ye, L. Benini, and G. De Micheli. Analysis of power consumption on switch fabrics
in network routers. Proc. ACM/IEEE Design Automation Conference, pages 524–529,
2002.
QRCODE
 
 
 
 
 
                                                                                                                                                                                                                                                                                                                                                                                                               
第一頁 上一頁 下一頁 最後一頁 top