跳到主要內容

臺灣博碩士論文加值系統

(216.73.216.215) 您好!臺灣時間:2025/11/26 09:16
字體大小: 字級放大   字級縮小   預設字形  
回查詢結果 :::

詳目顯示

我願授權國圖
: 
twitterline
研究生:鄧翔升
研究生(外文):Hsiang-Sheng Teng
論文名稱:適用於多媒體應用之多執行緒數位訊號協同處理器
論文名稱(外文):Multithreaded DSP Coprocessor for Multimedia Applications
指導教授:劉志尉
指導教授(外文):Chih-Wei Liu
學位類別:碩士
校院名稱:國立交通大學
系所名稱:電子工程系所
學門:工程學門
學類:電資工程學類
論文種類:學術論文
論文出版年:2007
畢業學年度:96
語文別:中文
論文頁數:67
中文關鍵詞:多執行緒協同處理器界面
外文關鍵詞:MultithreadingCoprocessor Interface
相關次數:
  • 被引用被引用:0
  • 點閱點閱:320
  • 評分評分:
  • 下載下載:0
  • 收藏至我的研究室書目清單書目收藏:0
針對多樣且日益複雜的嵌入式系統應用,近年來興起的多核心架構平台(Multi-Core Platform),已被視為能以合理的成本滿足高運算需求的方案之一。其中以一微處理器單元(MPU)搭配一數位訊號處理器(DSP)的異質性雙核心處理器平台(Heterogeneous Dual-Core Platform),是目前行動多媒體嵌入式系統的主流。在異質雙核心處理器平台中,微處理器單元將具有高運算需求的工作(Task),交由數位訊號處理器作加速,藉此可達到有效率的分工並降低系統的耗能; 然而,繁複且不具效率的工作管理(Task Management),包括工作排程(Task Schedule)、工作指派(Task Dispatch)等,會因為處理器間的互動與複雜的處理器間的溝通(Inter-Processor Communication, IPC),造成異質性雙核心處理器平台的效能大為降低。為提升數位訊號處理器的效能,本論文提出一適用於多媒體應用之多執行緒數位訊號協同處理器,該協同處理器包括一負責與微處理器單元溝通、協調之精巧協同處理器介面(Smart Coprocessor Interface, SCI),以及具零管線延遲(Zero Pipeline Latency)特性之交錯式多執行緒(Interleaved Multithreaded)數位訊號處理器。所提之精巧協同處理器界面,可動態管理工作分派,並可有效的解決處理器間的溝通問題。為評估精巧協同處理器界面對數位訊號處理器效能的影響,本論文利用CoWare電子系統層級(ESL)設計方法,建構一雙核心處理器虛擬平台(Virtual Platform)。以一張256x256 JPEG壓縮為例,所提之精巧協同處理器界面,分別較傳統以微處理器單元透過作業系統(如Linux)、或在數位訊號處理器上執行微核心(Micro-Kernel或 RTOS)進行兩處理器間的工作管理,減少約68% 以及15 %的執行時間。另一方面,所提之數位訊號處理器的資料路徑為一加法器(A)、一乘法器(M)、一累加器(A)、與一移位器(S)的串接(Cascade)組合。此AMAS複雜資料路徑(Composite Datapath)可執行加乘(AM), 乘加(MA), 加-乘-累加-位移(AMAS)等多媒體應用常見的運算。最後,我們利用TSMC 0.13μm CMOS 製程,實做所提出之數位訊號協同處理器,其操作頻率為250MHz,平均消耗40mW功率。該晶片面積為2.7 x 2.7 mm2,而精巧協同處理器界面僅佔約0.65%的晶片面積。
Due to task divergence in most embedded systems, heterogeneous dual-core/multi-core SoC, i.e. RISC+DSP, is accepted as a cost-effective solution for the increasing computation demands in mobile media applications. TI Davinci, for example, is one popular dual-core platform, where the DSP, as the slave or coprocessor, performs the computation-intensive task sent and requested by the host processor (i.e. RISC). However, inefficient DSP-task management, such as task scheduling and task dispatch, induces inter-processor communication (IPC) overheads and thus lowers the DSP performance. In order to improve DSP performance, a multithreaded DSP coprocessor for multimedia applications is proposed in this thesis. The DSP coprocessor consists of a smart coprocessor interface (SCI) and an application-specific interleaved multithreaded datapath. The proposed SCI can dynamically manage DSP-tasks and effectively reduce IPC overheads, while the interleaved multithreaded datapath can exploit thread-level parallelism (TLP) to tolerate full pipeline latency. A dual-core SystemC transaction-level virtual platform, constructed by CoWare electronic system level (ESL) design platform, is used to evaluate the DSP performance affected by the SCI. To encode a 256x256 JPEG image, the SCI reduces 68% and 15% total execution time comparing that with task management on MPU supported by OS and on DSP supported by a μ-kernel or RTOS, respectively. On the other hand, a cascaded adder, multiplier, accumulator, and shifter (A-M-A-S) functional units are applied on the proposed DSP datapath. This composite datapath is able to perform complicated operations, such as addition-multiplication (AM), multiplication-accumulation (MA), and addition-multiplication-accumulation-shift (AMAS) for multimedia applications. The proposed multithreaded DSP coprocessor is implemented by TSMC 0.13μm CMOS technology. The implementation results show that it can operate at 250MHz with 40mW power consumption. The chip area is 2.7x 2.7mm2 and the SCI occupies only 0.65% of the total chip size
1 INTRODUCTION 1
1.1 INTER-PROCESSOR COMMUNICATION (IPC) OVERHEAD 2
1.2 PIPELINE LATENCY 4
1.3 THESIS ORGANIZATION 7
2 DUAL-CORE SOFTWARE ARCHITECTURE 9
2.1 TASK MANAGEMENT ON MPU 9
2.2 TASK MANAGEMENT ON DSP 12
3 SMART COPROCESSOR INTERFACE 15
3.1 TASK MANAGEMENT 16
3.1.1 Data Structure for Task Management 19
3.1.2 Task Scheduling Policy 21
3.1.3 Task Loading Mechanism 23
3.2 MEMORY MANAGEMENT 26
3.3 INTER-PROCESSOR COMMUNICATION 29
4 MULTITHREADED DATAPATH 33
4.1 MULTITHREADED ARCHITECTURES 33
4.2 COMPOSITE DATAPATH 38
5 EXPERIMENT & IMPLEMENTATION RESULTS 43
5.1 SIMULATION PLATFORM 43
5.2 SIMULATION SCENARIO 45
5.3 SIMULATION & ESTIMATION RESULTS 46
5.4 IMPLEMENTATION RESULTS 50
5.4.1 Microarchitecture of Proposed SCI 53
5.4.2 Microarchitecture of the Proposed DSP Coprocessor 56
5.4.3 Silicon Implementation 61
6 CONCLUSION AND FUTURE WORKS 63
[1] DaVinci Technology Background and Specification, Texas Instruments, Jan. 2007. Available online: http://focus.ti.com/lit/an/sprt401a/sprt401a.pdf
[2] OMAP Platform Processors, Software and Support, Texas Instruments. Available online: http://focus.ti.com/general/docs/wtbu/wtbugencontent.tsp?templateId=6123&navigationId=11988&contentId=4638
[3] DSPLinux, http://www.cadenux.com/ridgerun
[4] SangBae Lee, "Analysis of interrupt entry latency in Linux 2.4 vs 2.6", presented at CELF Embedded Linux Conference, 2007
[5] Shih-Hao Ou, Tay-Jyi Lin, et al., "Multithreaded coprocessor interface for multi-core multimedia SoC," ASPDAC, Seoul, January 2008
[6] Linux DSP Gateway Specification, rev. 3.3.1, Nokia, Sep. 2006. Available online: http://dspgateway.sourceforge.net/pub/3.3.1/DSP_Gateway331_spec_a.pdf
[7] DSP/BIOS Timing Benchmarks for Code Composer Studio v2.2. Application Report SPRA900B, Texas Instruments, Inc., April, 2004.
[8] J. L. Hennessy and D. A. Patterson, Computer Architecture – Quantitative Approach, 3rd Edition, Morgan Kaufmann, 2002
[9] TMS320 DSP/BIOS User Guide, Texas Instruments, Inc., Nov, 2001.
[10] E. A. Lee and T. M. Pards, “Dataflow process networks,” in Proc. IEEE, vol83, 1995.
[11] Gregory T. Byrd and Mark A. Holliday, “Multithread processor architectures,” IEEE Sp, August 1995, pp. 38-46.
[12] Theo Ungerer, Borut Robic, and Jurij ilc, “A survey of Processors with explicit multithreading,” ACM Computing Surveys, 2003.
[13] J. S. Kowalik, on Parallel MIMD Computation: HEP Supercomputer and Its Applications, MIT Press, Cambridge, 1985, pp. 41–55.
[14] ALVERSON R., CALLAHAN D., and CUMMINGS D. et al., "The Tera computer system". Proceedings of the 4th International Conference on Supercomputing, 1990, pp. 1-6.
[15] Poonacha Kongetira, Kathirgamar Aingaran, and Kunle Olukotun,"Niagara: A 32-Way Multithreaded Sparc Processor", IEEE Micro, v.25 n.2, 2005, pp. 21-29.
[16] AGARWAL A., KUBIATOWICZ J., and KRANZ D. et al.,"Sparcle: an evolutionary processor design for large-scale multiprocessors". IEEE Micro 13, 3, 1993, pp. 48–61.
[17] MIKSCHL A. and DAMM W. "Msparc: a multithreaded Sparc", Lecture Notes in Computer Science, vol. 1123, 1996, pp. 461–469.
[18] Yaniv Pessach,"Make It Snappy: Juice Up Your App with the Power of Hyper-Threading", MSDN Magazine, June 2005.
[19] Gero Dittmann,“Organizing pattern libraries for ASIP design", IBM Research Report RZ3488, 2003.
[20] K. K. Parhi, VLSI Digital Signal Processing Systems – Design and Implementation, John Wiley & Sons, 1999.
[21] CoWare, Inc. http://www.coware.com
[22] μC/OS-II, http://www.micrium.com/products/rtos/kernel/rtos.html
[23] Valentine, B. Sohm, O. "Optimizing the JPEG2000 binary arithmetic encoder for VLIW architectures " ICASSP, May 2004
[24] B. Flachs et al., “The Microarchitecture of the Synergistic Processing Unit of Cell Architecture Processor”, ISSCC 2005, pp. 134-135
[25] Hao-I Yang, Ming-Hung Chang, et al,. “A Low-Power Low-Swing Single-Ended Multi-Port SRAM”, VLSI DAT 2007.
[26] M. Wroblewski, M. Mueller, A. Wortmann, S. Simon, W. Pieper, and J. A. Nossek, “A power efficient register file architecture using master latch sharing,” in Proc. ISCAS, May 2003
[27] McNutt G., and Fischer T., “Using Linux to Control DSP Processor in Mixed-Processor Systems”, Embedded Edge, Issue 4, pp 8-13, October, 2001.
QRCODE
 
 
 
 
 
                                                                                                                                                                                                                                                                                                                                                                                                               
第一頁 上一頁 下一頁 最後一頁 top
無相關論文
 
1. 11.柯承恩、葉銀華 (1999),「中小企業融資問題與誘因型契約之實證研究」,產業管理學報,第一卷,第一期,pp.41-63
2. 8.林修葳、蘇敏賢(2006),「Merton模型預測違約之使用限制探索」,金融風險管理季刊,第二卷,第三期,pp.65-87
3. 7.沈中華、黃博怡(2006),「台灣中小企業產業別信用風險模型」,中小企業發展季刊,創刊號,pp.1-34
4. 4.沈中華、張家華(2005),「產業違約率及景氣循環」,金融風險管理季刊,第一卷,第四期,pp.91-105
5. 2.朱正中(2006),「建立產業信用風險預測指標之研究」,金融風險管理季刊,第二卷,第二期,pp.91-98
6. 1.王金利、程文田(2000),「台灣中小企業融資問題之探索」,中華戰略學刊,pp.51-102
7. 13.郭瑞基、蔡敏華(2003),「風險評估報告與銀行授信決策之分析」,當代會計期刊,第四卷第一期,pp.57-76
8. 17.葉仕國、張庭樹 (2005),「台灣地區上市櫃公司違約機率之衡量與調整」,金融風險管理季刊,第一卷,第四期,pp.1-17
9. 18.黃博怡、張大成、江欣怡 (2006),「考慮總體經濟因素之企業危機預警模型」,金融風險管理季刊,第二卷,第二期,pp.75-89
10. 19.黃愷婷、楊素柳(2005),「銀行授信風險管理架構之探討」,真理財經學報,pp.93-150
11. 21.曾信超、黃新宗 (2005),「中小企業融資授信實務與風險轉嫁之實務探討」,華人前瞻研究,第一卷,第二期,pp.43-64
12. 22.楊蓁海 (2005),「我國銀行授信行為與景氣循環的關係:兼論新版巴塞爾資本協定的順循環影響效果暨其因應之道」,中央銀行季刊第二十八卷第一期,pp.43-72
13. 23.廖咸興、陳宗岡 (2004),「多期企業短期信用風險評估模型」,金融風險管理季刊,第一卷,第一期,pp.61-86