|
Reference [1]Jeff Parkhurst, John Darringer, Bill Grundmann, “From Single Core to Multi-Core Preparing for a new exponential,” Proceedings of the IEEE/ACM International Conference on Computer-Aided Design, pp. 67-72, November 2006. [2]Eshel Haritan, Hiroyuki Yagi, Wayne Wolf, Toshihiro Hattori, Pierre Paulin, Achim Nohl, Drew Wingard, Mike Muller, “Multicore Design is the challenge! what is the solution?,” Proceedings of the 45th annual Design Automation Conference, pp. 128-130, June 2008. [3]Lei Chai, Qi Gao, Panda, D.K., “Understanding the Impact of Multi-Core Architecture in Cluster Computing: A Case Study with Intel Dual-Core System,“ Proceedings of the Seventh IEEE International Symposium on Cluster Computing and the Grid, pp. 471-478, May 2007. [4]T. Trawick, “Multicore communication: today and the future,” Embedded Computing Design, March 2007. [5]Baojun Qiao, Feng Shi, Weixing Ji, “A new Hierarchical Interconnection Network for Multi-core Processor,” Proceedings of the 2nd IEEE Conference on Industrial Electronics and Applications, pp. 246-250, May 2007. [6]Jesshope CR, Miller PR, Yantchev JT, “High Performance Communications in Processor Networks, Computer Architecture,” Proceedings of the 16th Annual International Symposium on Computer Architecture, pp. 150-157, June 1989. [7]Dietmar Tutsch and Gunter Hommel, “High Performance Low Cost Multicore NoC Architectures for Embedded Systems,” Proceedings of the International Workshop on Embedded Systems-Modeling, Technology and Applications, pp. 53-62, 2006. [8]Partha Pratim Pande, Cristian Grecu, Michael Jones, Andre’ Ivanov, and Resve Saleh, “Performance Evaluation and Design Trade-Offs For Network-on-Chip Interconnect Architectures,” IEEE Transactions on Computers, Vol. 54, Issue 8, pp. 1025-1040, August 2005. [9]Henrique C. Freitas and Philippe O. A. Navaux , “A high throughput multi cluster noc architecture,” Proceedings of the IEEE 11th International Conference on Computational Science and Engineering, pp. 56-63, July 2008. [10]Lei Chai, Albert Hartono, Dhabaleswar K. Panda, “Designing High Performance and Scalable MPI Intra-node Communication Support for Clusters,” Proceedings of the IEEE International Conference on Cluster Computing, pp. 1-10, September 2006. [11]Marek Tudruj, Lukasz Masko, “Dynamic SMP Clusters with Communication on the Fly in NoC Technology for Very Fine, Parallel and Distributed Computing,” Proceedings of the 3rd International Symposium on Parallel and Distributed Computing, pp. 97-104, July 2004. [12]Reetuparna Das, Asit K. Mishra, Chrysostomos Nicopoulos, Dongkook Park, Vijaykrishnan Narayanan, Ravishankar Iyer, Mazin S. Yousif, Chita R. Das, “Performance and Power Optimization through Data Compression in Network-on-Chip Architectures,” Proceedings of the IEEE 14th International Symposium on High Performance Computer Architecture, pp. 215-225, February 2008. [13]Hsiang-Ning Liu, Yu-Jen Huang, Jin-Fu Li, “Memory built-in self test in multicore chips with mesh-based networks,” IEEE Micro, Vol. 29, Issue 5, pp. 46-55, September 2009. [14]D. Chandra, F. Guo, S. Kim and Y. Solihin, “Predicting inter-thread cache contention on a chip multiprocessor architecture,” Proceedings of the 11th International Symposium on High Performance Computer Architecture, pp. 340 - 351, February 2005 [15]L. Hsu, S. Reinhardt, R. Iyer and S. Makineni, “Communist, Utilitarian, and Capitalist Cache Policies on CMPs: Caches as a Shared Resource,” Proceedings of the 15th international conference on Parallel architectures and compilation techniques, pp.13-22, September 2006 [16]R. Iyer, “CQoS: A Framework for Enabling QoS in Shared Caches of CMP Platforms,” Proceedings of the 18th annual international conference on Supercomputing, pp. 257 - 266 , July 2004 [17]C. Kim, D. Burger, S. W. Keckler, “Nonuniform Cache Architectures for Wire-Delay Dominated On-Chip Caches,” IEEE Micro, Vol. 23, Issue 6, pp. 99-107, November 2003 [18]Fei Guo, Hari Kannan, Li Zhao, Ramesh Illikkal, Ravi Iyer, Don Newell, Yan Solihin, and Christos Kozyrakis, “From Chaos to QoS: Case Studies in CMP Resource Management,” SIGARCH Computer Architecture News, Vol. 35, pp. 21-30, March 2007 [19]S. Kim, D. Chandra, and Y. Solihin, “Fair Cache Sharing and Partitioning in a Chip Multiprocessor Architecture,” Proceedings of 13th International Conference on Parallel Architecture and Compilation Techniques , pp. 111 - 122, October 2004 [20]C. Liu, A. Sivasubramaniam, M. Kandemir, “Organizing the Last Line of Defense before Hitting the Memory Wall for CMPs,” Proceedings of the 10th International Symposium on High Performance Computer Architecture, pp.176 - 185, February 2004 [21]Yang Ding, Mahmut Kandemir, Padma Raghavan, Mary Jane Irwin, “A helper thread based edp reduction scheme for adapting application execution in cmps,” Proceedings of IEEE International Symposium on Parallel and Distributed Processing, pp.1-14, April 2008 [22]J. Chang and G. S. Sohi, “Cooperative cache partitioning for chip multiprocessors,” Proceedings of the 21st annual international conference on Supercomputing, pp. 242 - 252, June 2007 [23]Fei Guo, Yan Solihin, Li Zhao, Ravishankar Iyer, “A framework for providing quality of service in chip multi-processors,” Proceedings of the 40th Annual IEEE/ACM International Symposium on Microarchitecture, pp. 343-355, December 2007 [24]Bong-Jun Ko, Kang-Won Lee, Khalil Amiri, Seraphin Calo, “Scalable service differentiation in a shared storage cache,” Proceedings of the 23rd International Conference on Distributed Computing Systems, pp.184-193, May 2003. [25]M. K. Qureshi and Y. N. Patt, “Utility-based cache partitioning: A low-overhead, high-performance, runtime mechanism to partition shared caches,” Proceedings of the 39th Annual IEEE/ACM International Symposium on Microarchitecture, pp. 423-432, December 2006 [26]Nauman Rafique, Won-Taek Lim, Mithuna Thottethodi, “Architectural support for operating system-driven CMP cache management,” Proceedings of the 15th international conference on Parallel architectures and compilation techniques, pp.2-12, September 2006. [27]G. E. Suh, L. Rudolph, S. Devadas, “Dynamic partitioning of shared cache memory,” The Journal of Supercomputing, Vol. 28, Issue 1, April 2004 [28]Kagi, A., Goodman J.R., Burger, D., “Memory bandwidth limitations of future microprocessors,” Proceedings of the 23rd annual international symposium on Computer architecture, pp.78-89, May 1996 [29]D. Kaseridis, J. Stuecheli, J. Chen, and L.K. John, “A bandwidth-aware memory-subsystem resource management using non-invasive resource profilers for large cmp systems,” Proceedings of the 16th International Symposium on In High Performance Computer Architecture, pp.1-11, January 2010. [30]F. Liu, X. Jiang, and Y. Solihin, “Understanding how off-chip memory bandwidth partitioning in chip multiprocessors affects system performance,” Proceedings of the 16th International Symposium on In High Performance Computer Architecture, pp.1-12, January 2010. [31]Coutinho L. M., Mendes J. L., Martins C. A., “Dynamically Reconfigurable Split Cache Architecture,” Proceedings of the 2008 International Conference on Reconfigurable Computing and FPGAs, pp.163-168, December 2008 [32]M. B. Carvalho, L. F. W. Goes and C.A.P.S. Martins, “Dynamically reconfigurable cache architecture using adaptive block allocation policy,” Proceedings of the 20th international conference on Parallel and distributed processing Symposium, pp. 25-29, April 2006 [33]Kaseridis, D., Stuecheli, J., John, L.K., “Bank-aware Dynamic Cache Partitioning for Multicore Architectures,” Proceedings of the 2009 International Conference on Parallel Processing, pp.18-25, September 2009 [34]M.T. Kandemir, T. Yemliha, and E. Kultursay, “A helper thread based dynamic cache partitioning scheme for multithreaded applications,” Proceedings of the 48th Design Automation Conference, pp.954-959, June 2011 [35]Sai Prashanth Muralidhara, Mahmut Kandemir, Padma Raghavan, “Intra-application shared cache partitioning for multithreaded applications,” ACM SIGPLAN Notices, Vol. 45, Issue 5, May 2010 [36]http://www.windriver.com/products/simics/, April 2012 [37]S. C. Woo, M. Ohara, E. Torrie, J. P. Singh, and A. Gupta, “The SPLASH-2 programs: Characterization and methodological considerations,” ACM SIGARCH Computer Architecture News - Special Issue: Proceedings of the 22nd annual international symposium on Computer architecture, Vol. 23, Issue 2, June 1995
|