|
[1]T. Mück and A. Frohlich, A run-time memory management approach for scratch-pad-based embedded systems, in Proc. ETFA, 2010, pp. 1-4. [2]M. Verma and P. Marwedel, Overlay techniques for scratchpad memories in low power embedded processors, IEEE Trans. Very Large Scale Integration (VLSI) Systems, vol. 14, pp. 802-815, 2006. [3]R. Banakar, S. Steinke, B. S. Lee, M. Balakrishnan, and P. Marwedel, Scratchpad memory: a design alternative for cache on-chip memory in embedded systems, in Proc. ACM CODES, 2002, pp. 73-78. [4]M. Kandemir, I. Kadayif, A. Choudhary, J. Ramanujam, and I. Kolcu, Compiler-directed scratch pad memory optimization for embedded multiprocessors, IEEE Trans. Very Large Scale Integration (VLSI) Systems, vol. 12, pp. 281-287, 2004. [5]L. Xue, F. Li, M. Kandemir, and I. Kolcu, Dynamic Partitioning of Processing and Memory Resources in Embedded MPSoC Architectures, in Proc. IEEE DATE, 2006, pp. 690-695. [6]M. Kandemir, O. Ozturk, and M. Karakoy, Dynamic on-chip memory management for chip multiprocessors, in Proc. ACM CASES, 2004, pp. 14-23. [7]H. Takase, H. Tomiyama, and H. Takada, Partitioning and allocation of scratch-pad memory for priority-based preemptive multi-task systems, in Proc. IEEE DATE, 2010, pp. 1124-1129. [8]D. Chang, I. Lin, Y. Chien, C. Lin, A. Su, and C. Young, CASA: Contention-Aware Scratchpad Memory Allocation for Online Hybrid On-Chip Memory Management, IEEE Trans. Computer-Aided Design of Integrated Circuits and Systems, vol. 33, pp. 1806-1817, 2014. [9]W. Ji, N. Deng, F. Shi, Q. Zuo, and J. Li, Dynamic and adaptive SPM management for a multi-task environment, in Journal of Systems Architecture, vol. 57, pp. 181-192, 2011. [10]B. Egger, J. Lee, and H. Shin, Scratchpad memory management in a multitasking environment, in Proc. ACM EMSOFT, 2008, pp. 265-274. [11][Z. H. Chen and A. W. Su, A hardware/software framework for instruction and data scratchpad memory allocation, ACM Trans. TACO, vol. 7, p. 2, 2010. [12]P. R. Panda, N. D. Dutt, and A. Nicolau, Efficient utilization of scratch-pad memory in embedded processor applications, in Proc. IEEE European conference on Design and Test, 1997, pp. 7-11. [13]S. Steinke, L. Wehmeyer, B. S. Lee, and P. Marwedel, Assigning program and data objects to scratchpad for energy reduction, in Proc. IEEE DATE, 2002, pp. 409-415. [14]O. Avissar, R. Barua, and D. Stewart, An optimal memory allocation scheme for scratch-pad-based embedded systems, ACM Trans. TECS, vol. 1, pp. 6-26, 2002. [15]S. Meftali, F. Gharsalli, F. Rousseau, and A. A. Jerraya, An optimal memory allocation for application-specific multiprocessor system-on-chip, in Proc. ACM ISSS, 2001, pp. 19-24. [16]V. Suhendra, C. Raghavan, and T. Mitra, Integrated scratchpad memory optimization and task scheduling for MPSoC architectures, in Proc. ACM CASES, 2006, pp. 401-410. [17]S. Chattopadhyay and A. Roychoudhury, Static bus schedule aware scratchpad allocation in multiprocessors, in Proc. ACM LCTES, 2011, pp. 11-20. [18]J. Hu, C. J. Xue, Q. Zhuge, W. C. Tseng, and E. M. Sha, Towards energy efficient hybrid on-chip scratch pad memory with non-volatile memory, in Proc. IEEE DATE, 2011, pp. 1-6. [19]L. A. Bathen and N. Dutt, HaVOC: A hybrid memory-aware virtualization layer for on-chip distributed ScratchPad and Non-Volatile Memories, in Proc. ACM DAC, 2012, pp. 447-452. [20]M. Kandemir, J. Ramanujam, and A. Choudhary, Exploiting shared scratch pad memory space in embedded multiprocessor systems, in Proc. ACM DAC, 2002, pp. 219-224. [21]M. Kandemir, J. Ramanujam, J. Irwin, N. Vijaykrishnan, I. Kadayif, and A. Parikh, Dynamic management of scratch-pad memory space, in Proc. ACM DAC, 2001, pp. 690-695. [22]L. A. D. Bathen, N. D. Dutt, D. Shin, and S. S. Lim, SPMVisor: dynamic scratchpad memory virtualization for secure, low power, and high performance distributed on-chip memories, in Proc. CODES+ISSS, 2011, pp. 79-88. [23]A. Marongiu and L. Benini, Efficient OpenMP support and extensions for MPSoCs with explicitly managed memory hierarchy, in Proc. IEEE DATE, 2009, pp. 809-814. [24]Y. Guo, Q. Zhuge, J. Hu, J. Yi, M. Qiu, and E. M. Sha, Data Placement and Duplication for Embedded Multicore Systems With Scratch Pad Memory, IEEE Trans. Computer-Aided Design of Integrated Circuits and Systems, vol. 32, pp. 809-817, 2013. [25]W. Hu, G. Wang, J. Chen, X. Lou, and T. Chen, Efficient scratchpad memory management based on multi-thread for MPSoC architecture, in Proc. Scalable Computing and Communications, 2009, pp. 429-434. [26]J. Paul, W. Stechele, M. Kroehnert, and T. Asfour, Improving Efficiency of Embedded Multi-core Platforms with Scratchpad Memories, in Proc. VDE ARCS, 2014, pp. 1-8. [27]A. Marongiu and L. Benini, An OpenMP compiler for efficient use of distributed scratchpad memory in MPSoCs, IEEE Trans. Computers, vol. 61, pp. 222-236, 2012. [28]Y. Etsion and D. G. Feitelson, L1 cache filtering through random selection of memory references, in Proc. Parallel Architecture and Compilation Techniques, 2007, pp. 235-244. [29]D. Burger and T. M. Austin, The SimpleScalar tool set, version 2.0, SIGARCH Comput. Archit. News 25, 3, 1997, pp.13-25.. [30]C.-C. Huang and V. Nagarajan, ATCache: reducing DRAM cache latency via a small SRAM tag cache, in Proc. ACM PACT, 2014, pp. 51-60. [31]N. Muralimanohar, R. Balasubramonian, and N. P. Jouppi, CACTI 6.0: A tool to understand large caches, Univ. Utah and Hewlett Packard Lab.,Tech. Rep., 2009. [32]M. R. Guthaus, J. S. Ringenberg, D. Ernst, T. M. Austin, T. Mudge, and R. B. Brown, MiBench: A free, commercially representative embedded benchmark suite, in Proc. Workload Characterization, 2001, pp. 3-14. [33]T. Austin, E. Larson, and D. Ernst, SimpleScalar: An infrastructure for computer system modeling, IEEE Trans. Computer, vol. 35, pp. 59-67, 2002.
|