|
[1] A. J. Smith, “Cache Memories,” ACM Computing Surveys, vol. 14, no. 3, pp.473-530, September 1982. [2] J. E. Smith, “Decoupled Access/Execute Computer Architectures,” ACM Trans. On Computer Systems, vol. 2, no. 4, pp.289-308, November 1984. [3] R. L. Lee, P. C. Yew, D. H. Lawrie, “Multiprocessor Cache Design Considerations,” Proc. 14th Int’l Symp. Computer Architecture, pp.253-262, June 1987. [4] N. P. Jouppi, “Improving Direct-Mapped Cache Performance by the Addition of a Small Fully-Associative Cache and Prefetch Buffers,” Proc. 17th . Int’l Symp. Computer Architecture, pp.364-373, May 1990. [5] D. Callahan, K. Kennedy, A. Potterfield, “Software Prefetching,” Proc. Of 4th Symp. On Architectural Support for Programming Languages and Operating Systems, April 1991. [6] J. W. C. Fu, J. H. Patel, “Data Prefetching in Multiprocessor Vector Cache Memories,” Proc. Of the 18th Int’l Symp. on Computer Architecture, pp.54-63, May 1991. [7] T. F. Chen, J. L. Baer, “An Effective On-Chip Preloading Scheme to Reduce Data Access Penalty,” Proc. Of Supercomputing ’91, pp.176-186, November 1991. [8] J. W. C. Fu, J. H. Patel, “Stride directed Prefetching in Scalar Processors,” Proc. Of the 25th Int’l Symp. on Microarchitecture, pp.102-110, December 1992. [9] I. Sklenar, “Prefetch Unit for Vector Operations on scalar computers,” ACM Computer Architecture News, vol. 20, no. 4, September 1992. [10] D. M. Tullsen, S. J. Eggers, “Limitations of Cache Prefetching on a Bus-Based Multiprocessor,” Proc. 20th Int’l Symp. on Computer Architecture,pp.278-288, May 1993. [11] S. Palacharla, R. E. Kessler, “Evaluating Stream Buffers as a Secondary Cache Replacement,” Proc. Of the 21st Int’l Symp on Computer Architecture, pp.24-33, April 1994. [12] T. F. Chen, J. L. Baer, “Effective Hardware-Based Data Prefetching for High-Performance Processors,” IEEE Trans. Computers, vol. 44, no. 5, pp.609-623, May 1995. [13] S. Kim, A. V. Veidenbaum, “Stride-Directed Prefetching for Secondary Caches,” Proc. of the ’97 Int’l Conference on Parallel Processing , p.314, August 1997. [14] J. Tse, A. J. Smith, “CPU Cache Prefetching: Timing Evaluation of Hardware Implementations,” IEEE Trans. Computers, vol. 47, no. 5, pp. 509-526, May 1998. [15] R. Cucchiara, M. Piccardi, A. Prati, “Exploiting Cache in Multimedia,” IEEE Conf. Multimedia Computing Systems ’99, pp.345-350, June 1999. [16] W. Tan, A.Veindenbaum, A. Nicolau, R. Gupta. “Conflict Miss Elimination by Time-stride Prefetch,” technical report, Information and Computer Science Dept., Univ. of California, Irvine, March 2000. [17] D. F. Zucker, R. B. Lee, M. J. Flynn, “Hardware and Software Cache Prefetching Techniques for MPEG Benchmarks,” IEEE Trans. Circuits & Systems for Video Technology, vol. 10, no. 5, pp.782-789, August 2000. [18] R. Cucchiara, , M. Piccardi, A. Prati, “Hardware Prefetching Techniques for Cache Memories in Multimedia Applications,” Proc. Of Int’l Workshop on Computer Architectures for Machine Perception(Camp 2000), pp.311-319, 2000. [19] S. A. McKee, W. A. Wulf, J.H. Aylor, R. H. Klenke, M. H. Salinas, S. I. Hong, D. A. B. Weikle, “Dynamic Access Ordering for Streamed Computations,” IEEE Trans. Computers, vol. 49, no. 11, pp.1255-1271, November 2000. [20] R. Cucchiara, M. Piccardi, A. Prati, “Temporal Analysis of Cache Prefetching Strategies for Multimedia Applications,” IEEE Int’l Conference on Performance, Computing, and Communications, pp.311-318, April 2001. [21] P. Reungsang, S. K. Park, S. W. Jeong, H. L. Roh, G. Lee, “Reducing Cache Pollution of Prefetching in Small data cache,” Proc. of Int’l Conference on Computer Design ’01,pp.530-533, September 2001. [22] ARM Ltd., “AMBA 2.0 Specification,” http://www.arm.com/, 1999. [23] ARM Ltd., “ARM7TDMI Technical Reference Rev 3,” http://www.arm.com/, 2001. [24] ARM Ltd, “ARM Development Suite (ADS) version 1.2,” http://www.arm.com/, 2000.
|