|
[1]A. Park and M. K. Farrens, “Address Compression through Base Register Caching,” in Proceedings of the Annul IEEE/ACM International Symposium on Microarchitecture,1990 , pp.193-199. [2]D. Citron and L. Rudolph, “Creating a Wider Bus Using Caching Techniques,” in Proceedings of the 1st IEEE Symposium on High-Performance Computer Architecture, 1995, pp.90-99. [3]S. Palacharla, N. P. Jouppi, and J. E. Smith, “Quantifying the Complexity of Superscalar Processors,” University of Wisconsin-Madison, Tech. Rep. CS-1328, May 1997. [4]D. Burger and T. M. Austin, “The SimpleScalar tool set, version 2.0”, in University of Wisconsin-Madison, Jun. 1997, CS-1342. [5]D. Brooks, V. Tiwari, and M. Martonosi, “Wattch: A framework for architectural-level power analysis and optimizations,” in Proceedings on the 27th Annual International Symposium on Computer Architecture, 2000, pp.83-94. [6]G. Reinman and N. P. Jouppi, “CACTI 2.0: An Integrated Cache Timing and Power Model,” COMPAQ Western Research Lab, Palo Alto, CA, Tech. Rep., Feb. 2000. [7]J. L. Henning, “SPEC CPU2000: Measuring CPU performance in the new millennium,” IEEE Computer, Vol: 33, 2000, pp.28-35 [8]L. Villa, M. Zhang, and K. Asanovic, “Dynamic Zero Compression for Cache Energy Reduction,” in Proceedings of the 33rd International Symposium on Microarchitecture, Dec.2000 [9]R. Canal, A. González, and J. E. Smith, “Very low power pipelines using significance compression,” in Proceedings of the 33rd Annual ACM/IEEE international Symposium on Microarchitecture (Monterey, California, United States). MICRO 33. ACM Press, New York , 2000, pp.181-190 [10]Dmitry Ponomarev, Gurhan Kucuk, Kanad Ghose, “Power Reduction in Superscalar Datapaths Through Dynamic Bit-Slice Activation,” Innovative Architecture for Future Generation High-Performance Processors and Systems (IWIA'01), 2001, pp.0016 [11]I. Park, C. L. Ooi, and T. N. Vijaykumar, “Reducing Design Complexity of the Load/Store Queue,” in Proceedings of the 36th Annul IEEE/ACM International Symposium on Microarchitecture, 2003, pp.411-422. [12]S. Sethumadhavan, R. Desikan, D. Burger. C. R. Moore, and S. W. Keckler, “Scalable Hardware Memory Disambiguation for High ILP Processors,” in Proceedings of the 36th Annul IEEE/ACM International Symposium on Microarchitecture, 2003, pp.188-127. [13]H. W. Cain and M. H. Lipasti, “Memory Ordering: A Value-Based Approach,” in Proceedings on the 31st Annual International Symposium on Computer Architecture, 2004, pp.90-101. [14]J. Liu, K. Sundaresan and N. R. Mahapatra, “Dynamic Address Compression Schemes: A Performance, Energy, and Cost Study,” in Proceedings of the IEEE International Conference on Computer Design, 2004, pp.458-463. [15]R. Gonzalez, A. Critstal, D. Ortega, A. Veidembaum, and M. Valero, “A content aware integer register file organization,” in 31st Annual International Symposium on Computer Architecture, 2004, pp.314-324. [16]Ramon Canal, Antonio González and James E. Smith, “Value Compresson for Efficient Computation”, European Conference on Parallel Computing (Europar'05), Lisboa (Portugal); Lecture Notes in Computer Science, August 2005, pp. 519-529 [17]Abella and A. González, “SAMIE-LSQ: Set-Associative Multiple-Instruction Entry Load/Store Queue,” in 20th IEEE International Parallel and Distributed Processing Symposium, 2006. [18]L. Baugh and C. Zilles, “Decomposing the Load-Store Queue by Function for Power Reduction and Scalability,” in IBM 2006 Journal of Research and Development in Computers & Technology, 2006, pp.287- 297. [19]F. Castro, D. Chaver, L. Pinuel, M. Prieto, M .C. Huang, and F. Tirado, “LSQ: a power efficient and scalable implementation,” in IEE proceedings Computers and digital Techniques, 2006, pp.389-398. [20]Kostas Pagiamtzis, “Content-Addressable Memory (CAM) Circuits and Architectures: A Tutorial and Survey,” in IEEE Journal of Solid-State Circuits, 2006, pp.712-727. [21]J. Liu, K. Sundaresan, and N. R. Mahapatra , “A Fast Dynamic Compression scheme for Low-Latency On-Chip Address Buses,” in the 18th International Conference on Microelectronics, 2006. [22]O. Rochecouste, G. Pokam, and A. Seznec, “A case for a complexity-effective, width-partitioned Microarchitecture,” in ACM Trans Archit. Code Optim, 2006, pp.295-326
|