|
[1]Chen, D. K., Yew, P. C., and Torrellas, J., “An efficient algorithm for the run-time parallelization of doacross loops,” in Proc. 1994 Supercomputing, 518-527, 1994. [2]Huang, T. C. and Hsu, P. H., “The SPNT test: a new technology for run-time speculative parallelization of loops,” in Lecture Notes in Computer Science No. 1366 (Z. Li, P. C. Yew, S. Chatterjee, C. H. Huang, P. Sadayappan, and D. Sehr, eds.), 177-191, Springer-Verlag, 1998. [3]Huang, T. C., Hsu, P. H., and Sheng, T. N., “Efficient run-time scheduling for parallelizing partially parallel loops,” J. Information Science and Engineering 14(1), 255-264, 1998. [4]Leung, S. T. and Zahorjan, J., “Improving the performance of run-time parallelization,” in Proc. 4th ACM SIGPLAN Symp. Principles and Practice of Parallel Programming, 83-91, 1993. [5]Leung, S. T. and Zahorjan, J., “Extending the applicability and improving the performance of run-time parallelization,” Tech. Rep. 95-01-08, Dept. CSE, Univ. of Washington, 1995. [6]Midkiff, S. and Padua, D., “Compiler algorithms for synchronization,” IEEE Trans. Comput. C-36, 12, 1485-1495, 1987. [7]Polychronopoulos, C., “Compiler optimizations for enhancing parallelism and their impact on architecture design,” IEEE Trans. Comput. C-37, 8, 991-1004, 1988. [8]Rauchwerger, L., Amato, N., and Padua, D., “A scalable method for run-time loop parallelization,” Int. J. Parallel Processing, 26(6), 537-576, 1995. [9]Rauchwerger, L. and Padua, D., “The LRPD test: Speculative run-time parallelization of loops with privatization and reduction parallelization,” IEEE Trans. Parallel and Distributed Systems; 10(2), 160-180, 1999. [10]Saltz, J., Mirchandaney, R., and Crowley, K., “Run-time parallelization and scheduling of loops,” IEEE Trans. Comput. 40(5), 603-612, 1991. 11.Xu, C. and Chaudhary, V., “Time-stamping algorithms for parallelization of loops at run-time,” in Proc. 11th Int. Parallel Processing Symp. 1997. [12]Zhu, C. Q. and Yew, P. C., “A scheme to enforce data dependence on large multiprocessor systems,” IEEE Trans. Software Eng. 13(6), 726-739, 1987. [13]Z. Shen, Z. Li, and P. C. Yew, “An empirical study on array subscripts and data dependencies,” in Proc. Of ICPP, pp. II-145 to II-152, 1989. [14]Lawrence Rauchwerger, “Run-time parallelization: A framework for parallel computation,” A thesis for the degree of Doctor of Philosophy in Computer Science in the Graduate College of the Univ. of Illinois, 1995. [15]H. Zima, Supercompilers for Parallel and Vector Computer. ACM press, New York, 1991. [16]Michael Wolfe, High Performance Compilers for Parallel Computing. Addition-Wesley Publish, 1996.
|