|
[1]E. Anderson, J. Brooks, C. Grassl, and S. Scott, “Performance of the Cray T3E Multiprocessor,” Proceedings of the 1997 ACM/IEEE Conference on Supercomputing, pp. 1-17, November 1997.
[2]T.-S. Chen, C.-Y. Chang, and J.-P. Sheu, “Efficient Multicast Communication in Multidestination Wormhole-Routed Mesh Networks,” Proceedings of International Conference on Parallel and Distributed Processing Techniques and Applications, Vol. 2, pp. 674-681, July 1998.
[3]T.-S. Chen, N.-C. Wang, and C.-P. Chu, “Multicast Communication in Wormhole-Routed Star Graph Interconnection Networks,” Parallel Computing, Vol. 26, No. 11, pp. 1459-1490, October 2000.
[4]W. J. Dally and C. L. Seitz, “The Torus Routing Chip,” Journal of Distributed Computing, Vol. 1, No. 3, pp. 187-196, October 1986.
[5]G. E. Fagg, S. S. Vadhiyar, and J. J. Dongarra, “Automatic Collective Communications Tuning,” Recent Advances in Parallel Virtual Machine and Message Passing Interface, No. 1908, pp. 354-361, September 2000.
[6]E. Fleury and P. Fraigniaud, “Strategies for Path-Based Multicasting in Wormhole-Routed Meshes,” Journal of Parallel and Distributed Computing, Vol. 53, pp. 26-62, August 1998.
[7]A. Gara, M. A. Blumrich, D. Chen, G. L.-T. Chiu, P. Coteus, M. E. Giampapa, R. A. Haring, P. Heidelberger, D. Hoenicke, G. V. Kopcsay, T. A. Liebsch, M. Ohmacht, B. D. Steinmacher-Burow, T. Takken, and P. Vranas, “Overview of the Blue Gene/L System Architecture,” IBM Journal of Research and Development, Vol. 49, No. 2/3, pp. 195-212, April 2005.
[8]L. V. Kale, S. Kumar, and K. Varadarajan, “A Framework for Collective Personalized Communication,” Proceedings of the 17th International Parallel and Distributed Processing Symposium, pp. 69(a), April 2003.
[9]R. E. Kessler and J. L. Schwarzmeier, “CRAY T3D: A New Dimension for Cray Research,” Proceedings of the 38th IEEE Computer Society International Conference, pp. 176-182, February 1993.
[10]F. Harary, Graph Theory. Reading, MA: Addison-Wesley, 1972.
[11]C.-T. Ho and M. Kao, “Optimal Broadcast on Hypercubes with Wormhole and E-Cube Routings,” Proceedings of the 1993 International Conference on Parallel and Distributed Systems, pp. 694-697, December 1993.
[12]R. Libeskind-Hadas, K. Watkins, and T. Hehre, “Fault-Tolerant Multicast Routing in the Mesh with No Virtual Channels,” Proceedings of the 2nd International Symposium on High-Performance Computer Architecture, pp. 180-190, February 1996.
[13]X. Lin, P. K. McKinley, and L. M. Ni, “Deadlock-Free Multicast Wormhole Routing in 2D Mesh Multicomputers,” IEEE Transactions on Parallel and Distributed Systems, Vol. 5, No. 8, pp. 793-804, October 1994.
[14]X. Lin and L. M. Ni, “Deadlock-Free Multicast Wormhole Routing in Multicomputer Networks,” Proceedings of the 18th Annual International Symposium on Computer Architecture, pp. 116-125, May 1991.
[15]X. Lin and L. M. Ni, “Multicast Communication in Multicomputer Networks,” Proceedings of the 1990 International Conference on Parallel Processing, Vol. 3, pp. 114-118, August 1990.
[16]T. Marescaux, A. Bartic, D. Verkest, S. Vernalde, and R. Lauwereins, “Interconnection Networks Enable Fine-Grain Dynamic Multi-Tasking on FPGAs,” The 12th Conference on Field-Programmable Logic and Applications, pp. 795-805, September 2002.
[17]P. K. McKinley, H. Xu, A.-H. Esfahanian, and L. M. Ni, “Unicast-Based Multicast Communication in Wormhole-Routed Networks,” IEEE Transactions on Parallel and Distributed Systems, Vol. 5, Issue 12, pp. 1252-1265, December 1994.
[18]L. M. Ni and P. K. McKinley, “A Survey of Wormhole Routing Techniques in Direct Networks,” IEEE Computer, Vol. 26, No. 2, pp. 62-76, February 1993.
[19]M. Noakes and W. J. Dally, “System Design of the J-Machine,” Proceedings of the 6th MIT Conference on Advanced Research in VLSI, pp. 179-194, April 1990.
[20]D. K. Panda, S. Singal, and P. Prabhakaran, “Multidestination Message Passing Mechanism Conforming to Base Wormhole Routing Scheme,” Proceedings of the Parallel Computer Routing and Communication Workshop, No. 853, pp. 131-145, May 1994.
[21]R. Rabenseifner and G. Wellein, “Communication and Optimization Aspects of Parallel Programming Models on Hybrid Architectures,” International Journal of High-Performance Computing Applications, Vol. 17, No. 1, pp. 49-62, Spring 2003.
[22]D. F. Robinson, P. K. McKinley, and B. H. C. Cheng, “Path-Based Multicast Communication in Wormhole-Routed Unidirectional Torus Networks,” Journal of Parallel and Distributed Computing, Vol. 45, No. 2, pp. 104-121, September 1997.
[23]L. Schwiebert, “There is No Optimal Routing Policy for the Torus,” Information Processing Letters, Vol. 83, Issue 6, pp. 331-336, September 2002.
[24]R. Thakur, W. Gropp, and B. Toonen, “Optimizing the Synchronization Operations in MPI One-Sided Communication,” International Journal of High-Performance Computing Applications, Vol. 19, No. 2, pp. 119-128, Summer 2005.
[25]Y.-C. Tseng, D. K. Panda, and T.-H. Lai, “A Trip-Based Multicasting Model for Wormhole-Routed Networks with Virtual Channels,” IEEE Transactions on Parallel and Distributed Systems, Vol. 7, No. 2, pp. 138-150, February 1996.
[26]Y.-C. Tseng, M.-H. Yang, and T.-Y. Juang, “Achieving Fault-Tolerant Multicast in Injured Wormhole-Routed Tori and Meshes Based on Euler Path Construction,” IEEE Transactions on Computers, Vol. 49, No. 3, pp. 1282-1296, March 2000.
[27]N.-C. Wang, C.-P. Yen, and C.-P. Chu, “Multicast Communication in Wormhole-Routed Symmetric Networks with Hamiltonian Cycle Model,” Journal of Systems Architecture, Vol. 51, No. 3, pp. 165-183, March 2005.
|