|
[1]“Apache Hadoop, http://hadoop.apache.org, retrieved on March 2013. [2]D. P. Bertsekas, Constrained Optimization and Lagrange Multiplier Methods. Mass: Athena Scientific, 1996. [3]K. E. Batcher, “Sorting Networks and their Applications, Proceedings of the AFIPS Spring Joint Computer Conference, vol. 32, pp. 307-314, 1968. [4]“CUDA, http://developer.nvidia.com/category/zone/cuda-zone, retrieved on March 2013. [5]“Compute Capability, http://www.geeks3d.com/20100606/gpu-computing-nvidia-cuda-compute-capability-comparative-table/, retrieved on March 2013. [6]L. Chen, and G. Agrawal, “Optimizing MapReduce for GPUs with Effective Shared Memory Usage, Proceedings of the 21st international symposium on High-Performance Parallel and Distributed Computing, pp. 199-210, 2012. [7]L. Chen, X. Huo, and G. Agrawal, “Accelerating MapReduce on a Coupled CPU-GPU Architecture, Proceedings of the international conference on High Performance Computing, Networking, Storage and Analysis, no. 25, 2012. [8]B. Catanzaro, N. Sundaram, and K. Keutzer, “A Map Reduce Framework for Programming Graphics Processors, Workshop on Software Tools for MultiCore Systems, 2008. [9]T. Chen, R. Raghavan, J. N. Dale, and E. Iwata, “Cell Broadband Engine Architecture and its First Implementation – A Performance View, IBM Journal of Research and Development, vol. 51, no. 5, pp. 559-572, 2007. [10]R. M. Chen, S. T. Lo, Y. M. Huang, and C.M. Wang, Solve Multiprocessor Real-Time Scheduling Using Competitive Slack Neural Networks, International Computer Symposium, 2004. [11]S. C. Cheng, and Y. M. Huang, Scheduling Multi-Processor Tasks with Resource and Timing Constraints Using Genetic Algorithm, Proceedings of 5th IEEE International Symposium on Computational Intelligence in Robotics and Automation, 2003. [12]R. M. Chen, and Y. M. Huang, Multiprocessor Task Assignment with Fuzzy Hopfield Neural Network Clustering Technique, Neural Computing & Applications, vol. 10, pp. 12-21, 2001. [13]R. M. Chen, and Y. M. Huang, Multiconstraint Task Scheduling in Multiprocessor System by Neural Network, Proceedings of 10th IEEE International Conference on Tools with Artificial Intelligence, 1998. [14]A. Dou, V. Kalogeraki, D. Gunopulos, T. Mielikainen, and V. H. Tuulos, “Misco: A MapReduce Framework for Mobile Systems, Proceedings of the 3rd international conference on Pervasive Technologies Related to Assistive Environments, no. 32, 2010. [15]J. Dean, and S. Ghemawat, “MapReduce: Simplified Data Processing on Large Clusters, Communications of the ACM, pp. 107-113, 2008. [16]“Flynn分類法, http://computing.llnl.gov/tutorials/parallel_comp, retrieved on March 2013. [17]“GPGPU, http://gpgpu.org/about, retrieved on March 2013. [18]D. Gross, and C. M. Harris, Fundamentals of Queueing Theory, 3rd ed. New York: Wiley, 1998. [19]B. V. Gnedenko, and I. N. Kovalenko, Introduction to Queueing Theory, 2nd ed. Boston: Birkhauser, 1989. [20]C. Hong, D. Chen, W. Chen, W. Zheng, and H. Lin, “MapCG: Writing Parallel Program Portable between CPU and GPU, Proceedings of the 19th international conference on Parallel Architecture and Compilation Techniques, pp. 217-226, 2010. [21]B. He, W. Fang, Q. Luo, N. K. Govindaraju, and T. Wang, “Mars: A MapReduce Framework on Graphics Processors, Proceedings of the 17th international conference on Parallel Architectures and Compilation Techniques, pp. 260-269, 2008. [22]Y. M. Huang, and R. M. Chen, Scheduling Multiprocessor Job with Resource and Timing Constraints using Neural Networks, IEEE Transactions on Systems, Man, and Cybernetics on, vol. 29, pp. 490-502, 1999. [23]F. Ji, and X. Ma, “Using Shared Memory to Accelerate MapReduce on Graphics Processing Units, Proceedings of the IEEE International Parallel & Distributed Processing Symposium, pp. 805-816, 2011. [24]S. T. Lo, R. M. Chen, Y. M. Huang, and C. L. Wu Multiprocessor System Scheduling with Precedence and Resource Constraints Using an Enhanced Ant Colony System, Expert Systems With Applications, vol. 34, pp. 2071-2081, 2008. [25]“Mars, http://www.cse.ust.hk/gpuqp/Mars.html, retrieved on December 2012. [26]NVIDIA, “NVIDIA CUDA C Programming Guide, version 4.2, NVIDIA Cooperation, retrieved on January 2013. [27]NVIDIA, “NVIDA’s Next Generation CUDATM Compute Architecture: KeplerTM GK110, NVIDIA Cooperation, retrieved on January 2013. [28]NVIDIA, “The CUDA Compiler Driver NVCC, NVIDIA Cooperation, retrieved on January 2013. [29]G. F. Newell, Applications of Queueing Theory, 2nd ed. New York: Chapman and Hall, 1982. [30]“OpenCL, http://www.khronos.org/opencl/, retrieved on March 2013. [31]“Overlapping, http://developer.nvidia.com/content/how-overlap-data-transfers-cuda-cc, retrieved on March 2013. [32]“Phoenix, http://mapreduce.stanford.edu, retrieved on March 2013. [33]A. Papagiannis, and D. S. Nikolopoulos, “Rearchitecturing MapReduce for Heterogeneous Multicore Processors with Explicitly Managed Memories, Proceedings of the 39th international conference on Parallel Processing, pp. 121-130, 2010. [34]H. Peters, O. Schulz-Hildebrandt, and N. Luttenberger, “Parallel External Sorting for CUDA-enabled GPUs with Load Balancing and Low Transfer Overhead, Proceedings of the IEEE International Parallel & Distributed Processing Workshops and Phd Forum, pp. 1-8, 2010. [35]J. A. Stuart, and J. D. Owens, “Multi-GPU MapReduce on GPU Clusters, Proceedings of the IEEE International Parallel & Distributed Processing Symposium, pp. 1068-1079, 2011. [36]N. Sundaram, A. Raghunathan, and S. T. Chakradhar, “A Framework for Efficient and Scalable Execution of Domain-specific Templates on GPUs, Proceedings of the IEEE International Parallel & Distributed Processing Symposium, pp. 1-12, 2009. [37]M. Tanner, Practical Queueing Analysis. New York: McGraw-Hill, 1995.
|