|
1. S. Aggarwal, S. Phadke, and M. Bhandarkar, Characterization of Hadoop Jobs Using Unsupervised Learning, in Cloud Computing Technology and Science (CloudCom), 2010 IEEE Second International Conference on. 2010. p. 748-753. 2. M. Burrows, The Chubby lock service for loosely-coupled distributed systems, in Proceedings of the 7th symposium on Operating systems design and implementation. 2006, USENIX Association: Seattle, Washington. p. 335-350. 3. M. Cafarella and D. Cutting, Building Nutch: Open Source Search, in Queue. 2004. p. 54-61. 4. T. Condie, N. Conway, P. Alvaro, J.M. Hellerstein, K. Elmeleegy, and R. Sears, MapReduce online, in Proceedings of the 7th USENIX conference on Networked systems design and implementation. 2010, USENIX Association: San Jose, California. p. 21-21. 5. J. Dean and S. Ghemawat, MapReduce: simplified data processing on large clusters, in Commun. ACM. 2008. p. 107-113. 6. J. Ekanayake, S. Pallickara, and G. Fox, MapReduce for Data Intensive Scientific Analyses, in Proceedings of the 2008 Fourth IEEE International Conference on eScience. 2008, IEEE Computer Society. p. 277-284. 7. S. Ghemawat, H. Gobioff, and S.-T. Leung, The Google file system, in SIGOPS Oper. Syst. Rev. 2003. p. 29-43. 8. P. Hunt, M. Konar, F.P. Junqueira, and B. Reed, ZooKeeper: wait-free coordination for internet-scale systems, in Proceedings of the 2010 USENIX conference on USENIX annual technical conference. 2010, USENIX Association: Boston, MA. p. 11-11. 9. X. Jiong, Y. Shu, R. Xiaojun, D. Zhiyang, T. Yun, J. Majors, A. Manzanares, and Q. Xiao, Improving MapReduce performance through data placement in heterogeneous Hadoop clusters, in Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW), 2010 IEEE International Symposium on. 2010. p. 1-9. 10. U. Kang, C.E. Tsourakakis, and C. Faloutsos, PEGASUS: A Peta-Scale Graph Mining System Implementation and Observations, in Proceedings of the 2009 Ninth IEEE International Conference on Data Mining. 2009, IEEE Computer Society. p. 229-238. 11. R.T. Kaushik, M. Bhandarkar, and K. Nahrstedt, Evaluation and Analysis of GreenHDFS: A Self-Adaptive, Energy-Conserving Variant of the Hadoop Distributed File System, in Cloud Computing Technology and Science (CloudCom), 2010 IEEE Second International Conference on. 2010. p. 274-287. 12. A. Kimball, S. Michels-Slettvet, and C. Bisciglia, Cluster computing for web-scale data processing, in SIGCSE Bull. 2008. p. 116-120. 13. S.Y. Ko, I. Hoque, B. Cho, and I. Gupta, On availability of intermediate data in cloud computations, in Proceedings of the 12th conference on Hot topics in operating systems. 2009, USENIX Association: Monte Verit\&\#224;, Switzerland. p. 6-6. 14. L.F. Lai, C.C. Wu, L.T. Huang, and J.C. Kuo, A Fuzzy Query Mechanism for Human Resource Websites, in Proceedings of the International Conference on Artificial Intelligence and Computational Intelligence. 2009, Springer-Verlag: Shanghai, China. p. 579-589. 15. L.F. Lai, C.C. Wu, M.Y. Shih, L.T. Huang, and W. Chiou, Parallel Processing for Fuzzy Queries in Human Resources Websites, in Journal of Internet Technology. 2010. p. 943-953. 16. J. Leverich and C. Kozyrakis, On the energy (in)efficiency of Hadoop clusters, in SIGOPS Oper. Syst. Rev. 2010. p. 61-65. 17. P. Mell and T. Grance, The NIST Definition of Cloud Computing (Draft). 2011. 18. B. Nicolae, D. Moise, G. Antoniu, L. Bouge, and M. Dorier, BlobSeer: Bringing high throughput under heavy concurrency to Hadoop Map-Reduce applications, in Parallel & Distributed Processing (IPDPS), 2010 IEEE International Symposium on. 2010. p. 1-11. 19. C. Olston, B. Reed, U. Srivastava, R. Kumar, and A. Tomkins, Pig latin: a not-so-foreign language for data processing, in Proceedings of the 2008 ACM SIGMOD international conference on Management of data. 2008, ACM: Vancouver, Canada. p. 1099-1110. 20. B. Panda, J.S. Herbach, S. Basu, and R.J. Bayardo, PLANET: massively parallel learning of tree ensembles with MapReduce, in Proc. VLDB Endow. 2009. p. 1426-1437. 21. S. Papadimitriou and J. Sun, DisCo: Distributed Co-clustering with Map-Reduce: A Case Study towards Petabyte-Scale End-to-End Mining, in Proceedings of the 2008 Eighth IEEE International Conference on Data Mining. 2008, IEEE Computer Society. p. 512-521. 22. K. Shvachko, H. Kuang, S. Radia, and R. Chansler, The Hadoop Distributed File System, in Proceedings of the 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST). 2010, IEEE Computer Society. p. 1-10. 23. A. Thusoo, J.S. Sarma, N. Jain, S. Zheng, P. Chakka, Z. Ning, S. Antony, L. Hao, and R. Murthy, Hive - a petabyte scale data warehouse using Hadoop, in Data Engineering (ICDE), 2010 IEEE 26th International Conference on. 2010. p. 996-1005. 24. A. Thusoo, J.S. Sarma, N. Jain, Z. Shao, P. Chakka, S. Anthony, H. Liu, P. Wyckoff, and R. Murthy, Hive: a warehousing solution over a map-reduce framework, in Proc. VLDB Endow. 2009. p. 1626-1629. 25. T. White, Hadoop: The Definitive Guide, Second Edition. 2010, O'Reilly Media. 26. W. Xu, L. Huang, A. Fox, D. Patterson, and M. Jordan, Mining console logs for large-scale system problem detection, in Proceedings of the Third conference on Tackling computer systems problems with machine learning techniques. 2008, USENIX Association: San Diego, California. p. 4-4. 27. L. Yang and Z. Shi, An Efficient Data Mining Framework on Hadoop using Java Persistence API, in Computer and Information Technology (CIT), 2010 IEEE 10th International Conference on. 2010. p. 203-209. 28. M. Zaharia, A. Konwinski, A.D. Joseph, R. Katz, and I. Stoica, Improving MapReduce performance in heterogeneous environments, in Proceedings of the 8th USENIX conference on Operating systems design and implementation. 2008, USENIX Association: San Diego, California. p. 29-42. 29. Amazon Web Services. Available from: http://aws.amazon.com/. 30. Apache Avro. Available from: http://avro.apache.org/. 31. Apache Hadoop. Available from: http://hadoop.apache.org/ 32. Apache Nutch. Available from: http://nutch.apache.org/. 33. Apache Pig. Available from: http://pig.apache.org/. 34. Apache ZooKeeper™. Available from: http://zookeeper.apache.org/. 35. CRM & Cloud Computing - salesforce.com. Available from: http://www.salesforce.com. 36. Facebook. Available from: http://www.facebook.com. 37. Google App Engine. Available from: http://code.google.com/intl/en/appengine/. 38. Gridmix3 Emulating Production Workload for Apache Hadoop. Available from: http://developer.yahoo.com/blogs/hadoop/posts/2010/04/gridmix3_emulating_production/. 39. Hadoop at Twitter. 2010; Available from: http://engineering.twitter.com/2010/04/hadoop-at-twitter.html. 40. Hadoop hdfs-default. Available from: http://hadoop.apache.org/common/docs/current/hdfs-default.html. 41. Hadoop mapred-default. Available from: http://hadoop.apache.org/common/docs/current/mapred-default.html. 42. Hadoop Taiwan User Group. Available from: http://www.hadoop.tw/. 43. Hadoop Wiki PoweredBy. Available from: http://wiki.apache.org/hadoop/PoweredBy 44. Hadoop™ Common. Available from: http://hadoop.apache.org/common/. 45. Hadoop™ Distributed File System. Available from: http://hadoop.apache.org/hdfs/. 46. Hadoop™ MapReduce. Available from: http://hadoop.apache.org/mapreduce/. 47. HBase. Available from: http://hbase.apache.org/. 48. Hive. Available from: http://hive.apache.org/. 49. HowManyMapsAndReduces - Hadoop Wiki. Available from: http://wiki.apache.org/hadoop/HowManyMapsAndReduces. 50. IBM. Available from: http://www.ibm.com/us/en/. 51. Microsoft Corporation. Available from: http://www.microsoft.com/en-us/default.aspx. 52. NCHC Cloud Computing Research Group. Available from: http://trac.nchc.org.tw/cloud. 53. Sort Benchmark Home Page. Available from: http://sortbenchmark.org/. 54. Sqoop. Available from: http://www.cloudera.com/downloads/sqoop/. 55. Wikipedia Hadoop. Available from: http://en.wikipedia.org/wiki/Hadoop. 56. Yahoo! Hadoop Tutorial: Managing a Hadoop Cluster. Available from: http://developer.yahoo.com/hadoop/tutorial/module7.html. 57. A. Anand. Hadoop Sorts a Petabyte in 16.25 Hours and a Terabyte in 62 Seconds. 2009; Available from: http://developer.yahoo.com/blogs/hadoop/posts/2009/05/hadoop_sorts_a_petabyte_in_162/. 58. D. Gottfrid. Self-Service, Prorated Supercomputing Fun! 2007; Available from: http://open.blogs.nytimes.com/2007/11/01/self-service-prorated-super-computing-fun/. 59. D. Tankel. Scalability of the Hadoop Distributed File System. 2010; Available from: http://developer.yahoo.com/blogs/hadoop/posts/2010/05/scalability_of_the_hadoop_dist/. 60. J. Zawodny. Yahoo! Launches World's Largest Hadoop Production Application. 2008; Available from: http://developer.yahoo.com/blogs/hadoop/posts/2008/02/yahoo-worlds-largest-production-hadoop/.
|