|
[1] Apache Hadoop. [Online]. Available: http://hadoop.apache.org/ [2] Apache HDFS. [Online]. Available: http://hadoop.apache.org/docs/r1.2.1/hdfs design. html [3] K. McKusick and S. Quinlan, “GFS: Evolution on Fast-Forward, Commun. ACM, vol. 53, no. 3, pp. 42–49, Jan. 2010. [4] S. Ghemawat, H. Gobioff, and S.-T. Leung, “The Google File System, in Proc. 19th ACM Symp. Operating Systems Principles (SOSP’03), Oct. 2003, pp. 29–43. [5] Apache HBase. [Online]. Available: http://hbase.apache.org/ [6] N. Leavitt, “Will NoSQL Databases Live Up to Their Promise? IEEE Computer, vol. 43, no. 2, pp. 12–14, Feb. 2010. [7] F. Chang, J. Dean, S. Ghemawat, W. C. Hsieh, D. A. Wallach, M. Burrows, T. Chandra, A. Fikes, and R. Gruber, “Bigtable: A Distributed Storage System for Structured Data, in Proc. 7th USENIX Symp. Operating Systems Design and Implementation (OSDI’06), Nov. 2006, pp. 205–218. [8] The R Project for Statistical Computing. [Online]. Available: https://www.r-project.org/ [9] Python. [Online]. Available: https://www.python.org/ [10] G. C. Deka, “A Survey of Cloud Database Systems, IT Professional, vol. 16, no. 2, pp. 50–57, March-April 2014. [11] Apache Cassandra. [Online]. Available: http://cassandra.apache.org/ [12] Couchbase. [Online]. Available: http://www.couchbase.com/ [13] MongoDB. [Online]. Available: http://www.mongodb.org/ [14] MySQL. [Online]. Available: http://www.mysql.com/ [15] Apache Phoenix. [Online]. Available: http://phoenix.apache.org/ [16] Samba. [Online]. Available: https://en.wikipedia.org/wiki/Samba (software) [17] Apache Spark. [Online]. Available: https://spark.apache.org/ [18] M. Zaharia, R. S. Xin, P. Wendell, T. Das, M. Armbrust, A. Dave, X. Meng, J. Rosen, S. Venkataraman, M. J. Franklin, A. Ghodsi, J. Gonzalez, S. Shenker, and I. Stoica, “Apache Spark: A Unified Engine for Big Data Processing, Commun. ACM, vol. 59, no. 11, pp. 56–65, Nov. 2016. [19] Apache Hive. [Online]. Available: https://hive.apache.org/ [20] A. Thusoo, J. S. Sarma, N. Jain, Z. Shao, P. Chakka, N. Zhang, S. Anthony, H. Liu, and R. Murthy, “Hive—A Petabyte Scale Data Warehouse Using Hadoop, in Proc. of IEEE Int’l Conf. Data Engineering (ICDE), Mar. 2010, pp. 996–1005. [21] R. Ihaka and R. Gentleman, “R: A Language for Data Analysis and Graphics, Journal of Computational and Graphical Statistics, vol. 5, no. 3, pp. 299–314, Sep. 1995. [22] H.-C. Hsiao, H.-Y. Chung, H. Shen, and Y.-C. Chao, “Load Rebalancing for Distributed File Systems in Clouds, IEEE Transactions on Parallel and Distributed Systems, vol. 24, no. 5, pp. 951–962, May 2013. [23] Apache Hadoop YARN. [Online]. Available: https://hadoop.apache.org/docs/current/ hadoop-yarn/hadoop-yarn-site/YARN.html [24] V. K. Vavilapalli, A. C. Murthy, C. Douglas, S. Agarwal, M. Konar, R. Evans, T. Graves, J. Lowe, H. Shah, S. Seth, B. Saha, C. Curino, O. O’Malley, S. Radia, B. Reed, and E. Baldeschwieler, “Apache Hadoop YARN: Yet Another Resource Negotiator, in Proc. ACM Symp. Cloud Computing (SOCC’13), Oct. 2013. [25] J. Dean and S. Ghemawat, “MapReduce: Simplified Data Processing on Large Clusters, in Proc. 6th USENIX Symp. Operating System Design and Implementation (OSDI’04), Dec. 2004, pp. 137–150. [26] Apache ZooKeeper. [Online]. Available: https://zookeeper.apache.org/ [27] P. Hunt, M. Konar, F. P. Junqueira, and B. Reed, “ZooKeeper: Wait-free Coordination for Internet-scale Systems, in USENIX Annual Technical Conference, 2010. [28] S. Adve and K. Gharachorloo, “Shared Memory Consistency Models: A Tutorial, IEEE Computer, vol. 29, no. 12, pp. 66–76, Dec. 1996. [29] H.-C. Hsiao and C.-W. Chang, “A Symmetric Load Balancing Algorithm with Performance Guarantees for Distributed Hash Tables, IEEE Transactions on Computers, vol. 62, no. 4, pp. 662–675, Apr. 2013. [30] H.-C. Hsiao, H. Liao, S.-T. Chen, and K.-C. Huang, “Load Balance with Imperfect Information in Structured Peer-to-Peer Systems, IEEE Transactions on Parallel and Distributed Systems, vol. 22, no. 4, pp. 634–649, Apr. 2011. [31] M. Mitzenmacher and E. Upfal, Probability and Computing. Cambridge, 2005. [32] Hadoop Archives. [Online]. Available: https://hadoop.apache.org/docs/current/ hadoop-archives/HadoopArchives.html [33] Hadoop Sequence Files. [Online]. Available: https://wiki.apache.org/hadoop/ SequenceFile [34] HBase Coprocessor. [Online]. Available: https://blogs.apache.org/hbase/entry/ coprocessor introduction [35] VMware. [Online]. Available: http://www.vmware.com/ [36] Apache Web HDFS REST API. [Online]. Available: https://hadoop.apache.org/docs/ r1.0.4/webhdfs.html [37] Apache Hadoop HttpFS. [Online]. Available: https://hadoop.apache.org/docs/r2.4.1/ hadoop-hdfs-httpfs/index.html [38] Apache Sqoop. [Online]. Available: http://sqoop.apache.org/ [39] Apache Flume. [Online]. Available: https://flume.apache.org/ [40] Apache Kafka. [Online]. Available: https://kafka.apache.org/ [41] S. Tarkoma, Publish/Subscribe Systems: Design and Principles. WILEY, 2012. [42] S. Venkataraman, I. Roy, A. AuYoung, and R. S. Schreiber, “Using R for Iterative and Incremental Processing, in 4th USENIX Workshop on Hot Topics in Cloud Computing (HotCloud’12), June 2012. [43] RevolutionAnalytics RHadoop. [Online]. Available: https://github.com/ RevolutionAnalytics/rhadoop/wiki [44] SparkR (R on Spark). [Online]. Available: https://spark.apache.org/docs/latest/sparkr. html [45] S. Venkataraman, Z. Yang, D. Liu, E. Liang, H. Falaki, X. Meng, R. Xin, A. Ghodsi, M. J. Franklin, I. Stoica, and M. Zaharia, “SparkR: Scaling R Programs with Spark, in Proc. ACM Int’l Conf. Management of Data (SIGMOD’16), June 2016, pp. 1099–1104. [46] A. R. Chang, Y.-L. Chen, Y.-Z. Huang, H.-C. Hsiao, M. Hsu, C.-C. Lee, H.-Y. Lee, W.-A. Shih, C.-P. Tsai, and K.-P. Tseng, “The Case of Operational Distributed Stor age Service for Big Data in a Semiconductor Wafer Fabrication Foundry, in Taiwan Academic Network Conf., Oct. 2018, Best Paper Award. [47] A. R. Chang, Y.-L. Chen, Y.-Z. Huang, H.-C. Hsiao, M. Hsu, C.-C. Lee, H.-Y. Lee, W.- A. Shih, H.-P. Su, C.-P. Tsai, and K.-P. Tseng, “The Case of a Novel Operational Distributed Storage Service for Big Data in a Semiconductor Wafer Fabrication Foundry, in Int’l Workshop BigData Processing Systems in conjunction with IEEE Int’l Conf. Parallel and Distributed Systems, Dec. 2018. [48] A. R. Chang, Y.-L. Chen, P.-Y. Chou, Y.-Z. Huang, H.-C. Hsiao, T.-T. Hsieh, M. Hsu, C.-C. Lee, H.-Y. Lee, Y.-C. Shih, W.-A. Shih, C.-H. Tang, C.-P. Tsai, and K.-P. Tseng, “The Case of Big Data Platform Services for Semiconductor Wafer Fabrication Foundries, in Int’l Conf. ICT Convergence, Oct. 2018. [49] ——, “A Distributed R-Language Computing Platform Service for a Semiconductor Wafer Fabrication Foundry, in Int’l Computer Symposium, Dec. 2018. [50] The Network Time Protocol. [Online]. Available: http://www.ntp.org/ [51] HBase Regions. [Online]. Available: https://hbase.apache.org/book/regions.arch.html [52] HBase APIs. [Online]. Available: http://hbase.apache.org/0.94/apidocs/ [53] IBM BladeCenter HS23. [Online]. Available: http://www-03.ibm.com/systems/ bladecenter/hardware/servers/hs23/ [54] B. F. Cooper, A. Silberstein, E. Tam, R. Ramakrishnan, and R. Sears, “Benchmarking Cloud Serving Systems with YCSB, in Proc. ACM Symp. Cloud Computing (SOCC’10), June 2010, pp. 143–154. [55] HBase Snapshots. [Online]. Available: https://hbase.apache.org/book/ops.snapshots. html [56] Cloudera Snapshots. [Online]. Available: http://www.cloudera.com/content/ cloudera-content/cloudera-docs/CM5/latest/Cloudera-Backup-Disaster-Recovery/ cm5bdr snapshot intro.html [57] HBase Replication. [Online]. Available: http://blog.cloudera.com/blog/2012/07/ hbase-replication-overview-2/ [58] HBase Export. [Online]. Available: https://hbase.apache.org/book/ops mgt.html# export [59] HBase CopyTable. [Online]. Available: https://hbase.apache.org/book/ops mgt.htm# copytable [60] Oracle Database Backup and Recovery. [Online]. Available: http://docs.oracle.com/ cd/E11882 01/backup.112/e10642/rcmintro.htm#BRADV8001 [61] J. Zhou, N. Bruno, and W. Lin, “Advanced Partitioning Techniques for Massively Distributed Computation, in Proc. of ACM SIGMOD, May 2012, pp. 13–24. [62] C. Hong, D. Zhou, M. Yang, C. Kuo, L. Zhang, and L. Zhou, “KuaFu: Closing the Parallelism Gap in Database Replication, in Proc. of IEEE Int’l Conf. Data Engineering (ICDE), April 2013, pp. 1186–1195. [63] S.-W. Lee and B. Moon, “Transactional In-Page Logging for Multiversion Read Consistency and Recovery, in Proc. of IEEE Int’l Conf. Data Engineering (ICDE), April 2011, pp. 876–887.
|