跳到主要內容

臺灣博碩士論文加值系統

(18.97.9.171) 您好!臺灣時間:2025/01/17 09:39
字體大小: 字級放大   字級縮小   預設字形  
回查詢結果 :::

詳目顯示

我願授權國圖
: 
twitterline
研究生:廖唯辰
研究生(外文):Wei-Chen Liao
論文名稱:基於虛擬機器之容錯系統實作
論文名稱(外文):Implementing a Virtual Machine based Fault Tolerance System
指導教授:徐慰中
口試日期:2017-07-23
學位類別:碩士
校院名稱:國立臺灣大學
系所名稱:資訊工程學研究所
學門:工程學門
學類:電資工程學類
論文種類:學術論文
論文出版年:2017
畢業學年度:105
語文別:英文
論文頁數:37
中文關鍵詞:容錯虛擬化虛擬機器高可用性硬體錯誤
外文關鍵詞:Fault toleranceVirtualizationVirtual MachineHigh availabilityHardware failure
相關次數:
  • 被引用被引用:0
  • 點閱點閱:455
  • 評分評分:
  • 下載下載:0
  • 收藏至我的研究室書目清單書目收藏:0
近年來,虛擬機器在雲端伺服器中被廣泛使用。一台實體機器上可能運行著數個虛擬機器,並提供數個服務。此時在這台實體機器上發生的硬體錯誤和電源錯誤將會同時影響無數運行在這台機器上的服務。因此,服務的容錯對這些伺服器來說更加重要。然而,一些商業軟體像是 VMware 所提供的容錯系統價格過於昂貴,並非所有用戶都能負擔。在這篇論文中,我們實作了 CUJU,一個開源、基於虛擬機器的容錯系統在 QEMU 2.8/Linux kernel 4.4.0 平台上,並描述此實作的設計以及運作方法。在論文的最後,我們會基於CUJU的初始版本來量測輸出延遲以及吞吐量之額外負擔,並和 VMware 的容錯系統比較。
In recent years, virtual machines have been widely deployed in the Cloud. A physical machine may run multiple VMs, with multiple services. Failures of a physical machine, including hardware malfunctions and power losses could affect many services running on the machine at the same time. Thus, Fault tolerance for services in the Cloud is of paramount importance for the cloud computing. However, fault tolerance support in commercial products like VMware is expensive, not affordable by many users. In this thesis, we implement CUJU, an open source virtual machine based fault tolerance system using QEMU 2.8/Linux kernel 4.4.0 and describe how it works. We also evaluate the output latency and throughput overhead of this preliminary version of CUJU and compared its performance as well as functionality with the VMware offered Fault Tolerance system on the market.
口試委員會審定書 iv
摘要 v
Abstract vii
1 Introduction 1
2 Background 5
2.1 Live Migration 5
2.2 Known Virtualization-based FT Solution 6
2.2.1 Record and Replay 6
2.2.2 Checkpointing 7
2.2.3 COarse-grain LOck-stepping 7
2.2.4 Comparison of Methods above 7
2.2.5 Method used in CUJU 8
2.3 Epoch-based Fault Tolerance 9
2.3.1 Epoch 9
2.3.2 Response Latency and Throughput 9
2.4 Previous Work 10
2.4.1 Motivation 10
3 Design and Implementation 13
3.1 Overview 13
3.2 Architecture 14
3.3 Communication Protocol between Master and Slave 14
3.4 Correctness 15
3.5 Implementation Detail 16
3.5.1 Build Connection 17
3.5.2 Live Migration 18
3.5.3 Enter FT Mode & Trigger Timer 18
3.5.4 Output Buffering 18
3.5.5 Timer 19
3.5.6 Snapshot 19
3.5.7 Transfer Thread 21
3.5.8 Dirty Page Tracking 21
3.5.9 Diff Page & Transfer 23
3.5.10 RAM Loader 23
3.5.11 Device State Loader 23
3.5.12 Error Detection 24
3.5.13 Failover 24
4 Evaluation 27
4.1 Overview 27
4.2 Environment Setting 27
4.3 kernel Compilation 27
4.4 UnixBench 28
4.5 Apache Jmeter 29
4.6 SCP, WGET, PING 30
5 Conclusion and Future Work 33
Bibliography 35
[1] J. Gray and D. P. Siewiorek, “High-availability computer systems,” Computer, vol. 24, no. 9, pp. 39–48, Sept 1991.
[2] “VMware.” [Online]. Available: http://www.vmware.com
[3] “CUJU-FT.” [Online]. Available: https://cuju-ft.github.io/cuju-web/home.html
[4] F. Bellard, “Qemu, a fast and portable dynamic translator,” in Proceedings of the Annual Conference on USENIX Annual Technical Conference, ser. ATEC ’05. Berkeley, CA, USA: USENIX Association, 2005, pp. 41–41. [Online]. Available: http://dl.acm.org/citation.cfm?id=1247360.1247401
[5] C. Clark, K. Fraser, S. Hand, J. G. Hansen, E. Jul, C. Limpach, I. Pratt, and A. War eld, “Live migration of virtual machines,” in Proceedings of the 2Nd Conference on Symposium on Networked Systems Design & Implementation - Volume 2, ser. NSDI’05. Berkeley, CA, USA: USENIX Association, 2005, pp. 273–286. [Online]. Available: http://dl.acm.org/citation.cfm?id=1251203.1251223
[6] T. C. Bressoud and F. B. Schneider, “Hypervisor-based fault tolerance,” in Proceedings of the Fifteenth ACM Symposium on Operating Systems Principles, ser. SOSP ’95. New York, NY, USA: ACM, 1995, pp. 1–11. [Online]. Available: http://doi.acm.org/10.1145/224056.224058
[7] “Hardware and software approaches for deterministic multiprocessor replay of concurrent programs.” Intel Technology Journal, vol. 13, no. 4, pp. 20–41, 2009. [Online]. Available: http://iacoma.cs.uiuc.edu/iacoma-papers/replay_itj.pdf
[8] D.J.Scales,M.Nelson,and G.Venkitachalam,“The design of a practical system for fault-tolerant virtual machines,” SIGOPS Oper. Syst. Rev., vol. 44, no. 4, pp. 30–39, Dec. 2010. [Online]. Available: http://doi.acm.org/10.1145/1899928.1899932
[9] B.Cully,G.Lefebvre,D.Meyer,M.Feeley,N.Hutchinson,andA.War eld,“Remus: High availability via asynchronous virtual machine replication,” in Proceedings of the 5th USENIX Symposium on Networked Systems Design and Implementation, ser. NSDI’08. Berkeley, CA, USA: USENIX Association, 2008, pp. 161–174. [Online]. Available: http://dl.acm.org/citation.cfm?id=1387589.1387601
[10] P. Barham, B. Dragovic, K. Fraser, S. Hand, T. Harris, A. Ho, R. Neugebauer, I. Pratt, and A. War eld, “Xen and the art of virtualization,” in Proceedings of the Nineteenth ACM Symposium on Operating Systems Principles, ser. SOSP ’03. New York, NY, USA: ACM, 2003, pp. 164–177. [Online]. Available: http://doi.acm.org/10.1145/945445.945462
[11] Y. Tamura, “Kemari: Fault tolerant vm synchronization based on kvm using domt,” in Proceedings of the 5th USENIX Symposium on Networked Systems Design and Implementation, ser. NSDI’08. Technical report, NTT Cyber Space Labs, 2008. [Online]. Available: http://www-archive.xenproject.org/files/xensummitbo ston08/tamura_xen_summit_presentation_final.pdf
[12] A. Kivity, Y. Kamay, D. Laor, U. Lublin, and A. Liguori, “kvm: the linux virtual machine monitor,” Jul. 2007, pp. 225–230. [Online]. Available: http://www.kernel.org/doc/ols/2007/ols2007v1-pages-225-230.pdf
[13] “VMware vsphere 6 fault tolerance architecture and performance technical white
paper.” VMware, 2016. [Online]. Available: https://www.vmware.com/files/pdf/t echpaper/VMware-vSphere6-FT-arch-perf.pdf
[14] Y. Dong, W. Ye, Y. Jiang, I. Pratt, S. Ma, J. Li, and H. Guan, “Colo: Coarse-grained lock-stepping virtual machines for non-stop service,” in Proceedings of the 4th Annual Symposium on Cloud Computing, ser. SOCC ’13. New York, NY, USA: ACM, 2013, pp. 3:1–3:16. [Online]. Available: http://doi.acm.org/10.1145/2523616.2523630
[15] P. J. Tsao, Y. F. Sun, L. H. Chen, and C. Y. Cho, “E cient virtualization-based fault tolerance,” in 2016 International Computer Symposium (ICS), Dec 2016, pp. 114–119.
[16] Y. F. Sun, “Protection mechanisms for virtual machines on virtualized servers,” Ph.D. dissertation, Stony Brook University, 2017.
[17] R. Russell, “Virtio: Towards a de-facto standard for virtual i/o devices,” SIGOPS Oper. Syst. Rev., vol. 42, no. 5, pp. 95–103, Jul. 2008. [Online]. Available: http://doi.acm.org/10.1145/1400097.1400108
[18] “Page modification logging for virtual machine monitor white paper,” 2015. [Online]. Available: https://www.intel.com/content/dam/www/public/us/en/doc uments/white-papers/page-modi cation-logging-vmm-white-paper.pdf
QRCODE
 
 
 
 
 
                                                                                                                                                                                                                                                                                                                                                                                                               
第一頁 上一頁 下一頁 最後一頁 top
無相關期刊