博碩士論文 111522120 詳細資訊




以作者查詢圖書館館藏 以作者查詢臺灣博碩士 以作者查詢全國書目 勘誤回報 、線上人數:74 、訪客IP:13.59.153.178
姓名 楊淑君(Shu-Jun Yang)  查詢紙本館藏   畢業系所 資訊工程學系
論文名稱 利用 Linux Kernel Userfaultfd 實作投機式資料同步機制以改善 NCU MFTVM 停機時間
(Improving NCU MFTVM Downtime with Speculative Data Synchronization at VM Runtime based on Linux Kernel Userfaultfd)
相關論文
★ 以伸展樹為基礎的Android Binder Driver★ 應用增量式學習於多種農作物判釋之研究
★ 應用分類重建學習偵測航照圖幅中的新穎坵塊★ 用於輔助工業零件辨識之尺寸估算系統
★ 使用無紋理之3D CAD工業零件模型結合長度檢測實現細粒度真實工業零件影像分類★ 一個建立在平行工作系統上的動態全球計算平台
★ 用權重參照計數演算法執行主動物件垃圾收集★ 一個動態負載平衡之最大可能性估算計算架構
★ 利用多項系統負載資訊進行動態P2P系統重組的策略研究★ 基於Hadoop系統的雲端應用程式特徵擷取與計算監測架構
★ 適用於大型動態分散式系統的調適性計算模型★ 一個提供彈性虛擬資料中心的雲端服務平台
★ 雲端彈性虛擬機房服務平台之資源控管中心★ 一個適用於自動供應雲端系統的動態調適計算架構
★ 線性相關工作與非相關工作的探索式排程策略★ 適用於大資料集高效率的分散式階層分群演算法
檔案 [Endnote RIS 格式]    [Bibtex 格式]    [相關文章]   [文章引用]   [完整記錄]   [館藏目錄]   至系統瀏覽論文 (2026-9-1以後開放)
摘要(中) 近年來,雲端計算相關技術發展迅速,可更方便地取得計算資源進行應用,計算資源具備高可靠性與高可用性是雲端服務能穩定執行重要的指標。容錯技術是實現高可用性的重要技術之一,透過冗餘設計支持虛擬機器狀態備份,提供機器發生意外故障能快速恢復的能力,讓執行中的服務不被中斷。本研究致力於延續開發中央大學平行與分散式計算實驗室團隊在過去所開發及發展之虛擬機器容錯系統-NCU MFTVM。我們提出了一項投機式資料同步機制,透過追蹤系統記憶體狀態寫入狀態,並制定熱點寫入的頁面判斷規則,從而減少虛擬機器暫停期間需要同步的記憶體狀態資料量、降低虛擬機器暫停時間,增進系統效能。根據實驗結果顯示,改進後的方法能縮短0.35~16.88%的虛擬機器暫停時間。
摘要(英) In recent years, the rapid development of cloud computing related technology, more convenient access to computing resources for application, computing resources with high reliability and high availability is an important indicator of the stability of the implementation of cloud services. Fault tolerance is one of the most important technologies to achieve high availability. It supports virtual machine state backup through redundancy design and provides the ability to quickly recover from machine accidents, so that running services are not interrupted. This study is dedicated to the continuation of the development of the NCU MFTVM, a virtual machine fault-tolerant system developed by the Parallel and Distributed Computing Laboratory team at National Central University (NCU) in the past. We propose a speculative data synchronization mechanism that reduces the amount of RAM state data that needs to be synchronized during the VM stop time, reduces the VM stop time, and improves the system performance by tracking the memory state write state and formulating the page judgment rules for hotspot writes. According to the experimental results, the improved method can shorten the VM stop time by 0.35-16.88%.
關鍵字(中) ★ QEMU/KVM
★ 容錯系統
★ 虛擬機器
★ 持續同步
★ 頁面寫入保護
★ 頁面管理
關鍵字(英) ★ QEMU/KVM
★ Fault Tolerance
★ Virtual Machine
★ Continuous Checkpointing
★ Page Write Protection
★ Page Management
論文目次 摘要 i
ABSTRACT ii
目錄 iii
圖目錄 v
表目錄 vii
第一章、 緒論 1
1.1 研究背景 1
1.2 研究動機與目的 2
1.3 研究貢獻 3
1.4 論文架構 4
第二章、 相關研究 5
2.1 容錯系統類型 5
2.2 QEMU與KVM 6
2.3 虛擬機器容錯系統 7
2.3.1 Remus 7
2.3.2 Kemari 7
2.3.3 COLO 7
2.3.4 VMware vSphere ® Fault Tolerance 8
2.3.5 Cuju 8
2.3.6 NCU MFTVM 8
第三章、 NCU MFTVM虛擬機器系統 9
3.1 系統架構 9
3.2 系統設計 10
3.3 系統流程 12
第四章、 提出的方法 14
4.1 問題定義 14
4.2 Linux kernel支援功能 14
4.2.1 Userfaultfd 14
4.2.2 Pagemap_scan 15
4.3 NCU MFTVM效能改進 15
4.3.1 寫入追蹤流程 18
第五章、 系統效能評估 20
5.1 實驗環境 20
5.2 實驗設計 21
5.3 CPU intensive benchmark 22
5.4 I/O intensive benchmark 24
5.5 Memory intensive benchmark 30
第六章、 結論與未來方向 43
6.1 結論 43
6.2 未來方向 43
參考文獻 44
參考文獻 [1] Y. -C. Wang, J. Xue, C. Wei and C. . -C. J. Kuo, "An Overview on Generative AI at Scale With Edge–Cloud Computing," in IEEE Open Journal of the Communications Society, vol. 4, pp. 2952-2971, 2023.
[2] N. Mungoli, "Distributed AI Frameworks: Leveraging Cloud Computing for Enhanced Deep Learning Performance and Efficiency," in arXiv preprint arXiv:2304.13738, 2023.
[3] Y. Li, Y. Lin, Y. Wang, K. Ye and C. Xu, "Serverless Computing: State-of-the-Art, Challenges and Opportunities," in IEEE Transactions on Services Computing, vol. 16, no. 2, pp. 1522-1539, 1 Mar.-Apr. 2023.
[4] M. S. Jawed and M. Sajid, "A comprehensive survey on cloud computing: architecture, tools, technologies, and open issues," in International Journal of Cloud Applications and Computing (IJCAC), vol. 12, no.1, pp. 1-33, 2022.
[5] V. K. Netaji, and G. P. Bhole, "A comprehensive survey on container resource allocation approaches in cloud computing: State-of-the-art and research challenges," in Web Intelligence, vol. 19, no. 4, pp. 295-316, Jan. 2021.
[6] K. Mahajan and R. Desai, "Serving distributed inference deep learning models in serverless computing," 2022 IEEE 15th International Conference on Cloud Computing (CLOUD), Barcelona, Spain, 2022.
[7] R. M. Haris, K. M. Khan and A. Nhlabatsi, "Live migration of virtual machine memory content in networked systems," in Computer Networks, vol. 209, May 2022.
[8] J. Zhu, L. Wang, G. Xie, T. Pei, S. Oh and Z. Li, "A low redundancy and high time efficiency large-scale task assignment strategy for heterogeneous service-oriented cloud computing systems," in The Journal of Supercomputing, vol. 77, pp. 3450-3483, 2021.
[9] Y. L. Lee, S. N. Arizky, Y. R. Chen, D. Liang, and W. J. Wang, "High-availability computing platform with sensor fault resilience," Sensors, vol. 21, no. 2, p. 542, Jan. 2021.
[10] P. Kumari and P. Kaur, "A survey of fault tolerance in cloud computing," in Journal of King Saud University-Computer and Information Sciences, vol.33, no. 10, pp. 1159-1176, 2021.
[11] J. W. Wang, "An Adaptive Continuous Checkpointing Fault-Tolerant Virtual Machine System based on QEMU-KVM with libvirt," 2020. Master Thesis. National Central University.
[12] W. J. Hsu, "Use pipeline methods and Fault-Tolerant QEMU/KVM System implementation," 2021. Master Thesis. National Central University.
[13] T. C. Bressoud and F. B. Schneider, "Hypervisor-based fault tolerance," in ACM Transactions on Computer Systems (TOCS), vol. 14, no. 1, pp. 80-107, 1996.
[14] S. T. Hsu and R. C. Chang, "Continuous checkpointing: joining the checkpointing with virtual memory paging," in Software: Practice and Experience, vol. 27, no. 9, pp. 1103-1120, 1997.
[15] Y. Tamura, K. Sato, S. Kihara and S. Moriai, "Kemari: Virtual machine synchronization for fault tolerance," in Proc. USENIX Annu. Tech. Conf.(Poster Session), pp. 1-2, Jun. 2008.
[16] K. Whisnant, Z. Kalbarczyk and R. K. Iyer, "Micro-checkpointing: checkpointing for multithreaded applications," Proceedings 6th IEEE International On-Line Testing Workshop (Cat. No.PR00646), Palma de Mallorca, Spain, 2000.
[17] B. Cully, G. Lefebvre, D. Meyer, M. Feeley, N. Hutchinson and A. Warfield, "Remus: High availability via asynchronous virtual machine replication," in Proceedings of the 5th USENIX symposium on networked systems design and implementation, pp. 161-174, Apr. 2008.
[18] "Cuju: An Open Source Project for Virtualization-Based Fault Tolerance,"
https://github.com/Cuju-ft/Cuju, 2019 (accessed Jun. 2024).
[19] Y. Dong, W. Ye, Y. Jiang, I. Pratt, S. Ma, J. Li and H. Guan, "Colo: Coarse-grained lock-stepping virtual machines for non-stop service," in Proceedings of the 4th annual Symposium on Cloud Computing, pp. 1-16, Oct. 2013.
[20] F. Bellard, "QEMU, a fast and portable dynamic translator," in USENIX annual technical conference, FREENIX Track, vol. 41, no. 46, pp. 10-5555, Apr. 2005.
[21] A. Kivity, Y. Kamay, D. Laor, U. Lublin and A. Liguori, "kvm: the Linux virtual machine monitor," in Proceedings of the Linux symposium, vol. 1, no. 8, pp. 225-230, Jul. 2007.
[22] T. Bie and H. Zhu, "Vhost-User," in Data Plane Development Kit (DPDK): A Software Optimization Guide to the User Space-Based Network Applications, H. Zhu, Ed., 1st ed. Boca Raton: CRC Press, 2020, pp. 251-262. doi: 10.1201/9780429353512.
[23] T. Bie, C. Ouyang, and H. Zhu, "Virtio," in Data Plane Development Kit (DPDK): A Software Optimization Guide to the User Space-Based Network Applications, H. Zhu, Ed., 1st ed. Boca Raton: CRC Press, 2020, pp. 229-250. doi: 10.1201/9780429353512.
[24] "How Fault Tolerance Works," VMware,
https://docs.vmware.com/en/VMware-vSphere/8.0/vsphere-availability/GUID-623812E6-D253-4FBC-B3E1-6FBFDF82ED21.html, 2022 (accessed Jun. 2024).
[25] P. -J. Tsao, Y. -F. Sun, L. -H. Chen and C. -Y. Cho, "Efficient Virtualization-Based Fault Tolerance," 2016 International Computer Symposium (ICS), Chiayi, Taiwan, pp. 114-119, 2016.
[26] "Userfautltfd," The Linux Kernel,
https://docs.kernel.org/admin-guide/mm/userfaultfd.html, (accessed Jun. 2024).
[27] "Pagemap Scan IOCTL," The Linux Kernel,
https://docs.kernel.org/admin-guide/mm/pagemap.html#pagemap-scan-ioctl, (accessed Jun. 2024).
[28] "Phoronix Test Suite Suites," OpenBenchmarking.org,
https://openbenchmarking.org/suites/pts, 2024 (accessed Jun. 2024).
[29] "Timed Linux Kernel Compilation," OpenBenchmarking.org,
https://openbenchmarking.org/test/pts/build-linux-kernel-1.16.0,
2024 (accessed Jun. 2024).
[30] "Flexible IO Tester," OpenBenchmarking.org,
https://openbenchmarking.org/test/pts/fio, 2024 (accessed Jun. 2024).
[31] " RAMspeed SMP," OpenBenchmarking.org,
https://openbenchmarking.org/test/pts/ramspeed, 2024 (accessed Jun. 2024).
指導教授 王尉任(Wei-Jen Wang) 審核日期 2024-7-18
推文 facebook   plurk   twitter   funp   google   live   udn   HD   myshare   reddit   netvibes   friend   youpush   delicious   baidu   
網路書籤 Google bookmarks   del.icio.us   hemidemi   myshare   

若有論文相關問題,請聯絡國立中央大學圖書館推廣服務組 TEL:(03)422-7151轉57407,或E-mail聯絡  - 隱私權政策聲明