參考文獻 |
[1] K. Dooley, Designing Large Scale Lans, 1 edition. Beijing ; Sebastopol, CA: O’Reilly Media, 2001.
[2] A. Oliner and J. Stearley, “What Supercomputers Say: A Study of Five System Logs,” in 37th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, 2007. DSN ’07, pp. 575–584, 2007.
[3] W. Feng, “Making a Case for Efficient Supercomputing,” Queue, vol. 1, no. 7, pp. 54–64, Oct. 2003.
[4] C.-D. Lu, Scalable Diskless Checkpointing for Large Parallel Systems. University of Illinois at Urbana-Champaign, 2005.
[5] Schroeder, Bianca, and Garth A. Gibson. "Understanding failures in petascale computers." Journal of Physics: Conference Series. Vol. 78. No. 1. IOP Publishing, 2007.
[6] J. Lang, M. Liu, Q. Wang, W. Kuehn, Z. Liu, and H. Xu, “Intelligent Platform Management Controller for ATCA Compute Nodes,” in Real Time Conference, 2009. RT ’09. 16th IEEE-NPSS, pp. 35–37, 2009.
[7] P. Perek, D. Makowski, P. Predki, and A. Napieralski, “ATCA carrier board with dedicated IPMI controller,” in Mixed Design of Integrated Circuits and Systems (MIXDES), 2010 Proceedings of the 17th International Conference, pp. 139–143, 2010.
[8] “PICMG.” [Online]. Available: https://www.picmg.org/..
[9] “Intelligent Platform Management Interface (IPMI) Information,” Intel. [Online]. Available: http://www.intel.com/content/www/us/en/servers/ipmi/ipmi-home.html.
[10] “IPMItool.” [Online]. Available: http://sourceforge.net/projects/ipmitool/
[11] Zawada, A., et al. "ATCA Carrier Board with IPMI supervisory circuit." Mixed Design of Integrated Circuits and Systems, 2008. MIXDES 2008. 15th International Conference on. IET, 2008.
[12] Ketchum, Breton A., and Viswa N. Sharma. "Shelf management controller with hardware/software implemented dual redundant configuration." U.S. Patent No. 7,827,442. 2 Nov. 2010.
[13] I. Habib, “Virtualization with KVM,” Linux J, vol. 2008, no. 166, pp. 8, Feb. 2008.
[14] Y. Goto, “Kernel-based virtual machine technology,” Fujitsu Sci. Tech. J., vol. 47, pp. 362–368, 2011.
[15] T. Hirt, “KVM-The Kernel-Based virtual machine,” Red Hat Inc, 2010.
[16] D. J. Protti, “Linux KVM as a learning tool,” Linux J., vol. 2009, no. 186, p. 3, 2009.
[17] “QEMU.” [Online]. Available: http://wiki.qemu.org/Main_Page.
[18] “libvirt: The virtualization API.” [Online]. Available: http://libvirt.org/.
[19] M. Bolte, M. Sievers, G. Birkenheuer, O. Niehörster, and A. Brinkmann, “Non-intrusive virtualization management using libvirt,” in Proceedings of the Conference on Design, Automation and Test in Europe, pp. 574–579, 2010,
[20] B. Victoria, “Creating and Controlling KVM Guests using libvirt,” Univ. Vic., 2009.
[21] I. P. Egwutuoha, D. Levy, B. Selic, and S. Chen, “A survey of fault tolerance mechanisms and checkpoint/restart implementations for high performance computing systems,” J. Supercomput., vol. 65, no. 3, pp. 1302–1326, Sep. 2013.
[22] R. Rajachandrasekar, X. Besseron, and D. K. Panda, “Monitoring and Predicting Hardware Failures in HPC Clusters with FTB-IPMI,” in Parallel and Distributed Processing Symposium Workshops PhD Forum (IPDPSW), 2012 IEEE 26th International, pp. 1136–1143, 2012.
[23] C.-L. Fang, D. Liang, F. Lin, and C.-C. Lin, “Fault tolerant Web Services,” J. Syst. Archit., vol. 53, no. 1, pp. 21–38, Jan. 2007.
[24] A. Muller and S. Wilson, “Virtualization with VMware ESX server,”, Syngress Publishing, 2005.
[25] P. Li, “Selecting and using virtualization solutions: our experiences with VMware and VirtualBox,” J. Comput. Sci. Coll., vol. 25, no. 3, pp. 11–17, 2010.
[26] “VMware. (2012). vSphere Availability.” Available: http://pubs.vmware.com/vsphere-50/topic/com.vmware.ICbase/PDF/vsphere-esxi-vcenter-server-501-availability-guide.pdf
[27] Linux Programmer′s Manual : kill - send signal to a process. Available: http://man7.org/linux/man-pages/man2/kill.2.html
[28] KnowThyUbuntu. Available 2009: https://help.ubuntu.com/community/KnowThyUbuntu
[29] FIVE NINES: CHASING THE DREAM? Available: http://www.continuitycentral.com/feature0267.htm
[30] Achieving Backplane Redundancy in AdvancedTCA Systems Available: http://go.radisys.com/rs/radisys/images/paper-atca-achieving.pdf |