
    Please use this identifier to cite or link to this item: http://ir.lib.ncu.edu.tw/handle/987654321/67589

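    Title: A Study of NoSQL Performance and Stability: The Case of HBase (NoSQL效能與穩定性之研究-以HBase為例)
    Authors: Liu, Yi-Cheng (劉一正)
    Contributors: Executive Master's Program, Department of Information Management
    Keywords: Big Data; Efficiency; Stability; Impact factor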
    Date: 2015-06-12
    Issue Date: 2015-07-30 22:53:11 (UTC+8)
    Publisher: National Central University
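    Abstract: In today's increasingly fierce business competition, every company strives to get closer to consumer needs and gain a competitive advantage. In recent years, data mining and big data have become prominent fields, and how to extract the information and value hidden in data has drawn wide attention. For analyzing and predicting consumer behavior more precisely, extracting commercially valuable information from customer data, formulating business strategy, strengthening competitive advantage, and maximizing profit, big data is a powerful tool.

    Although the Hadoop big data platform brings many advantages, such as horizontal scalability and the ability of the cluster to tolerate a single node failure, it also has inherent limitations. Using a case study approach, this research examines the performance and stability problems encountered when running the NoSQL database HBase. It compiles the problems encountered in actual HBase operation, applies systematic, structured problem analysis and resolution methods to identify the key factors behind operational problems and poor performance, and formulates feasible solutions for each factor. The solutions are then evaluated, and the recurrence rate of the problems is tracked to confirm their effectiveness.

    The results show that HBase performance and stability are affected by underlying components, such as the stability of the Hadoop platform and the parameter settings of the Hadoop DataNode and the operating system. The number of regions hosted by each RegionServer is also a key factor, and row key design has a major effect on read and write performance. The study also finds that HBase performance and stability problems are closely related to inadequate management procedures. In addition to the system-level improvements above, it therefore proposes improvements to management procedures, including system parameter consistency, version control and software deployment, change management procedures, system monitoring, and emergency handling, in order to reduce human error and build a more stable cluster.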
    Business competition has become increasingly intense. To win, every company now focuses on meeting consumer demand. In recent years, data mining and big data have become hot topics. Extracting valuable information from various databases to benefit the business, conduct accurate analysis, and predict consumer behavior has become critical, and big data tools help with all of these. They can help businesses develop strategies, strengthen competitive advantage, and maximize profits.

    Although Hadoop, a big data platform, brings many advantages, such as scalability and tolerance of single-node failure, it also has disadvantages. This research is a case study of the NoSQL database HBase as deployed at a prominent foundry company in Taiwan. It systematically studies the problems encountered, identifies the factors that affect the system's efficiency and stability, proposes a solution for each factor, and then evaluates the effectiveness of the solutions by tracking how often the problems recur.

    The results show that HBase's performance and stability are likely to be affected by factors such as Hadoop platform stability, the DataNode xceiver parameter, operating system parameters, and the region count of each RegionServer. Row key design also affects read/write performance. The study further finds that HBase's stability problems are related to inadequate process management. This suggests that improving HBase's performance and stability requires work from both the system-level and the management perspective. Improving management controls, such as parameter consistency, version control of parameters, deployment processes, change management, monitoring, and emergency management, can reduce human error and yield a more stable distributed cluster system.
    Appears in Collections: [Executive Master's Program, Department of Information Management] Theses and Dissertations
