Master's/Doctoral Thesis 88522021 Detailed Record




Name 張祜嘉 (Hu-Chia Chang)   Department Computer Science and Information Engineering
Thesis title Performance Evaluation of A Distributed Database of Repetitive Elements in Complete Genomes
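  1. This electronic thesis is authorized for immediate open access.
  2. Electronic full texts that have reached their open-access date are licensed to users only for personal, non-profit searching, reading, and printing for the purpose of academic research.
  3. Please comply with the Copyright Act of the Republic of China. Do not reproduce, distribute, adapt, repost, or broadcast this work without authorization, to avoid violating the law.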

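Abstract (Chinese) The previous version of the Repeat Sequence Database was built on a centralized database system. The database currently holds a large amount of data, and biological data keeps growing, so its performance has become an important issue. To obtain better performance, we built a distributed Repeat Sequence Database. In a distributed database, the data distribution method is an important load-balancing mechanism. We designed a number of data distribution methods and ran experiments to find the one best suited to repeat sequence data, and we also developed an intelligent component that assists our system in obtaining better performance.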
Abstract (English) The original version of the Repeat Sequence Database (RSDB) was built on centralized database systems (CDBSs). It currently contains a large amount of data, and the volume of biological data is increasing rapidly, so the performance of RSDB has become an important issue. Distributed RSDB (DRSDB) is built on distributed database systems (DDBSs) in order to obtain better performance. Data distribution serves as an important load-balancing mechanism. We design a number of data distribution approaches and run experiments to find those best suited to our particular system. The results show that the query processor does not always choose the right data access paths for queries, so we develop an intelligent component that helps our system execute queries wisely and obtain much better performance.
Keywords (Chinese) ★ Database
★ Repeat Sequence
★ Distributed
Keywords (English) ★ Database
★ Complete Genomes
★ Repetitive Elements
★ Performance Evaluation
★ Distributed
Table of contents Contents
List of Figures
List of Tables
Chapter 1 Introduction
1.1 Motivation
1.2 Goal and Purpose
1.3 Related Work
1.4 Organization of This Thesis
Chapter 2 Repeat Sequence Database
2.1 System Architecture
2.2 Queries on RSDB
Chapter 3 Experimental Design
3.1 Experimental Environment
3.2 Index Strategies for DRSDB
3.3 Data Distribution
3.4 Experimental Queries
Chapter 4 Performance Evaluation
4.1 Search By Feature
4.2 Search By Range
4.3 Search By Pattern
4.4 Search By Accession Number/Sequence ID
4.5 Experiments on Other Organisms
4.6 Concurrent Requests
4.7 The Intelligent Component
Chapter 5 Discussion and Conclusion
References
Appendix
List of Figures
Figure 1. The relationship among three tables
Figure 2. System architecture of RSDB
Figure 3. Fields in Search By Feature web page
Figure 4. Search By Feature results
Figure 5. Specify begin and end position in Search By Range web page
Figure 6. The number of identical repeats and repeat copies are shown in the result page
Figure 7. Search By Repeat Pattern web page
Figure 8. RSDB returns 277 patterns matching user queries
Figure 9. System architecture of DRSDB
Figure 10. Data structure of Trie for Search By Pattern
Figure 11. Distribution by interleaving with each accession number
Figure 12. Distribution by interleaving with each value of pos_begin or chromosome
Figure 13. Distribution by interleaving with each record of a column value
Figure 14. Distribution by statistics in RSDB’s data warehouse
Figure 15. Comparison among different indexes of Query 6
Figure 16. Experimental results of Search By Feature in CDBSs
Figure 17. Comparison between CDBSs and DDBSs of Query 1
Figure 18. Comparison between CDBSs and DDBSs of Query 7
Figure 19. Comparison between CDBSs and DDBSs of Query 3
Figure 20. Comparison between CDBSs and DDBSs of Query 2
Figure 21. Comparison between CDBSs and DDBSs of Query 6
Figure 22. Comparison between CDBSs and DDBSs of Query 4
Figure 23. Comparison between CDBSs and DDBSs of Query 5
Figure 24. Experimental results of Search By Range in CDBSs
Figure 25. Comparison between CDBSs and DDBSs of Query 1
Figure 26. Comparison between CDBSs and DDBSs of Query 2
Figure 27. Comparison between CDBSs and DDBSs of Query 3
Figure 28. Comparison between CDBSs and DDBSs of Search By Range
Figure 29. Comparison among queries using different range interval
Figure 30. Experimental results in CDBSs of Search by Pattern
Figure 31. Experimental result of pattern with or without ‘?’
Figure 32. Comparison between different index approaches
Figure 33. Comparison among three index methods
Figure 34. Concurrent Requests of Search By Feature
Figure 35. Concurrent Requests of Search By Range
Figure 36. Concurrent Requests of Search By Pattern
List of Tables
Table 1. The amount of data in RSDB
Table 2. Available options for fields
Table 3. Queries used to evaluate Search By Feature
Table 4. Queries used to evaluate Search By Range
Table 5. Queries used to evaluate Search By Pattern
Table 6. Queries used to perform simulations
Table 7. Experimental results of Search By Feature when the data size is 24.7M records
Table 8. Experimental results of Search By Range when the data size is 85.1M records
Table 9. Comparison among three index methods
Table 10. Experimental results of Search By Pattern using Trie
Table 11. Experimental result of Search By AC/ID
Table 12. The amount of identical repeats for each organism
Table 13. The amount of total repeat copies for each organism
References [1] Elmasri,R. and Navathe,S.B. (1994) Fundamentals of Database Systems, Second Edition. Addison-Wesley Publishing Company, Menlo Park, CA.
[2] Horng,J.T., Lin,J.H. and Kao,C.Y. (2001) RSDB – A Database of Repetitive Elements in Complete Genomes. Proceedings of the Atlantic Symposium on Computational Biology and Genome Information Systems & Technology, Durham, NC, USA, 220-223.
[3] Horowitz,E., Sahni,S. and Mehta,D. (1995) Fundamentals of Data Structures in C++. W. H. Freeman and Company.
[4] Mehta,M. and DeWitt,D.J. (1997) Data placement in shared-nothing parallel database systems. The VLDB Journal, 6. 53-72.
[5] Mukkamala,R. (1989) Measuring the Effect of Data Distribution Models on Performance Evaluation of Distributed Database Systems. IEEE Transactions on Knowledge and Data Engineering, 1. 494-507.
[6] Nicola,M. and Jarke,M. (2000) Performance Modeling of Distributed and Replicated Databases. IEEE Transactions on Knowledge and Data Engineering, 12. 645-672.
[7] Özsu,M.T. and Valduriez,P. (1996) Distributed and Parallel Database Systems. ACM Computing Surveys, 28. 125-128.
[8] Özsu,M.T. and Valduriez,P. (1999) Principles of Distributed Database Systems, Second Edition. Prentice-Hall.
[9] Tamhankar,A.M. and Ram,S. (1998) Database Fragmentation and Allocation: An Integrated Methodology and Case Study. IEEE Transactions on Systems, Man, and Cybernetics – Part A: Systems and Humans, 28. 288-305.
Advisor 洪炯宗 (Jorng-Tzong Horng)   Approval date 2003-3-5