Thesis record 87325035: detailed information




Author 卓文福 (Wen-Fu Cho)   Department Computer Science and Information Engineering
Thesis title 應用資料採礦於基因體之重複序列資料庫
(Data Mining for Regulatory Elements in Repeat Sequences)
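  1. This electronic thesis is authorized for immediate open access.
  2. Electronic full texts that have reached their open-access date are licensed to users solely for personal, non-profit searching, reading, and printing for the purpose of academic research.
  3. Please comply with the Copyright Act of the Republic of China. Do not reproduce, distribute, adapt, repost, or broadcast the work without authorization, to avoid violating the law.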

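Abstract (Chinese) Since the Human Genome Project was launched, large-scale genome sequencing technologies have advanced rapidly and the number of known DNA sequences has grown substantially. These genomes contain many sequences that occur repeatedly (repeat sequences), which play an important role in medical diagnosis and research. A database that stores repeat sequences comprehensively has been developed; it will hold the repeat sequences found in the genomes of all organisms and will be made available to bioinformatics researchers in Taiwan. Transcription factor databases store many transcription factors. In this thesis we mark transcription factors on the repeat sequences and apply the association rule technique from data mining to the combinations of transcription factors that occur in repeat sequences. From the discovered association rules we select the more meaningful ones, remove redundant rules, and use the associations to perform partial classification of the repeat sequences in the genome. Our experiments cover human chromosome 22 and other genomes. We expect the results to provide valuable information for genome research.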
Abstract (English) The Human Genome Project began in 1988, and many genomes have been sequenced since then. Repeat sequences in genome sequences play an important role in medical diagnosis and research. The transcription factor database TRANSFAC collects many promoter classes. In this thesis, we first mark the transcription factor binding sites in the repeat sequences and then apply data mining techniques to mine association rules from the combinations of binding sites. We then prune the discovered associations, removing the insignificant ones to obtain a set of useful rules. Finally, we use the discovered association rules to partially classify the repeat sequences in our repeat database. We experiment on several genomes, including C. elegans, human chromosome 22, and yeast.
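The mining step described in the abstract can be illustrated with a minimal sketch. It is not the thesis implementation: the binding-site names, the toy repeat sequences, and the thresholds are all hypothetical, and the search is a plain level-wise (Apriori-style) procedure. Each repeat sequence is treated as a transaction whose items are the binding sites marked on it.

```python
from itertools import combinations

# Hypothetical toy data: each repeat sequence is reduced to the set of
# transcription factor binding sites marked on it (names are made up).
repeats = {
    "rep1": {"SiteA", "SiteB", "SiteC"},
    "rep2": {"SiteA", "SiteB"},
    "rep3": {"SiteA", "SiteC"},
    "rep4": {"SiteB", "SiteC"},
    "rep5": {"SiteA", "SiteB", "SiteC"},
}


def support(itemset, transactions):
    """Fraction of transactions that contain every item in itemset."""
    hits = sum(1 for t in transactions if itemset <= t)
    return hits / len(transactions)


def frequent_itemsets(transactions, min_support):
    """Level-wise search: frequent k-itemsets seed the (k+1)-candidates."""
    transactions = [frozenset(t) for t in transactions]
    items = sorted({i for t in transactions for i in t})
    current = [frozenset([i]) for i in items]
    frequent = {}
    k = 1
    while current:
        level = {}
        for candidate in current:
            s = support(candidate, transactions)
            if s >= min_support:
                level[candidate] = s
        frequent.update(level)
        # Join frequent k-itemsets and keep only candidates whose
        # k-subsets are all frequent (downward-closure pruning).
        k += 1
        joined = {a | b for a in level for b in level if len(a | b) == k}
        current = [
            c for c in joined
            if all(frozenset(sub) in level for sub in combinations(c, k - 1))
        ]
    return frequent


def association_rules(frequent, min_confidence):
    """Derive rules lhs -> rhs with confidence = supp(lhs | rhs) / supp(lhs)."""
    rules = []
    for itemset, s in frequent.items():
        if len(itemset) < 2:
            continue
        for r in range(1, len(itemset)):
            for lhs in combinations(sorted(itemset), r):
                lhs = frozenset(lhs)
                confidence = s / frequent[lhs]
                if confidence >= min_confidence:
                    rules.append((lhs, itemset - lhs, s, confidence))
    return rules


if __name__ == "__main__":
    freq = frequent_itemsets(repeats.values(), min_support=0.5)
    for lhs, rhs, s, c in association_rules(freq, min_confidence=0.7):
        print(sorted(lhs), "->", sorted(rhs), f"support={s:.2f}", f"confidence={c:.2f}")
```

The pruning and partial-classification stages mentioned in the abstract are not shown here; they would operate on the rule list this sketch returns.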
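Keywords (Chinese) ★ Transcription factor database
★ Transcription factor
★ Association rule
★ Data mining
★ Repeat sequence
★ Genome sequence
Keywords (English) ★ Repeat Sequences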
★ TRANSFAC
★ Transcription Factor
★ Association Rule
★ Data Mining
★ Genome Sequence
Table of contents Chapter 1 Introduction
1.1 Problem Statement
1.2 Brief Description of Our Method and Goal
1.3 Organization of the Thesis
Chapter 2 Related Work
2.1 The Human Genome Project
2.2 Data Mining
2.3 Classification
2.4 Data Mining for Regulatory Elements
2.5 The Properties of Repeat Sequences in the Repeat Database
2.6 Association Rules
(A) Algorithm Apriori
(B) Algorithm AprioriTid
2.7 Discovering Rules
Chapter 3 Our Approach
3.1 Information in TRANSFAC
3.1.1 Parsing
3.1.2 Update
3.2 The Properties of the Data in TRANSFAC
3.3 Preprocessing and Mapping between the Data Repeat Database and TRANSFAC
3.4 Significance Measure
3.5 Pruning and Structuring Association Results
Chapter 4 The Implementation of Our Approach
4.1 The Flow of Our Approach
Chapter 5 Experiments
Chapter 6 Conclusions
References
Advisor 洪炯宗 (Jorng-Tzong Horng)   Approval date 2000-7-6