摘要: | 替代性剪接 (Alternative splicing) 是真核生物的基因表現過程中一種重要的現象,使得一個基因能有多種表現的產物,即蛋白質 (Protein),而這個現象目前正引吸引許多生物學家積極地投入研究當中。蛋白質序列、訊息核醣核酸和表現序列標幟序列提供了關於基因的替代性剪接相關有用資訊。 本論文主要的貢獻為設計一分析平台,透過電腦運算分析的方式,可以大量地分析所有人類基因,萃取出含有替代性剪接資訊的基因,並綜合基因表現資料、功能資訊、以及跨物種基因的比較,運用統計(Statistics)和資料探勘 (Data mining)的方式,整理並找尋mRNA上的具功能的區塊(Functional sites),供生物學家驗證。 我們定義替代性剪接(Alternative splicing)的型態 (Types),並設計一套演算法 (Algorithm),萃取替代性剪接資訊,並將這些資訊,儲存成資料庫。此外,人類基因表現資訊及基因功能資訊也可以幫助替代性剪接(Alternative splicing)的分析。我們將基因,依據tissue-specificity和 Function做分類,綜合SpliceMotif 資料庫來進行Motif 的預測。我們應用DNA Motif預測的工具於已搜集好的SpliceMotif資料庫,並找出與替代性剪接相關連的 motif,稱SpliceMotif。透過實際案例探討,我們證明本論文所研究的工具,可以發現Exonic Splicing Enhancer (ESE)於所挑選的基因中。 Generally speaking, in eukaryotes, alternative splicing (AS) mechanism for pre-mRNA plays an important role to generate multiple isoforms. In this thesis, we propose an integrated approach to automatically identify the conserved sequences in selected exon/intron regions of a gene group. Firstly, the alternative splicing database, namely ProSplicer, is constructed in our previously research and used in the system. Secondly, several alternative splicing types such as exon skipping, alternative 5’ splicing sites, alternative 3’ splicing sites and mutually exclusive exons are derived and extracted from evidence data. Finally, for each type of alternative splicing, the flanking intronic sequences are collected and then used for motif discovery tools. Alternative splicing related conserved motif, namely SpliceMotif, are computationally detected. The tissue-specific information and gene functionalities corresponding to the selected regions are also taken into account. The main contribution of this work is to establish an integrated platform for detecting conserved sequences within exon/intron regions of particular alternative splicing type, e.g., exon skipping. After several case studies for experimenting the proposed system, the system can detect the known functional sites, which are experimentally verified, related alternative splicing modes, e.g., exonic splicing enhancer (ESE). |