中大機構典藏-NCU Institutional Repository-提供博碩士論文、考古題、期刊論文、研究計畫等下載:Item 987654321/93364
English  |  正體中文  |  简体中文  |  Items with full text/Total items : 80990/80990 (100%)
Visitors : 41651105      Online Users : 1471
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
Scope Tips:
  • please add "double quotation mark" for query phrases to get precise results
  • please goto advance search for comprehansive author search
  • Adv. Search
    HomeLoginUploadHelpAboutAdminister Goto mobile version


    Please use this identifier to cite or link to this item: http://ir.lib.ncu.edu.tw/handle/987654321/93364


    Title: 基於深度學習從核醣核酸定序表達譜推斷外周血單核細胞之細胞組成;PBMC Cell Composition Inference from RNA-seq Expression Profile Based on Deep Learning
    Authors: 陳彥霖;Chen, Yen-Lin
    Contributors: 資訊工程學系
    Keywords: 外周血單核細胞;深度學習;狄利克雷分佈;Peripheral blood mononuclear cells;Deep learning;Dirichlet distribution
    Date: 2023-07-28
    Issue Date: 2024-09-19 16:56:11 (UTC+8)
    Publisher: 國立中央大學
    Abstract: 單細胞轉錄組定序是一項很有前途的技術,可提供有關單細胞水平的基因表達模式的詳細信息。然而,單細胞轉錄組定序成本非常高,尤其是在分析大量細胞時。為了克服這一限制,研究人員開發了從次世代核醣核酸定序數據推斷細胞組成的方法,同時還利用了深度學習算法。這些基於機器學習的方法通常需要大量訓練數據,從而導致數據生成技術的發展,這些技術可以生成用於訓練細胞反卷積模型的偽批量核醣核酸定序樣本。 但是,數據生成方法還有改進的空間來達到更佳的表現。在本研究中,我們使用狄利克雷分佈來生成更接近真實場景的合成核醣核酸定序樣本。我們構建了數個基於深度學習網路與自注意力機制的細胞反卷積模型,這些模型是在狄利克雷方法生成的數據上訓練的,旨在實現比現有方法更優越的性能。為了評估模型的有效性,我們使用兩個真實的人類外周血單核細胞數據集作為測試基準。我們的結果表明,我們的模型在這兩個外周血單核細胞數據集上優於其他現有方法,顯示皮爾森相關性分別約為0.87和0.78。值得注意的是,我們的模型對人類外周血單核細胞中比例較小的細胞類型有更精確的預測。由於數據集之間的差異,我們還強調了為特定數據集構建單獨模型以優化性能的重要性。;Single-cell RNA sequencing (scRNA-seq) is a promising technique that provides detailed information about gene expression patterns at single-cell level. However, it can be prohibitively expensive, particularly when profiling a large number of cells. To overcome this limitation, researchers have developed methods to infer cell composition from next-generation RNA sequencing (RNA-seq) data, also utilizing deep learning algorithms. These machine leaning-based methods typically require a large amount of training data, leading to the development of data generation techniques that produce pseudo-bulk RNA-seq samples for training cell deconvolution models. However, there is room to improve data generation methods to achieve better performance. In this study, we use the Dirichlet distribution to generate synthetic RNA-seq samples that more closely resemble real-world scenarios. We construct deep learning-based deconvolution models trained on this Dirichlet-generated data, aiming to achieve superior performance compared to existing methods. To evaluate the models′ effectiveness, we employ two real human peripheral blood mononuclear cell (PBMC) datasets as testing benchmarks. Our results demonstrate that our models outperform other existing methods on these two PBMC datasets, showcasing Pearson correlations of approximately 0.87 and 0.78, respectively. Notably, our models achieve more precise predictions for cell types with smaller proportions in human PBMCs. We also emphasize the importance of building individual models for specific datasets to optimize performance due to the variance between datasets.
    Appears in Collections:[Graduate Institute of Computer Science and Information Engineering] Electronic Thesis & Dissertation

    Files in This Item:

    File Description SizeFormat
    index.html0KbHTML9View/Open


    All items in NCUIR are protected by copyright, with all rights reserved.

    社群 sharing

    ::: Copyright National Central University. | 國立中央大學圖書館版權所有 | 收藏本站 | 設為首頁 | 最佳瀏覽畫面: 1024*768 | 建站日期:8-24-2009 :::
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - 隱私權政策聲明