中大機構典藏-NCU Institutional Repository-提供博碩士論文、考古題、期刊論文、研究計畫等下載:Item 987654321/93269
English  |  正體中文  |  简体中文  |  全文筆數/總筆數 : 80990/80990 (100%)
造訪人次 : 41270442      線上人數 : 461
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
搜尋範圍 查詢小技巧:
  • 您可在西文檢索詞彙前後加上"雙引號",以獲取較精準的檢索結果
  • 若欲以作者姓名搜尋,建議至進階搜尋限定作者欄位,可獲得較完整資料
  • 進階搜尋


    請使用永久網址來引用或連結此文件: http://ir.lib.ncu.edu.tw/handle/987654321/93269


    題名: 集成樣態對特徵選擇的效能影響—以微陣列資料為例
    作者: 鄭淨文;Cheng, Ching-Wen
    貢獻者: 資訊管理學系
    關鍵詞: 特徵選擇;穩定性;微陣列資料集;高維度資料集;集成特徵選擇;feature selection;stability;microarray datasets;high-dimensional datasets;Ensemble Feature Selection
    日期: 2023-07-24
    上傳時間: 2024-09-19 16:51:16 (UTC+8)
    出版者: 國立中央大學
    摘要: 本研究旨在解決特徵選擇方法在高維度少樣本的應用領域中的穩定性問題。儘管特徵選擇方法在提升模型的預測性能方面發揮了重要作用,但在實驗中,資料的微小變動可能導致選擇的特徵有顯著差異,從而影響模型的可信度。為了提升特徵選擇的穩定性,本研究探討集成學習對於特徵選擇的影響,並進一步分析同質集成與異質集成架構的最佳參數與組合。
    集成特徵選擇主要可以分為同質集成、異質集成與混合集成,同質集成透過對訓練集進行多次抽樣來製造資料的多樣性,並使用同一特徵選擇方法進行多次評估。異質集成則是採用多種不同特徵選擇來製造方法的多樣性。混合集成則是同時採用資料多樣性與方法多樣性的特點。
    本研究根據混合集成的概念提出兩種混合式的集成架構:階層式集成和抽樣異質集成。研究結果顯示,同質集成能有助於提升特徵選擇的穩定性,但可能會微幅降低預測性能;異質集成對於提升特徵選擇的效能有限;混合集成中以階層式集成表現優於抽樣異質集成,能在保持預測性能的同時,進一步提升特徵選擇的穩定性。本研究期望這些研究成果能為高維度少樣本的研究領域,提供更穩定的特徵選擇方法。;This study addresses the stability issues of feature selection methods in high-dimensional and low-sample-size application domains. Despite the critical role of feature selection methods in enhancing prediction performance, minor variations in the data during experiments can lead to significant differences in the selected features, thereby impacting the credibility of the models. To improve the stability of feature selection, this study investigates the influence of ensemble learning on feature selection. Further, it analyzes the optimal parameters and combi-nations of the homogeneous and the heterogeneous ensemble frameworks.
    Ensemble feature selection can be divided into the homogeneous, the heterogeneous, and the hybrid ensembles. The homogeneous ensemble creates diversity in the data by performing multiple samplings on the training set and utilizing the same feature selection method for mul-tiple evaluations. In contrast, the heterogeneous ensemble introduces methodological diversity by employing various distinct feature selection methods. The hybrid ensembles, meanwhile, leverage both data diversity and method diversity.
    Based on the concept of the hybrid ensemble, this study proposes two hybrid ensemble frameworks: the hierarchical ensemble and the sampling heterogeneous ensemble. The results show that while the homogeneous ensemble can enhance the stability of feature selection, they may slightly decrease prediction performance. The heterogeneous ensemble has limited effects on improving the overall evaluation of feature selection. Among the hybrid ensembles, the hi-erarchical ensemble outperforms the sampling heterogeneous ensemble, as it maintains predic-tion performance and further enhances the stability of feature selection. This study hopes these findings can provide more stable feature selection methods for the research domain of high-dimensional and low-sample-size datasets.
    顯示於類別:[資訊管理研究所] 博碩士論文

    文件中的檔案:

    檔案 描述 大小格式瀏覽次數
    index.html0KbHTML16檢視/開啟


    在NCUIR中所有的資料項目都受到原著作權保護.

    社群 sharing

    ::: Copyright National Central University. | 國立中央大學圖書館版權所有 | 收藏本站 | 設為首頁 | 最佳瀏覽畫面: 1024*768 | 建站日期:8-24-2009 :::
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - 隱私權政策聲明