

    Please use this permanent URL to cite or link to this item: https://ir.lib.ncu.edu.tw/handle/987654321/106495


    Title: Combining instance selection for better missing value imputation
    Authors: 蔡志豐; Tsai, Chih-Fong; Chang, Fu-Yu
    Contributor: Department of Information Management, College of Management
    Keywords: Data mining; Datasets; Decision making; Incomplete data; Instance selection; Missing value imputation; Studies
    Date: 2016-12-01
    Uploaded: 2026-04-23 13:24:59 (UTC+8)
    Publisher: Elsevier Inc.; New York: Elsevier Inc
    Abstract:
      • The effect of performing instance selection (IS) on missing value imputation (MVI) is studied.
      • Four different processes for combining IS and MVI are proposed and compared.
      • Performing IS first and MVI second outperforms the other processes over categorical and numerical datasets.

    In practice, the data collected for data mining usually contain some missing values. Imputation is the process of replacing the missing values in incomplete datasets. It usually estimates the missing values by reasoning from the observed data. Consequently, the effectiveness of missing value imputation depends heavily on the observed data (or complete data) in the incomplete datasets. The objective of this study is to investigate how performing instance selection, which filters out some noisy data (or outliers) from a given dataset, affects the imputation task. Specifically, four different processes for combining instance selection and missing value imputation are proposed and compared in terms of data classification. The experimental results, based on 29 datasets containing categorical, numerical, and mixed attribute types of data, show that performing instance selection first and imputation second allows the k-NN and SVM classifiers to outperform the other processes over the categorical and numerical datasets. For the mixed-type datasets, k-NN performs best when instance selection is performed again on the datasets produced by the second process. Finally, some specific decision rules about when to employ which process are also provided for future research.
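The "instance selection first, imputation second" process described in the abstract can be illustrated with a minimal sketch. The abstract does not name the algorithms used, so everything here is an assumption: an ENN-style (edited nearest neighbour) filter stands in for instance selection, a k-NN mean fill stands in for imputation, and the function names, the toy data, and the choice to run selection only on the complete rows are all hypothetical.

```python
import math
from collections import Counter

def distance(a, b):
    """Euclidean distance over the features observed (not None) in both rows."""
    pairs = [(x, y) for x, y in zip(a, b) if x is not None and y is not None]
    if not pairs:
        return math.inf
    return math.sqrt(sum((x - y) ** 2 for x, y in pairs))

def nearest(row, pool, k, skip=None):
    """Indices of the k rows in pool closest to row, optionally skipping one index."""
    order = sorted((i for i in range(len(pool)) if i != skip),
                   key=lambda i: distance(row, pool[i]))
    return order[:k]

def select_instances(X, y, k=3):
    """ENN-style filter (assumed stand-in): keep a row only if its label
    matches the majority label of its k nearest neighbours."""
    keep = []
    for i, row in enumerate(X):
        votes = Counter(y[j] for j in nearest(row, X, k, skip=i))
        if votes.most_common(1)[0][0] == y[i]:
            keep.append(i)
    return [X[i] for i in keep], [y[i] for i in keep]

def impute(rows, reference, k=3):
    """k-NN imputation (assumed stand-in): fill each None with the mean of that
    feature over the k nearest reference rows. Assumes reference is non-empty."""
    filled = []
    for row in rows:
        nbrs = [reference[j] for j in nearest(row, reference, k)]
        filled.append([v if v is not None else sum(n[c] for n in nbrs) / len(nbrs)
                       for c, v in enumerate(row)])
    return filled

def is_then_mvi(X, y, k=3):
    """Filter the complete rows first, then impute the incomplete rows from them."""
    complete = [i for i, r in enumerate(X) if None not in r]
    incomplete = [i for i, r in enumerate(X) if None in r]
    Xc, yc = select_instances([X[i] for i in complete], [y[i] for i in complete], k)
    Xi = impute([X[i] for i in incomplete], Xc, k)
    return Xc + Xi, yc + [y[i] for i in incomplete]

# Hypothetical toy data: two clusters, one mislabelled row, two incomplete rows.
X = [
    [1.0, 1.1], [1.2, 0.9], [0.9, 1.0], [1.1, 1.2],  # class "a"
    [5.0, 5.1], [5.2, 4.9], [4.9, 5.0], [5.1, 5.2],  # class "b"
    [1.0, 1.0],                                      # noisy: labelled "b" inside the "a" cluster
    [1.05, None],                                    # incomplete, class "a"
    [None, 5.05],                                    # incomplete, class "b"
]
y = ["a", "a", "a", "a", "b", "b", "b", "b", "b", "a", "b"]

X_out, y_out = is_then_mvi(X, y)
```

In this sketch the mislabelled row is dropped before imputation, so the missing values are estimated only from the retained complete rows. The resulting `X_out` and `y_out` could then be passed to a classifier such as k-NN or SVM, which is how the abstract says the processes were compared.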
    Publisher: New York: Elsevier Inc
    Publication date: 2016-12
    Published in: The Journal of Systems and Software, 2016-12, Vol. 122, p. 63-71
    Source: Elsevier ScienceDirect Journals
    Copyright: 2016 Elsevier Inc.
    Copyright: Copyright Elsevier Sequoia S.A. Dec 2016
    Identifier: ISSN: 0164-1212
    Identifier: EISSN: 1873-1228
    Identifier: DOI: 10.1016/j.jss.2016.08.093
    Identifier: CODEN: JSSODM
    Appears in collections: [Department of Information Management] Journal Articles

    Files in this item:

    File          Description   Size   Format   Views
    index.html                  0Kb    HTML     21


    All items in NCUIR are protected by their original copyright.
