English  |  正體中文  |  简体中文  |  全文筆數/總筆數 : 94201/94201 (100%)
造訪人次 : 80415552      線上人數 : 137
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
搜尋範圍 查詢小技巧:
  • 您可在西文檢索詞彙前後加上"雙引號",以獲取較精準的檢索結果
  • 若欲以作者姓名搜尋,建議至進階搜尋限定作者欄位,可獲得較完整資料
  • 進階搜尋


    請使用永久網址來引用或連結此文件: https://ir.lib.ncu.edu.tw/handle/987654321/107082


    題名: The distance function effect on k-nearest neighbor classification for medical datasets
    作者: 蔡志豐;Hu, Li-Yu;Huang, Min-Wei;Ke, Shih-Wen;Tsai, Chih-Fong
    貢獻者: 管理學院資訊管理學系
    關鍵詞: Case Study;Classification;Computer Science;Humanities and Social Sciences;multidisciplinary;Science;Science (multidisciplinary)
    日期: 2016-12-01
    上傳時間: 2026-04-23 13:55:45 (UTC+8)
    出版者: Springer Science and Business Media Deutschland GmbH;Cham: Springer Science and Business Media LLC
    摘要: 摘要: Introduction K-nearest neighbor (k-NN) classification is conventional non-parametric classifier, which has been used as the baseline classifier in many pattern classification problems. It is based on measuring the distances between the test data and each of the training data to decide the final classification output. Case description Since the Euclidean distance function is the most widely used distance metric in k-NN, no study examines the classification performance of k-NN by different distance functions, especially for various medical domain problems. Therefore, the aim of this paper is to investigate whether the distance function can affect the k-NN performance over different medical datasets. Our experiments are based on three different types of medical datasets containing categorical, numerical, and mixed types of data and four different distance functions including Euclidean, cosine, Chi square, and Minkowsky are used during k-NN classification individually. Discussion and evaluation The experimental results show that using the Chi square distance function is the best choice for the three different types of datasets. However, using the cosine and Euclidean (and Minkowsky) distance function perform the worst over the mixed type of datasets. Conclusions In this paper, we demonstrate that the chosen distance function can affect the classification accuracy of the k-NN classifier. For the medical domain datasets including the categorical, numerical, and mixed types of data, K-NN based on the Chi square distance function performs the best.
    其他題名: SpringerPlus
    其他題名: Springerplus
    出版者: Cham: Springer Science and Business Media LLC
    出版日期: 2016-08-09
    出處: SpringerPlus, 2016-08, Vol.5 (1), p.1304-1304, Article 1304
    資源來源: Agricultural & Environmental Science Collection
    版權: The Author(s) 2016
    版權: SpringerPlus is a copyright of Springer, 2016.
    識別號: ISSN: 2193-1801
    識別號: EISSN: 2193-1801
    識別號: DOI: 10.1186/s40064-016-2941-7
    識別號: PMID: 27547678
    顯示於類別:[資訊管理學系] 期刊論文

    文件中的檔案:

    檔案 描述 大小格式瀏覽次數
    index.html0KbHTML23檢視/開啟


    在NCUIR中所有的資料項目都受到原著作權保護.

    社群 sharing

    ::: Copyright National Central University. | 國立中央大學圖書館版權所有 | 收藏本站 | 設為首頁 | 最佳瀏覽畫面: 1024*768 | 建站日期:8-24-2009 :::
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - 隱私權政策聲明