English  |  正體中文  |  简体中文  |  全文筆數/總筆數 : 94201/94201 (100%)
造訪人次 : 81558151      線上人數 : 3747
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
搜尋範圍 查詢小技巧:
  • 您可在西文檢索詞彙前後加上"雙引號",以獲取較精準的檢索結果
  • 若欲以作者姓名搜尋,建議至進階搜尋限定作者欄位,可獲得較完整資料
  • 進階搜尋


    請使用永久網址來引用或連結此文件: https://ir.lib.ncu.edu.tw/handle/987654321/107049


    題名: SVOIS: Support vector oriented instance selection for text classification
    作者: 蔡志豐;Tsai, Chih-Fong;Chang, Che-Wei
    貢獻者: 管理學院資訊管理學系
    關鍵詞: Data reduction;Instance selection;Machine learning;Support vector machines;Text classification
    日期: 2013-06-19
    上傳時間: 2026-04-23 13:54:37 (UTC+8)
    出版者: Elsevier Ltd.;Elsevier Ltd
    摘要: 摘要: Automatic text classification is usually based on models constructed through learning from training examples. However, as the size of text document repositories grows rapidly, the storage requirements and computational cost of model learning is becoming ever higher. Instance selection is one solution to overcoming this limitation. The aim is to reduce the amount of data by filtering out noisy data from a given training dataset. A number of instance selection algorithms have been proposed in the literature, such as ENN, IB3, ICF, and DROP3. However, all of these methods have been developed for the k-nearest neighbor (k-NN) classifier. In addition, their performance has not been examined over the text classification domain where the dimensionality of the dataset is usually very high. The support vector machines (SVM) are core text classification techniques. In this study, a novel instance selection method, called Support Vector Oriented Instance Selection (SVOIS), is proposed. First of all, a regression plane in the original feature space is identified by utilizing a threshold distance between the given training instances and their class centers. Then, another threshold distance, between the identified data (forming the regression plane) and the regression plane, is used to decide on the support vectors for the selected instances. The experimental results based on the TechTC-100 dataset show the superior performance of SVOIS over other state-of-the-art algorithms. In particular, using SVOIS to select text documents allows the k-NN and SVM classifiers perform better than without instance selection. •A novel Support Vector Oriented Instance Selection (SVOIS) approach is introduced.•SVOIS is particularly proposed for high dimensional text classification.•SVOIS has shown its outperformance over state-of-the-art algorithms.•In addition, state-of-the-art algorithms are not good at high dimensional data reduction.
    出版者: Elsevier Ltd
    出版日期: 2013-11-01
    出處: Information Systems, 2013-11, Vol.38 (8), p.1070-1083
    資源來源: Elsevier ScienceDirect Journals
    版權: 2013 Elsevier Ltd
    識別號: ISSN: 0306-4379
    識別號: EISSN: 1873-6076
    識別號: DOI: 10.1016/j.is.2013.05.001
    顯示於類別:[資訊管理學系] 期刊論文

    文件中的檔案:

    檔案 描述 大小格式瀏覽次數
    index.html0KbHTML18檢視/開啟


    在NCUIR中所有的資料項目都受到原著作權保護.

    社群 sharing

    ::: Copyright National Central University. | 國立中央大學圖書館版權所有 | 收藏本站 | 設為首頁 | 最佳瀏覽畫面: 1024*768 | 建站日期:8-24-2009 :::
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - 隱私權政策聲明