NCU Institutional Repository (中大學術數位典藏; theses and dissertations, past exam papers, journal articles, and research projects available for download): Item 987654321/107049


    Please use this identifier to cite or link to this item: https://ir.lib.ncu.edu.tw/handle/987654321/107049


    Title: SVOIS: Support vector oriented instance selection for text classification
    Authors: 蔡志豐;Tsai, Chih-Fong;Chang, Che-Wei
    Contributors: College of Management, Department of Information Management
    Keywords: Data reduction;Instance selection;Machine learning;Support vector machines;Text classification
    Date: 2013-06-19
    Issue Date: 2026-04-23 13:54:37 (UTC+8)
    Publisher: Elsevier Ltd
    Abstract: Automatic text classification is usually based on models constructed by learning from training examples. However, as text document repositories grow rapidly, the storage requirements and computational cost of model learning are becoming ever higher. Instance selection is one way to overcome this limitation. The aim is to reduce the amount of data by filtering out noisy data from a given training dataset. A number of instance selection algorithms have been proposed in the literature, such as ENN, IB3, ICF, and DROP3. However, all of these methods were developed for the k-nearest neighbor (k-NN) classifier. In addition, their performance has not been examined in the text classification domain, where the dimensionality of the dataset is usually very high. Support vector machines (SVMs) are a core text classification technique. In this study, a novel instance selection method, called Support Vector Oriented Instance Selection (SVOIS), is proposed. First, a regression plane in the original feature space is identified by applying a threshold distance between the given training instances and their class centers. Then, another threshold distance, between the identified data (forming the regression plane) and the regression plane, is used to decide on the support vectors for the selected instances. The experimental results on the TechTC-100 dataset show the superior performance of SVOIS over other state-of-the-art algorithms. In particular, using SVOIS to select text documents allows the k-NN and SVM classifiers to perform better than they do without instance selection.
    Highlights:
      • A novel Support Vector Oriented Instance Selection (SVOIS) approach is introduced.
      • SVOIS is proposed specifically for high-dimensional text classification.
      • SVOIS outperforms state-of-the-art algorithms.
      • In addition, state-of-the-art algorithms are not good at high-dimensional data reduction.
    Publication date: 2013-11-01
    Published in: Information Systems, 2013-11, Vol. 38 (8), pp. 1070-1083
    Source database: Elsevier ScienceDirect Journals
    Copyright: 2013 Elsevier Ltd
    Identifier: ISSN: 0306-4379
    Identifier: EISSN: 1873-6076
    Identifier: DOI: 10.1016/j.is.2013.05.001
    Appears in Collections: [Department of Information Management] journal & Dissertation


    All items in NCUIR are protected by copyright, with all rights reserved.
