中大機構典藏-NCU Institutional Repository-提供博碩士論文、考古題、期刊論文、研究計畫等下載:Item 987654321/10344
English  |  正體中文  |  简体中文  |  Items with full text/Total items : 78937/78937 (100%)
Visitors : 39357381      Online Users : 385
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
Scope Tips:
  • please add "double quotation mark" for query phrases to get precise results
  • please goto advance search for comprehansive author search
  • Adv. Search
    HomeLoginUploadHelpAboutAdminister Goto mobile version


    Please use this identifier to cite or link to this item: http://ir.lib.ncu.edu.tw/handle/987654321/10344


    Title: 應用主成份分析及支持向量機於特徵擷取之研究;Feature Extraction Using Principal Component Analysis and Support Vectors Machines
    Authors: 黃勇迪;Yong-Di Huang
    Contributors: 電機工程研究所
    Keywords: 主成份分析;支持向量機;support vector machines;principal companent analysis
    Date: 2008-06-17
    Issue Date: 2009-09-22 12:13:11 (UTC+8)
    Publisher: 國立中央大學圖書館
    Abstract: 研究上指出資料本身的特性會直接影響到分類能力。因此我們設計出一種資料研究的方法,將特徵做最好的應用。在一些模糊不清的資料上增加必要的特徵,提高模式識別的應用,以保證類別分離性。本論文結合主成份分析(PCA)於特徵擷取之研究。因此我們提出了LPCSVM和FCLSVM兩種演算法。 在LPCSVM演算法中,外部的類別標籤被視為有用的特徵資訊並將其加入原始資料中,而形成一個新增加的資料集。對於支持向量機(SVM)而言,主成份分析的功能是擷取這些新資料的特徵。在FCLSVM演算法中,我們討論相同的類別標籤觀念,將第一主成份當作一個代表性的指標於增加的資料集中。如此,這些代表性的第一主成份可以成功經由數學式計算而被呈現;且於分類之前, 對於任何驗證與測試資料也能做相同的轉換。 實驗數據顯示,應用代表性指標資料,分類誤差將會被降低。這結果證實代表性的指標提供給特徵擷取額外有價值的資訊。 Several studies have been reported that the characteristics of data sets are directly correlated with the capability of the classifier. Therefore, a study in the cognition is conceived, and we suggest the feature optimization. It adds necessary features based on some vague and insufficient knowledge in the pattern recognition applications to guarantee class separability. We present that the available resource of class labels and feature extraction concepts of principal component analysis (PCA) can be applied to the feature optimization problem. Thus, we propose the LPCSVM and FCLSVM to set a sufficient number of features compensating for the lack of information. In the LPCSVM algorithm, the class labels of outputs firstly are regarded as useful feature information, and thus they are incorporated into the original inputs to form a new augmented data set. Then principal component analysis (PCA) is applied to the augmented data to extract features for support vector machines (SVM) classification. Above all, in the FCLSVM algorithm we discuss the concept of an equivalent class label, which describes this first principal component as a kind of representative label in the augmented data set. In this way, the representative indices can be successfully represented by a mathematical function in the first principal component form, which is benefiting any validation set and test set subjected to the same transformation before it is classified by the classifier. The experiments on several existing data sets show that, when the augmented data are utilized, the classification errors estimated are reduced by experimental evidence. This implies that the class labels can be used as extra helpful information to feature extraction.
    Appears in Collections:[Graduate Institute of Electrical Engineering] Electronic Thesis & Dissertation

    Files in This Item:

    File SizeFormat


    All items in NCUIR are protected by copyright, with all rights reserved.

    社群 sharing

    ::: Copyright National Central University. | 國立中央大學圖書館版權所有 | 收藏本站 | 設為首頁 | 最佳瀏覽畫面: 1024*768 | 建站日期:8-24-2009 :::
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - 隱私權政策聲明