English  |  正體中文  |  简体中文  |  全文筆數/總筆數 : 81570/81570 (100%)
造訪人次 : 47010930      線上人數 : 295
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
搜尋範圍 查詢小技巧:
  • 您可在西文檢索詞彙前後加上"雙引號",以獲取較精準的檢索結果
  • 若欲以作者姓名搜尋,建議至進階搜尋限定作者欄位,可獲得較完整資料
  • 進階搜尋


    請使用永久網址來引用或連結此文件: http://ir.lib.ncu.edu.tw/handle/987654321/61780


    題名: 大量專利類別自動分類演算法研究;An automatic classification algorithm for a large number of patent categorization
    作者: 張元哲;Chang,Yuan-Che
    貢獻者: 資訊管理學系
    關鍵詞: 專利分類;向量空間模型;國際專利分類法;支持向量機;K-質心法分群法;k-近鄰法;Patent classification;Vector space model (VSM);IPC taxonomy;Support vector machines (SVM);K-means;K nearest neighbors (KNN)
    日期: 2013-10-21
    上傳時間: 2013-11-27 11:33:17 (UTC+8)
    出版者: 國立中央大學
    摘要: 自動專利分類系統可以快速比對識別現有專利的可能衝突,對發明者以及專利律師而言,可幫他們節省許多人工比對成本與時間,因此是相當有價值的研究。近年來,使用國際專利分類(IPC)來進行專利文件的分類已日益普遍,而此一國際專利分類則是一個複雜的階層式分類系統,它包含了8個部(section)、128個主類(class)、648個次類(subclass),約有7,200個主目(main group)及72,000個次目(subgroup)。儘管已有一些研究著眼於IPC的自動分類,但截至目前為止,並沒有任何分類方法適合用來進行次目層級的自動分類(IPC的底層分類),因此,本研究提出一個全新的分類方法,稱之為三階段分類演算法(簡稱為TPC演算法),它可以進行次目層級的自動分類,並獲得合理的正確率。此一方法是由三個階段所組成,前兩個階段運用了支持向量機進行可能類別的預測,而最後一個階段則運用分群演算法決定最終的次目標籤。本研究使用世界智慧財產權組織的WIPO-alpha專利資料集進行實驗,其結果顯示TPC演算法可以在次目層級的自動分類上,達到36.07%的正確率,此一數據若與隨機猜測一個次目標籤的機率相比,約已提升了26,020倍的正確率。此外,我們額外搜集96,654份與WIPO-alpha專利資料集不重複的專利文件,再與WIPO-alpha專利資料集合併進行測試,實驗結果顯示正確率提升至38.01%。
    An automatic patent categorization system would be invaluable to individual inventors and patent attorneys, saving them time and effort by quickly identifying conflicts with existing patents. In recent years, it has become more and more common to classify all patent documents using the International Patent Classification (IPC), a complex hierarchical classification system comprised of 8 sections, 128 classes, 648 subclasses, about 7,200 main groups, and approximately 72,000 subgroups. So far, however, no patent categorization method has been developed that can classify patents down to the subgroup level (the bottom level of the IPC). Therefore, this dissertation presents a novel categorization method, the three phase categorization (TPC) algorithm, which classifies patents down to the subgroup level with reasonable accuracy. The method is composed of three phases, where the first two are performed using SVM classification and the last one employs clustering. The experimental results for the TPC algorithm, using the WIPO-alpha collection, indicate that our classification method can achieve 36.07% accuracy at the subgroup level. This is approximately a 26,020-fold improvement over a random guess. In addition, a collection of 96,654 distinct patent documents that we collect from Internet has been combined with WIPO-alpha collection. We evaluate the TPC algorithm on this collection and it achieved an accuracy of 38.01% at the subgroup level.
    顯示於類別:[資訊管理研究所] 博碩士論文

    文件中的檔案:

    檔案 描述 大小格式瀏覽次數
    index.html0KbHTML825檢視/開啟


    在NCUIR中所有的資料項目都受到原著作權保護.

    社群 sharing

    ::: Copyright National Central University. | 國立中央大學圖書館版權所有 | 收藏本站 | 設為首頁 | 最佳瀏覽畫面: 1024*768 | 建站日期:8-24-2009 :::
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - 隱私權政策聲明