    Please use this permanent URL to cite or link to this item: http://ir.lib.ncu.edu.tw/handle/987654321/54786


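    Title: Identifying technology trend in patent documents with themes (利用專利文件主題辨識科技趨勢)
    Author: Lu, Kuo-yen (呂國彥)
    Contributor: Graduate Institute of Business Administration
    Keywords: patent document; Chinese word segmentation; expectation-maximization algorithm; emerging technology; Cross-Collection Mixture Model
    Date: 2012-07-18
    Upload time: 2012-09-11 19:02:08 (UTC+8)
    Publisher: National Central University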
    Abstract: Patent literature records 90% of the world's technical achievements, and the technologies it records are protected by the patent law of each country. As global technological competition intensifies, companies everywhere have begun strategic patent research, so the analysis and use of patents has drawn growing attention from business. Patent analysis examines, processes, and combines the large volume of information in patent specifications and patent documents, and applies statistics, data mining, and text mining to turn that information into competitive intelligence that supports corporate decisions and forecasts. It has therefore become one of the weapons a company uses to survive and to protect its commercial technology. Past research on trend analysis mostly applied statistical methods to keyword counts and patent counts. The keywords found this way are limited to technologies that are already mature, and latent emerging terms go undetected. In addition, because patent documents must disclose the underlying technique, companies use substitute words or phrases to keep new terms from being noticed. As a result, earlier patent analysis found only obvious, important terms and missed inconspicuous terms that matter for future technology. Finding these low-frequency terms and forecasting trends correctly from them is therefore an important research problem.
This study uses a Chinese word segmentation system to find the terms in patent documents and extracts terms with the probabilistic Cross-Collection Mixture Model. The model follows how terms change over a time series: its background model removes terms that are too frequent to be discriminative, and its common theme collects terms that keep appearing over time. The method can screen patent documents quickly and in bulk and extract low-frequency emerging terms from patent abstracts. According to the author, it filters out popular terms and accurately detects the future trend of emerging technology from patent documents.
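The roles of the background model and the common theme described in the abstract can be illustrated with a small sketch. It assumes the commonly published form of the cross-collection mixture model, in which a word is drawn from the background with weight `lambda_b` and otherwise from a theme that mixes a common word distribution (weight `lambda_c`) with a collection-specific one; treating each time period as a collection is an assumption based on the abstract's mention of time series. The function names, the toy vocabulary, the period label "2011", and every probability value are hypothetical and are not taken from the thesis.

```python
# Sketch of word generation in a cross-collection mixture model.
# All names and numbers here are made-up illustration values (assumptions).

def word_probability(word, period, theme_weights, background,
                     common_themes, period_themes,
                     lambda_b=0.9, lambda_c=0.5):
    """Return p(word | document, period) as a background/common/specific mixture."""
    themed = 0.0
    for j, weight in enumerate(theme_weights):
        common = common_themes[j].get(word, 0.0)
        specific = period_themes[period][j].get(word, 0.0)
        themed += weight * (lambda_c * common + (1.0 - lambda_c) * specific)
    return lambda_b * background.get(word, 0.0) + (1.0 - lambda_b) * themed


def period_specific_share(word, period, theme_weights, background,
                          common_themes, period_themes,
                          lambda_b=0.9, lambda_c=0.5):
    """Posterior probability that an occurrence of `word` was generated by a
    period-specific theme rather than the background or a common theme."""
    total = word_probability(word, period, theme_weights, background,
                             common_themes, period_themes, lambda_b, lambda_c)
    if total == 0.0:
        return 0.0
    specific = sum(weight * period_themes[period][j].get(word, 0.0)
                   for j, weight in enumerate(theme_weights))
    return (1.0 - lambda_b) * (1.0 - lambda_c) * specific / total


# Toy distributions over a three-word vocabulary (hypothetical values).
BACKGROUND = {"method": 0.8, "battery": 0.15, "graphene": 0.05}
COMMON_THEMES = [{"method": 0.1, "battery": 0.8, "graphene": 0.1}]
PERIOD_THEMES = {"2011": [{"method": 0.1, "battery": 0.3, "graphene": 0.6}]}
THEME_WEIGHTS = [1.0]  # the document is assumed to cover a single theme

# Rank words by how strongly they are attributed to the period-specific theme.
ranked = sorted(
    BACKGROUND,
    key=lambda w: period_specific_share(w, "2011", THEME_WEIGHTS, BACKGROUND,
                                        COMMON_THEMES, PERIOD_THEMES),
    reverse=True,
)
```

In this toy setting the generic word is absorbed by the background, the persistently used word by the common theme, and the rare word ranks first for the period-specific theme. In a full pipeline the distributions would not be fixed by hand but estimated from segmented patent abstracts, for example with the expectation-maximization algorithm listed in the keywords.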
    Appears in collections: [Graduate Institute of Business Administration] Master's and doctoral theses

    Files in this item:

    File        Description  Size  Format  Views
    index.html               0Kb   HTML    772


    All items in NCUIR are protected by their original copyright.
