利用專利文件主題辨識科技趨勢;Identifying technology trend in patentdocuments with themes

NCU Institutional Repository > 管理學院 > 企業管理研究所 > 博碩士論文 > Item 987654321/54786

請使用永久網址來引用或連結此文件: http://ir.lib.ncu.edu.tw/handle/987654321/54786

題名:	利用專利文件主題辨識科技趨勢;Identifying technology trend in patentdocuments with themes
作者:	呂國彥;Lu,Kuo-yen
貢獻者:	企業管理研究所
關鍵詞:	專利文件;中文斷詞;期望值最大演算法;新興科技;emerging technology;patent document;Cross-Collection Mixture Model
日期:	2012-07-18
上傳時間:	2012-09-11 19:02:08 (UTC+8)
出版者:	國立中央大學
摘要:	專利文獻記載了全球90%的技術成果，記載的技術受到各國專利法的保護，但隨著世界技術競爭日益激烈，各國企業紛紛展開專利的戰略研究，因此在專利的分析和運用就受到了企業的重視，專利分析是針對專利說明書和專利文件中大量的訊息內容進行分析、加工、組合並利用統計、資料探勘(Data-mining)、文本挖掘(Text-mining)技巧使這些信息轉換成能幫助企業進行決策、預測的競爭情報，因此專利分析成為企業永續生存和保護商業技術的武器之一，在過去專利分析上針對趨勢分析的研究大都以統計分析的方式針對關鍵字的數量和專利數量進行預測分析，但所能找出的關鍵字(keyword)都侷限於已然成熟的技術並無法找出隱含的新興字詞，因此過去的專利分析都只能找到明顯且具有重要性的字詞，但並未能找到不明顯但對未來技術有重要影響的新興字詞，因此如何找出這些低頻性質的字詞做出正確的趨勢預測是非常重要的研究議題。本研究採用中文斷詞系統找尋專利文件的字詞，根據Cross-Collection Mixture Model的機率模型來萃取字詞，此模型將針對字詞在時間序列的變化之下，藉由模型中background model及common theme去除掉過於頻繁且不具有分辨意義的字詞和收集在時間變化之下持續出現的字詞，此方法可以快速且大量地篩選專利文件，並且從專利摘要萃取出具有低頻性質的新興字詞，此方法可以順利的篩選掉熱門字詞並且準確的從專利文件偵測出新興技術(emerging technology)的未來趨勢。Patent has recorded over 90% of the technique worldwide, patent has also been protected by the law in each country. However, as the technology completion has risen up nowadays, the business in each country has started the patent war, therefore, the analysis and implementation of patent has became more important in every business. Patent analysis is focusing on analyzing and combining the message from patent documentations. With statistics, data mining, and text mining, the message can be transformed into a huge role in decisions making and future predictions. Therefore, patent analysis has become a weapon for business to survive and protect their technology. In the past, the majority of the research in trend analysis uses statistics analysis to analyze the amount of keywords and patents. However, the keywords that could be found are limited in the technique that has been developed in years and no more new words could be found. And due to patent documents has the necessity to unveil the technique, the business uses substitute words or phrases to avoid the new words been found. Therefore, patent analysis can only find some obvious and important words but not the key words. This research use Chinese break words system to find the key word in patent documents, and based on Cross-Collection Mixture Model’s probability model to pick the words. This model uses the time sequences difference of the words, and uses the background model and common theme to delete frequent and indistinguishable word and common theme to collect the words the keep appearing under times. The patent documents can be quickly filtered and found the low appearing frequency and distinguishable words due to automation. Therefore, the searching and filter the popular but aged technology, and precisely detect the emerging technology from patent documents.
顯示於類別:	[企業管理研究所] 博碩士論文

文件中的檔案:

檔案	描述	大小	格式	瀏覽次數
index.html		0Kb	HTML	745	檢視/開啟

在NCUIR中所有的資料項目都受到原著作權保護.

社群 sharing

資料載入中.....