中大機構典藏-NCU Institutional Repository-提供博碩士論文、考古題、期刊論文、研究計畫等下載:Item 987654321/9874
English  |  正體中文  |  简体中文  |  全文筆數/總筆數 : 80990/80990 (100%)
造訪人次 : 41668698      線上人數 : 1384
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
搜尋範圍 查詢小技巧:
  • 您可在西文檢索詞彙前後加上"雙引號",以獲取較精準的檢索結果
  • 若欲以作者姓名搜尋,建議至進階搜尋限定作者欄位,可獲得較完整資料
  • 進階搜尋


    請使用永久網址來引用或連結此文件: http://ir.lib.ncu.edu.tw/handle/987654321/9874


    題名: 英文郵政地址與鄰近相關資訊擷取之研究;Application and Extraction of Postal Addresses and Related Information
    作者: 李淑瑩;Shu-Ying Li
    貢獻者: 資訊工程研究所
    關鍵詞: Conditional Random Fields;Postal Addresses;地址擷取;條件式隨機域
    日期: 2009-07-23
    上傳時間: 2009-09-22 11:58:29 (UTC+8)
    出版者: 國立中央大學圖書館
    摘要: 地址資訊和人們的日常生活息息相關,人們常需要透過網路查詢相關實體商店、學校或組織的地址,再經由地圖標示服務確定其實際方位。然而並不是每一個網站同時提供地址與地圖標示的功能,因此本研究目的是希望設計一個能從網頁中自動擷取英文地址的服務,並結合地圖標示功能,將擷取到的地址以及其相關資訊,一併標示在地圖上,提供使用者簡單方便的地圖標記資訊服務。我們的系統分為兩個部分,第一部分,將網頁透過條件式隨機域的方式訓練出地址擷取的模型,輸入的網頁經過此模型的測試過程後並擷取地址;第二部份,則以擷取到的地址為基礎,在網頁中擷取與地址相關的資訊,找出包含地址和相關資訊的地址區塊邊界,並且針對包含多餘資訊的區塊提出調整的作法。實驗結果得知,我們的地址擷取效能可以提升F-measure至0.913,同時對於八成六的資料可以正確的擷取到相關資訊。 Address Information is closely linked to people's daily life. People often need to query addresses of related brick-and-mortar shopping malls、schools and organization. And using the service of map marking identified the real direction. But there are not all web pages providing addresses and facility of map marking. Therefore, designing a service of extracting English addresses automatically from web pages is the goal of our research. And the service combines the facility of map marking and marks the extracted addresses and the related information on the map. The service provides users in a convenient and easy way to using the information service of map marking. Our system is divided into two steps: the first step is using Conditional Random fields to train the model of address extraction. Page we input enters the testing process of model of address extraction and outputs the segment of address. The second step is using extracted addresses as landmarks to extract related information and finding out the correct boundary of address blocks. In terms of the result of experiment, the F-measure of extraction by Conditional Random field is up to 0.913. And we also propose the method of adjustment to revise the incorrect boundary. The accuracy after adjusting is from 0.8506 to 0.8689.
    顯示於類別:[資訊工程研究所] 博碩士論文

    文件中的檔案:

    檔案 大小格式瀏覽次數


    在NCUIR中所有的資料項目都受到原著作權保護.

    社群 sharing

    ::: Copyright National Central University. | 國立中央大學圖書館版權所有 | 收藏本站 | 設為首頁 | 最佳瀏覽畫面: 1024*768 | 建站日期:8-24-2009 :::
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - 隱私權政策聲明