English  |  正體中文  |  简体中文  |  全文筆數/總筆數 : 94201/94201 (100%)
造訪人次 : 81536592      線上人數 : 2278
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
搜尋範圍 查詢小技巧:
  • 您可在西文檢索詞彙前後加上"雙引號",以獲取較精準的檢索結果
  • 若欲以作者姓名搜尋,建議至進階搜尋限定作者欄位,可獲得較完整資料
  • 進階搜尋


    請使用永久網址來引用或連結此文件: https://ir.lib.ncu.edu.tw/handle/987654321/106134


    題名: A resource-saving collective approach to biomedical semantic role labeling
    作者: 蔡宗翰;Tsai, Richard Tzong-Han;Lai, Po-Ting
    貢獻者: 資訊電機學院資訊工程學系
    關鍵詞: Accuracy;Algorithms;Bioinformatics;Biomedical and Life Sciences;Biomedical Research;Computational Biology/Bioinformatics;Computer Appl. in Life Sciences;Computer science;Data Mining;Databases, Factual;Knowledge-based analysis;Labeling;Life Sciences;Markov Chains;Microarrays;Proteins;Research Article;Semantics;Trees
    日期: 2014-05-27
    上傳時間: 2026-04-23 13:09:50 (UTC+8)
    出版者: BioMed Central Ltd.;London: BioMed Central
    摘要: 摘要: Background Biomedical semantic role labeling (BioSRL) is a natural language processing technique that identifies the semantic roles of the words or phrases in sentences describing biological processes and expresses them as predicate-argument structures (PAS’s). Currently, a major problem of BioSRL is that most systems label every node in a full parse tree independently; however, some nodes always exhibit dependency. In general SRL, collective approaches based on the Markov logic network (MLN) model have been successful in dealing with this problem. However, in BioSRL such an approach has not been attempted because it would require more training data to recognize the more specialized and diverse terms found in biomedical literature, increasing training time and computational complexity. Results We first constructed a collective BioSRL system based on MLN. This system, called collective BIOSMILE (CBIOSMILE), is trained on the BioProp corpus. To reduce the resources used in BioSRL training, we employ a tree-pruning filter to remove unlikely nodes from the parse tree and four argument candidate identifiers to retain candidate nodes in the tree. Nodes not recognized by any candidate identifier are discarded. The pruned annotated parse trees are used to train a resource-saving MLN-based system, which is referred to as resource-saving collective BIOSMILE (RCBIOSMILE). Our experimental results show that our proposed CBIOSMILE system outperforms BIOSMILE, which is the top BioSRL system. Furthermore, our proposed RCBIOSMILE maintains the same level of accuracy as CBIOSMILE using 92% less memory and 57% less training time. Conclusions This greatly improved efficiency makes RCBIOSMILE potentially suitable for training on much larger BioSRL corpora over more biomedical domains. Compared to real-world biomedical corpora, BioProp is relatively small, containing only 445 MEDLINE abstracts and 30 event triggers. It is not large enough for practical applications, such as pathway construction. We consider it of primary importance to pursue SRL training on large corpora in the future.
    其他題名: BMC Bioinformatics
    出版者: London: BioMed Central
    出版日期: 2014-05-27
    出處: BMC bioinformatics, 2014-05, Vol.15 (1), p.160-160, Article 160
    資源來源: Publicly Available Content Database
    版權: Tsai and Lai; licensee BioMed Central Ltd. 2014
    版權: COPYRIGHT 2014 BioMed Central Ltd.
    版權: 2014 Tsai and Lai; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
    版權: Copyright © 2014 Tsai and Lai; licensee BioMed Central Ltd. 2014 Tsai and Lai; licensee BioMed Central Ltd.
    識別號: ISSN: 1471-2105
    識別號: EISSN: 1471-2105
    識別號: DOI: 10.1186/1471-2105-15-160
    識別號: PMID: 24884358
    顯示於類別:[資訊工程學系] 期刊論文

    文件中的檔案:

    檔案 描述 大小格式瀏覽次數
    index.html0KbHTML17檢視/開啟


    在NCUIR中所有的資料項目都受到原著作權保護.

    社群 sharing

    ::: Copyright National Central University. | 國立中央大學圖書館版權所有 | 收藏本站 | 設為首頁 | 最佳瀏覽畫面: 1024*768 | 建站日期:8-24-2009 :::
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - 隱私權政策聲明