English  |  正體中文  |  简体中文  |  全文筆數/總筆數 : 94201/94201 (100%)
造訪人次 : 81559432      線上人數 : 3594
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
搜尋範圍 查詢小技巧:
  • 您可在西文檢索詞彙前後加上"雙引號",以獲取較精準的檢索結果
  • 若欲以作者姓名搜尋,建議至進階搜尋限定作者欄位,可獲得較完整資料
  • 進階搜尋


    請使用永久網址來引用或連結此文件: https://ir.lib.ncu.edu.tw/handle/987654321/106490


    題名: Collective instance-level gene normalization on the IGN corpus
    作者: 蔡宗翰;Dai, Hong-Jie;Wu, Johnny Chi-Yang;Tsai, Richard Tzong-Han
    貢獻者: 資訊電機學院資訊工程學系
    關鍵詞: Algorithms;Annotations;Artificial intelligence;Bioinformatics;Biological activity;Biological effects;Computational Biology - methods;Computer science;Data Mining - methods;Databases, Genetic;Genes;Humans;Molecular Sequence Annotation;Names;Natural language;Natural Language Processing;Nomenclature;Object recognition;Proteins;Software;Studies
    日期: 2013-11-25
    上傳時間: 2026-04-23 13:24:52 (UTC+8)
    出版者: Public Library of Science;United States: Public Library of Science
    摘要: 摘要: A high proportion of life science researches are gene-oriented, in which scientists aim to investigate the roles that genes play in biological processes, and their involvement in biological mechanisms. As a result, gene names and their related information turn out to be one of the main objects of interest in biomedical literatures. While the capability of recognizing gene mentions has made significant progress, the results of recognition are still insufficient for direct use due to the ambiguity of gene names. Gene normalization (GN) goes beyond the recognition task by linking a gene mention to a database ID. Unlike most previous works, we approach GN on the instance-level and evaluate its overall performance on the recognition and normalization steps in abstracts and full texts. We release the first instance-level gene normalization (IGN) corpus in the BioC format, which includes annotations for the boundaries of all gene mentions and the corresponding IDs for human gene mentions. Species information, along with existing co-reference chains and full name/abbreviation pairs are also provided for each gene mention. Using the released corpus, we have designed a collective instance-level GN approach using not only the contextual information of each individual instance, but also the relations among instances and the inherent characteristics of full-text sections. Our experimental results show that our collective approach can achieve an F-score of 0.743. The proposed approach that exploits section characteristics in full-text articles can improve the F-scores of information lacking sections by up to 1.8%. In addition, using the proposed refinement process improved the F-score of gene mention recognition by 0.125 and that of GN by 0.03. Whereas current experimental results are limited to the human species, we seek to continue updating the annotations of the IGN corpus and observe how the proposed approach can be extended to other species.
    其他題名: PLoS One
    出版者: United States: Public Library of Science
    出版日期: 2013-11-25
    出處: PloS one, 2013-11, Vol.8 (11), p.e79517
    資源來源: [Open Access] DOAJ 오픈액세스 저널 디렉토리
    版權: COPYRIGHT 2013 Public Library of Science
    版權: 2013 Dai et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/3.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
    版權: 2013 Dai et al 2013 Dai et al
    識別號: ISSN: 1932-6203
    識別號: EISSN: 1932-6203
    識別號: DOI: 10.1371/journal.pone.0079517
    識別號: PMID: 24282506
    顯示於類別:[資訊工程學系] 期刊論文

    文件中的檔案:

    檔案 描述 大小格式瀏覽次數
    index.html0KbHTML16檢視/開啟


    在NCUIR中所有的資料項目都受到原著作權保護.

    社群 sharing

    ::: Copyright National Central University. | 國立中央大學圖書館版權所有 | 收藏本站 | 設為首頁 | 最佳瀏覽畫面: 1024*768 | 建站日期:8-24-2009 :::
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - 隱私權政策聲明