

    Please use this permanent URL to cite or link to this item: https://ir.lib.ncu.edu.tw/handle/987654321/99400


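    Title: Summarization-Enhanced BERT with Phonetic and Glyph Embeddings for Chinese Spelling Check
    Author: 郭憶萱; Kuo, Yi-Hsuan
    Contributor: Department of Information Management
    Keywords: Natural language processing; Chinese spelling check
    Date: 2026-02-02
    Upload time: 2026-03-06 18:54:11 (UTC+8)
    Publisher: National Central University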
    Abstract: Chinese spelling check (CSC) is challenging because Chinese characters are structurally complex, homophones are common, and many glyphs look alike, which makes typo detection highly dependent on context. This study proposes a novel CSC framework, Summarization-Enhanced BERT (SE-BERT), which integrates sentence-level summarization features with phonetic and glyph embeddings to improve context awareness in error detection and correction. The model consists of three parts: a summarization module, a detection network, and a correction network. An error-guided mask gives the correction step more targeted guidance. Experiments on benchmark datasets, including SIGHAN, show that SE-BERT outperforms existing baselines in accuracy and error identification while reducing over-correction. Attention visualization and case studies further confirm that the model focuses on key contextual cues. These findings highlight the importance of integrating semantic, phonetic, and visual information for effective CSC, and offer a structured and extensible approach to spelling correction in linguistically complex settings.
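As a rough illustration of how an error-guided mask might steer correction, the sketch below assumes that a detection network outputs one error probability per character and that a correction network proposes one candidate per position. The function name `apply_error_guided_mask`, the 0.5 threshold, and the toy inputs are hypothetical and not taken from the thesis; the actual SE-BERT mechanism may differ.

```python
from typing import List


def apply_error_guided_mask(
    chars: List[str],
    error_probs: List[float],
    candidates: List[str],
    threshold: float = 0.5,
) -> List[str]:
    """Replace a character only where the detector flags a likely error.

    Hypothetical helper: positions whose detection probability is below
    `threshold` keep the original character, which limits over-correction.
    """
    if not (len(chars) == len(error_probs) == len(candidates)):
        raise ValueError("all inputs must have the same length")
    # Binary mask: 1 where the detector believes the character is wrong.
    mask = [1 if p >= threshold else 0 for p in error_probs]
    return [cand if m else ch for ch, cand, m in zip(chars, candidates, mask)]


# Toy example (made-up numbers): "天汽很好" should read "天氣很好".
sentence = list("天汽很好")
probs = [0.02, 0.91, 0.05, 0.10]   # assumed detector output
proposed = list("添氣狠好")         # assumed corrector proposals
corrected = "".join(apply_error_guided_mask(sentence, probs, proposed))
print(corrected)
```

Only the second position exceeds the threshold here, so the corrector's other proposals are ignored even though they differ from the input. That is the sense in which a mask of this kind can reduce over-correction.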
    Appears in Collections: [Graduate Institute of Information Management] Theses and Dissertations

    Files in This Item:

    File        Description  Size  Format  Views
    index.html  -            0Kb   HTML    27


    All items in NCUIR are protected by the original copyright.
