博碩士論文 110353006 完整後設資料紀錄

DC 欄位 語言
DC.contributor機械工程學系在職專班zh_TW
DC.creator陳冠帆zh_TW
DC.creatorKuan-Fan Chenen_US
dc.date.accessioned2023-6-28T07:39:07Z
dc.date.available2023-6-28T07:39:07Z
dc.date.issued2023
dc.identifier.urihttp://ir.lib.ncu.edu.tw:88/thesis/view_etd.asp?URN=110353006
dc.contributor.department機械工程學系在職專班zh_TW
DC.description國立中央大學zh_TW
DC.descriptionNational Central Universityen_US
dc.description.abstract本文研究一種使用形態學與光學字符辨識功能取得特定工程圖表影像中單元格的文字內容,並記錄結果的快速辨識方法。本研究適用於特定工程圖表影像,如果需要應用於不同形式的工程圖表影像,可以修改相應工程圖表影像規則的參數。 本研究以Python程式語言作為基礎,前處理使用Otsu閾值法進行圖像二值化處理,並使用形態學操作提取特定工程圖表影像之單元格位置。在文字辨識的過程中,使用Tesseract-OCR套件分為三個階段進行文字辨識與提取:1.使用全自動頁面分割搭配預訓練的英語模型、2.使用單詞分割搭配重新訓練的英語模型與3.使用單字分割搭配重新訓練的英語模型。最後,使用正規表達式搭配窮舉法修正錯誤以及與規則不符的內容。 實驗結果表明,Tesseract-OCR套件雖然提供使用者預訓練的英語模型,並且這個英語模型在長字串的辨識能力非常卓越,但是在單元格中的單詞或單字辨識卻容易產生錯誤,使用三個階段搭配預訓練的英語模型辨識結果,正確率僅14.65%。而本研究使用特定工程圖表影像製作數據集重新訓練的英語模型,對於單元格中的單詞或單字辨識能力更好,正確率可以提升至58.04%。在後處理的過程中,依特殊工程圖表規則列出所有錯誤以及與規則不符的內容並使用正確字符取代,則可以讓正確率達到100%。zh_TW
dc.description.abstractThis study investigates a rapid recognition method for extracting text content from cells in specific engineering chart images using morphology and optical character recognition (OCR) techniques and recording the results. The research is applicable to specific engineering chart images, and if it needs to be applied to different types of engineering chart images, the parameters of the corresponding engineering chart image rules can be modified. Python programming language serves as the foundation for this research. In the preprocessing stage, the Otsu thresholding method is utilized for image binarization, and morphology operations are employed to extract the positions of cells in specific engineering chart images. In the text recognition process, the Tesseract-OCR package is used and divided into three stages for text recognition and extraction: 1. automatic page segmentation with a pre-trained English model, 2. word segmentation with a retrained English model, and 3. character segmentation with a retrained English model. Finally, regular expressions combined with an exhaustive approach are used to correct errors and content that deviate from the rules. The experimental results indicate that although the Tesseract-OCR package provides users with a pre-trained English model, which exhibits excellent recognition capabilities for long strings, it tends to generate errors in recognizing words or individual characters within cells. Using the three-stage approach with the pre-trained English model, the recognition accuracy is only 14.65%. However, by retraining the English model using a dataset created from specific engineering chart images, the recognition capability for words or individual characters within cells improves, achieving an accuracy of 58.04%. In the post-processing stage, by listing all errors and content that deviate from the rules based on specific engineering chart rules and replacing them with correct characters, the accuracy can be enhanced to 100%.en_US
DC.subject文字辨識zh_TW
DC.subject表格辨識zh_TW
DC.subject信息提取zh_TW
DC.subject形態學操作zh_TW
DC.subject光學字符辨識zh_TW
DC.subjectText recognitionen_US
DC.subjectTable extractionen_US
DC.subjectInformation extractionen_US
DC.subjectMorphological operationsen_US
DC.subjectOptical Character Recognitionen_US
DC.title一種應用於特定工程圖表影像的文字智慧辨識與提取之技術研究zh_TW
dc.language.isozh-TWzh-TW
DC.type博碩士論文zh_TW
DC.typethesisen_US
DC.publisherNational Central Universityen_US

若有論文相關問題,請聯絡國立中央大學圖書館推廣服務組 TEL:(03)422-7151轉57407,或E-mail聯絡  - 隱私權政策聲明