A Study on Peripheral Features for Touching English Character Segmentation

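Name

Chih-Wei Lin (林志瑋)

Department

Department of Computer Science and Information Engineering (資訊工程學系)

Thesis Title

A Study on Peripheral Features for Touching English Character Segmentation
(Camera Based Touching Character Segmentation using Peripheral Feature)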

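This electronic thesis is authorized for immediate open access.
Once open access has taken effect, the electronic full text is licensed to users only for personal, non-profit searching, reading, and printing for the purpose of academic research.
Please comply with the relevant provisions of the Copyright Act of the Republic of China. Do not reproduce, distribute, adapt, repost, or broadcast the work without authorization, so as to avoid violating the law.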

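Abstract (Chinese)

In today's society, where technology advances rapidly, electronic products are becoming ever smaller and more refined while their functions grow more powerful. How to help people digitize the image data they capture with such devices, and thereby save the labor and time spent organizing it, is the issue this study addresses. The study uses a digital camera, whose resolution is now quite good, and exploits its portability, convenience, and high resolution to capture the documents to be analyzed as digital images. It then studies the segmentation of English character images in those document images before text recognition, because good recognition results require a good segmentation mechanism that correctly separates touching characters.
A digital camera can capture images at any time, but it is also affected by non-uniform lighting, unlike a scanner, which has a stable light source during capture. Because most images are taken with a hand-held camera, hand shake can also leave the image slightly skewed or blurred. Owing to these external factors, touching characters often appear after the image is binarized.
This study presents an effective and accurate method for segmenting touching characters. Image preprocessing, comprising global binarization, text-block labeling, and local binarization, extracts the image data to be analyzed. The filtering mechanism proposed in this study then picks out the correct, complete characters. The touching characters rejected by the filter are split and analyzed with the peripheral-feature segmentation mechanism of this study, and the resulting segmentation can be passed to a subsequent recognition system.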
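The preprocessing described above begins with global binarization. As an illustration only, the sketch below chooses a single global threshold with Otsu's method and marks dark pixels as foreground. The abstract does not state which thresholding method the thesis uses, so Otsu's method, the function names `otsu_threshold` and `binarize`, and the dark-text-on-light-background convention are all assumptions made for this example.

```python
def otsu_threshold(pixels):
    """Return the gray level t (0..255) that maximizes between-class variance.

    Pixels with value <= t form one class and pixels > t the other.
    `pixels` is a flat list of integer gray levels (an assumed input format).
    """
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1

    total = len(pixels)
    sum_all = sum(level * count for level, count in enumerate(hist))

    best_t, best_var = 0, -1.0
    w0 = 0     # number of pixels in the lower class
    sum0 = 0   # sum of gray levels in the lower class
    for t in range(256):
        w0 += hist[t]
        sum0 += t * hist[t]
        w1 = total - w0
        if w0 == 0:
            continue          # lower class still empty
        if w1 == 0:
            break             # upper class empty from here on
        mean0 = sum0 / w0
        mean1 = (sum_all - sum0) / w1
        between_var = w0 * w1 * (mean0 - mean1) ** 2
        if between_var > best_var:
            best_var, best_t = between_var, t
    return best_t


def binarize(gray, threshold):
    """Map a 2-D list of gray levels to 0/1, with 1 for dark (text) pixels."""
    return [[1 if p <= threshold else 0 for p in row] for row in gray]


# Hypothetical 2x4 image: dark strokes (about 10) on a light card (about 200).
gray = [[10, 12, 200, 205],
        [11, 198, 202, 9]]
t = otsu_threshold([p for row in gray for p in row])
binary = binarize(gray, t)
```

A single global threshold is sensitive to the uneven lighting the abstract mentions, which is presumably why the pipeline also includes a local binarization step applied per text block.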
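The study was tested on 50 name cards containing about 10,600 characters in total: about 9,550 normal characters and about 419 groups of touching characters, the latter amounting to about 1,050 characters. The average filtering accuracy is 92.14%, the segmentation accuracy is 98.57%, and the character segmentation accuracy is 99.71%.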

Abstract (English)

Owing to the rapid development of technology, electronic products have become smaller while gaining more powerful functions. An important issue is how to help users take full advantage of modern high-tech electronic products for storing and retrieving data while saving considerable human resources and operating time. The purpose of this research is to use a commercial digital camera to capture images of name cards or A4-size documents and to segment the English character images in those documents before Optical Character Recognition (OCR) is performed. Obtaining good recognition results requires a segmentation method that effectively solves the problem of touching characters.
Although digital cameras are portable and easy to use, they suffer from the effects of non-uniform light sources. Moreover, images captured by digital cameras are often slanted or blurred because of hand shake while taking pictures. For these reasons, touching characters are much more likely to appear after binarization than in images captured with traditional scanners.
In this thesis, we present an effective method for touching character segmentation. First, image preprocessing, including global binarization, connected-component labeling, and local binarization, is performed to extract the image information for later analysis. Next, a filtering mechanism is devised to separate out the correctly segmented characters. For the remaining touching characters, a segmentation method based on analyzing the peripheral features of the characters resolves the problem and produces correct segmentation results.
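The abstract does not define the peripheral features or the cut-point search, so the following is only a sketch of one common reading: for each column of a binarized text block, measure how far the first ink pixel lies from the top edge and from the bottom edge, then propose a cut where the remaining stroke is thinnest. The names `peripheral_profiles` and `candidate_cut`, the 0/1 list-of-rows input, and the thinnest-column heuristic are assumptions for illustration and are not taken from the thesis.

```python
def peripheral_profiles(glyph):
    """Return (top, bottom) distance profiles for a 0/1 image.

    top[x]    = rows of background above the first ink pixel in column x
    bottom[x] = rows of background below the last ink pixel in column x
    Columns without ink get the full height in both profiles.
    """
    height = len(glyph)
    width = len(glyph[0])
    top, bottom = [], []
    for x in range(width):
        ink_rows = [y for y in range(height) if glyph[y][x]]
        if ink_rows:
            top.append(ink_rows[0])
            bottom.append(height - 1 - ink_rows[-1])
        else:
            top.append(height)
            bottom.append(height)
    return top, bottom


def candidate_cut(glyph):
    """Pick the interior column where the stroke extent is smallest.

    Extent is the span between the top and bottom profiles; a thin
    bridge between two touching characters yields a small extent.
    Assumes the block is at least three columns wide.
    """
    height = len(glyph)
    top, bottom = peripheral_profiles(glyph)
    extent = [max(0, height - t - b) for t, b in zip(top, bottom)]
    interior = range(1, len(extent) - 1)   # never cut at the block border
    return min(interior, key=lambda x: extent[x])


# Hypothetical block: two solid shapes joined by a one-pixel bridge.
JOINED = [
    [1, 1, 1, 0, 1, 1, 1],
    [1, 1, 1, 0, 1, 1, 1],
    [1, 1, 1, 1, 1, 1, 1],
    [1, 1, 1, 0, 1, 1, 1],
    [1, 1, 1, 0, 1, 1, 1],
]
top, bottom = peripheral_profiles(JOINED)
cut = candidate_cut(JOINED)
```

In this toy block the profiles dip only at the bridge column, so that column is the proposed cut. Real characters have thin strokes of their own, so a practical cut-point search would need additional checks beyond this heuristic.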
In the experiments, 50 name cards containing a total of 10,600 characters were tested. Of these, 9,550 are normal characters and the remaining 1,050 belong to 419 groups of touching characters. The average filtering accuracy is 92.14%, the segmentation accuracy is 98.57%, and the character segmentation accuracy is 99.71%. The results demonstrate that the proposed method can effectively segment touching characters.
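The character counts reported above can be checked with a few lines of arithmetic. The snippet uses only the figures given in the abstract; the derived quantities (characters per touching group and the share of characters that touch) are not reported in the abstract and are computed here for orientation only.

```python
# Character counts as reported in the abstract (50 name cards).
total_chars = 10600
normal_chars = 9550
touching_chars = 1050
touching_groups = 419

# The two categories should account for every character.
assert normal_chars + touching_chars == total_chars

# Derived figures, not stated in the abstract.
chars_per_group = touching_chars / touching_groups
touching_share = touching_chars / total_chars

print(f"characters per touching group: {chars_per_group:.2f}")
print(f"share of characters that touch: {touching_share:.1%}")
```

The counts are consistent: a touching group contains about 2.5 characters on average, and roughly one character in ten in the test set is part of a touching group.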

Keywords

★ Peripheral Feature (邊緣特徵)

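Table of Contents

Chinese Abstract
Abstract
Acknowledgments
Table of Contents
List of Figures
List of Tables
Chapter 1 Introduction
1-1 Research Background and Objectives
1-2 Related Work
1-3 System Architecture
1-4 Thesis Organization
Chapter 2 Image Preprocessing
2-1 Converting Color Images to Grayscale
2-2 Global Binarization
2-3 Text Block Labeling
2-4 Text Line Linking
2-5 TypoLine Setting
2-6 Orientation Correction
2-7 Local Binarization
2-8 Merging Broken Characters
Chapter 3 Touching Character Segmentation and Merging
3-1 Preprocessing for Touching Character Segmentation
3-2 Touching Character Filtering
3-2-1 Feature Extraction
3-2-2 Data Training
3-2-3 SVM Filtering
3-3 Applying Peripheral Features to Touching Character Segmentation
3-3-1 Peripheral Features of Text Blocks
3-3-2 Filtering with Peripheral Features
3-3-3 Cut-Point Search Mechanism
3-3-4 Peripheral Feature Segmentation
3-4 Postprocessing for Touching Character Segmentation
Chapter 4 Experimental Results
4-1 Evaluation of Filtering
4-2 Evaluation of Touching Character Segmentation
4-3 Performance Evaluation of Character Segmentation
Chapter 5 Conclusions and Future Work
5-1 Conclusions
5-2 Future Work
References
Appendix A Common Vocabulary for Elementary and Junior High Schools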

Advisors

Ming-Gang Wen (溫敏淦), Kuo-Chin Fan (范國清)

Date of Approval

2007-7-23
