以相機取像之中文文件辨識前處理系統

以作者查詢圖書館館藏

、以作者查詢臺灣博碩士

、以作者查詢全國書目

、勘誤回報

、線上人數：12

、訪客IP：3.16.29.218

姓名

黃自達(Tzu-Ta Huang) 查詢紙本館藏

畢業系所

資訊工程學系

論文名稱

以相機取像之中文文件辨識前處理系統
(Camera based Preprocessing System for ChineseDocument Image Recognition)

相關論文

★ 使用視位與語音生物特徵作即時線上身分辨識	★ 以影像為基礎之SMD包裝料帶對位系統
★ 手持式行動裝置內容偽變造偵測暨刪除內容資料復原的研究	★ 基於SIFT演算法進行車牌認證
★ 基於動態線性決策函數之區域圖樣特徵於人臉辨識應用	★ 基於GPU的SAR資料庫模擬器：SAR回波訊號與影像資料庫平行化架構 (PASSED)
★ 利用掌紋作個人身份之確認	★ 利用色彩統計與鏡頭運鏡方式作視訊索引
★ 利用欄位群聚特徵和四個方向相鄰樹作表格文件分類	★ 筆劃特徵用於離線中文字的辨認
★ 利用可調式區塊比對並結合多圖像資訊之影像運動向量估測	★ 彩色影像分析及其應用於色彩量化影像搜尋及人臉偵測
★ 中英文名片商標的擷取及辨識	★ 利用虛筆資訊特徵作中文簽名確認
★ 基於三角幾何學及顏色特徵作人臉偵測、人臉角度分類與人臉辨識	★ 一個以膚色為基礎之互補人臉偵測策略

檔案

[Endnote RIS 格式]

[Bibtex 格式]

[相關文章]

[文章引用]

[完整記錄]

[館藏目錄]

[檢視]

[下載]

本電子論文使用權限為同意立即開放。
已達開放權限電子全文僅授權使用者為學術研究之目的，進行個人非營利性質之檢索、閱讀、列印。
請遵守中華民國著作權法之相關規定，切勿任意重製、散佈、改作、轉貼、播送，以免觸法。

摘要(中)

文件傳達許多重要的資訊，如何將文件影像數位化並擷取文字訊息這個議題，隨著數位相機的普及而逐漸受到重視。為了獲得正確的文字辨識結果，一個以相機取像的中文文件辨識前處理系統，必須能處理不同排版格式、多種字體大小的行列、與使用者拍攝產生的文件輕微歪斜等問題，使得擷取出的文字區塊不會產生嚴重的謬誤。
中文文件與英文文件前處理最大的不同，是中文字由多個相連元件組成，如何將組成中文字的相連元件正確合併來進行中文字切割，是擷取中文字訊息最重要的步驟。本文提出的中文字的行列串連演算法與中文字部件合併判斷法則，可以克服影像小角度傾斜與中英字元混雜時的中文字合併問題，並能提供文字區塊讀序的資訊。另外論文中提出兩個文字訊息的保護機制:第一個是反白字區域偵測，通常反白字於相連元件抽取時會被視為背景雜訊過濾掉，為了保護資料完整性，有必要對反白字組成原件另外偵測；第二個是非正向文件的偵測，為了相機取像的便利性，以及使文件內容有較清晰的入鏡範圍，經常有將文件平面垂直光軸旋轉的需要，通常拍攝出的文件影像為接近正矩形的影像，但若要呈現正向的文字內容，可能仍需要旋轉0o、90o、180o、270o四種情形的校正。本研究提出一個以統計為基礎的方法，透過分析中文字筆劃的輪廓像素的方向性與文章中的中文字垂直投影波型，總合判斷中文文件的旋轉方向，提供文字識別模組，一個自動化的方向判斷機制。
本論文以名片測試本研究提出的辨識前處理系統，結果文字區塊正確切割擷取文字影像的成功率可達到98%，足以證明前處理系統設計方法的正確性。

摘要(英)

As we know, Chinese documents convey a lot of meaningful and useful information. Due to the popularization of digital cameras, it is convenient to take picture and retrieve important text information from the digitalized Chinese document images. A successful camera-based Chinese document processing system should overcome the problems resulted from various document formats, font sizes, and document skewing to extract correct text block without generating erroneous results.
The major difference between Chinese documents and English documents is that Chinese characters are mainly composed of multiple connected components. The most important step in obtaining the message of the existence of Chinese documents is to merge connected components with correct combining and produce complete Chinese character blocks. In this thesis, we propose a method to link Chinese characters into text line and develop a rule to discriminate the merging condition of ordering connected components to hypothesize the existence of skewing documents. Two mechanisms are developed in the thesis. The first mechanism is the detection of inversed text blocks which may be filtered out as oversize noise blocks in the preprocessing. The second mechanism is the detection of document images laid in incorrect direction because sometimes people will rotate camera 90o or 270o to capture document images. A two pass statistical method is proposed to automatically determine the rotating degree of documents images(0o、90o、180o、270o). The first step is devised by using the phenomenon that horizontal strokes appear more frequently than vertical strokes in Chinese characters. The second step is devised by analyzing the vertical projection histogram of each text block and defining keywords that assist in deciding the rotating degree.

關鍵字(中)

★ 前處理系統
★ 中文字切割
★ 中文文件方向校正

關鍵字(英)

★ chinese character segmentation
★ preprocessing System
★ direction rectification of chinese document

論文目次

中文摘要 iii
Abstract v
總目錄 viii
圖目錄 x
表目錄 xiii
第一章 1
1.1 研究動機 1
1.2 文獻回顧 2
1.2.1 文字切割 2
1.2.2 文件方向校正 4
1.2.3 文件排版分析 5
1.3 論文架構 6
第二章文字組成元件偵測 8
2.1 彩色至灰階轉換 8
2.2 灰階至二值化的轉換 9
2.4 反白字區域偵測 15
第三章非正向文件偵測與校正 23
3.1 文件90 o或270 o旋轉偵測 25
3.2 正反向文件之判別 26
3.2.1 垂直投影量統計分析 29
3.2.2 輔助方向判讀之關鍵字選取 30
3.2.3 文件旋轉方向判讀範例 33
第四章文字行擷取與行中文字切割 37
4.1 文字區域群聚切割 37
4.1.1 遞迴水平垂直切割方法 39
4.2 文字行擷取 43
4.3 中文文字行合併 45
4.3.1 重疊相連元件行偵測 45
4.3.2 傾斜文件的異常合併檢查 46
4.3.3 文字行合併 49
4.4 相連元件語言辨識 50
4.4.1 特徵擷取 51
4.4.2 文字區塊分類器 53
4.5 中文字組成元件合併 55
第五章實驗結果 58
5.1 中文字合併效能評估 58
5.2 方向判別關鍵字效能評估 65
第六章結論與未來工作 68
6.1 結論 68
6.2 未來工作 69
參考文獻 70
附錄A 72

參考文獻

[1] F. Chang, C. H. Chou, C. C. Lin, and C. J. Chen “A Prototype Classification Method and its Application to Handwritten Character Recognition”. International Conference on Systems, Man and Cybernetics, 2004
[2] Y. Cao, H. Li, "Skew Detection and Correction in Document Images Based on Straight-Line Fitting," Pattern Recognition Letters, , Vol. 24, No. 12, pp. 1871-1879, August. 2003.
[3] R. G.. Casey and E. Lecolinet, “A Survey of Methods and Strategies in Character Segmentation”, IEEE Transaction on Pattern Recognition Analysis and Machine Intelligence. Vol 18, No.7, pp 690 - 706, 1996.
[4] J. Ha, R. M. Haralick and I. T. Phillips“Recursive X-Y Cut using Bounding boxes of Connected Components”, International Conference on Document Analysis and Recognition, Vol 2, 1995
[5] L. Jagannathan and C.V. Jawahar, “perspective correction methods for camera-based document analysis” . International Workshop on Camera-based Document Analysis and Recognition, 2005
[6] Y. Liu, S. Goto, T. Ikenaga. “A Robust Algorithm for Text Detection in Color Images”. International Conference on Document Analysis and Recognition, 2005
[7] P. S. Liao, T. S. Chen, and P. C. Chung. “A Fast Algorithm for Multilevel Thresholding.” Journal of Information Science and Engineering, Vol 17, pp 713-737, 2001.
[8] M. A. Fischler, R. C. Holles. “Random Sample Consensus: A Paradigm for Model Fitting with Applications to Image Analysis and Automated Cartography.” Comm. of the ACM, Vol 24, pp 381-395, 1981.
[9] P. Clark, and M. Mirmehdi, “Rectifying perpective views of text in 3D scenes using vanishing points,” Pattern Recognition, Vol. 36, 2673-2683, 2003.
[10] H. M. Sun , “Page Segmentation for Manhattan anf Non-Manhattan Layout Documents via Selective CRLA”. International Conference on Document Analysis and Recognition, 2005
[11] Y. Zhong, K. Karu, and A.K. Jain, "Locating Text in Complex Color Images," Pattern Recognition, Vol. 28, No. 10, pp. 1,523-1,536, Oct. 1995.
[12] S. Zhao, Z. Chi, P. Shi and Q. Wang, “Handwritten Chinese Character Segmentation Using a Two-Stage Approach”, International Conference on Document Analysis and Recognition, 2001
[13] 王亮聖, ”利用文件分析作文件之無失真重現”, 國立中央大學資訊工程研究所博士論文, 中華民國86年六月
[14] 林宗勳,Support Vector Machines簡介 ,台灣大學資訊工程研究所, 2000.
[15] 林家禎, ”中英文名片商標的擷取及辨識”, 國立中央大學資訊工程研究所碩士論文, 中華民國90年六月
[16] 李祐昇, “利用小波轉換自動偵測影像中的文字”, 國立台灣大學資訊管理研究所碩士論文, 中華民國89年六月
[17] 孫宇、江崇禮、董明, “GIS電子地圖中文字標注方向矯正演算法” ,大連理工大學學報, Vol. 42, 2002.
[18] 賴逸嶺, ”中文名片處理系統”, 國立中央大學電機工程研究所碩士論文, 中華民國87年六月
[19] 廖紹鋼(編譯), Gonzalez Woods(原著), ”數位影像處理”, 普林斯頓國際有限公司,第二版
[20] 教育部全球資訊網
[21] http://www.cs.mcgill.ca/~aghnei/square.html
[22] 維基百科 : http://zh.wikipedia.org/w/index.php?title=%E9%83%A8%E9%A6%96&variant=zh-tw

指導教授

范國清、溫敏淦
(Kuo-Chin Fan、Ming-Gang Wen)

審核日期

2007-7-23

推文