摘要(英) |
Due to the rapid development of scientific technology, electronic products have become smaller with the adding of stronger functions. It is an important issue to assist user how to fully utilize modern Hi-Tech electronic products in storing and retrieving data while saving tremendous human resources and operation time. The purpose of this research is to use a commercialized digital camera to capture the images of name cards or A4-size documents while achieving the goal of segmenting English character images from the documents before performing the task of Optical Character Recognition (OCR). It is important to devise a good segmentation method that can effectively solve the problems of touching characters to obtain good recognition results.
Although digital cameras are portable and easy to use, they suffer the problems resulted from the effects of non-uniform light sources. Moreover, the images captured by digital cameras always slanting or blurring due to the vibration or shaking of hands in taking pictures. Due to the above reasons, the appearing probability of touching characters after binarization becomes much higher comparing with the images captured by using traditional scanners.
In this thesis, we present an effective method to achieve the goal of touching character segmentation. Firstly, image preprocessing is performed including global binarization, connected-component labeling and local binarization to extract the image information for later analysis. Next, a filtering mechanism is devised to segment the correct characters. As to the touching characters, a segmentation method developed by analyzing the peripheral features of character can effectively resolve the problem and produce correct segmentation result.
In the experiments, 50 name cards are tested with totally 10600 characters. Among them, 9550 characters are normal characters and 419 groups of touching characters with 1050 characters are the rest. The average filter accuracy rate is 92.14%, segmentation accuracy rate is 98.57%, and character segmentation accuracy rate is 99.71%. The results demonstrate that the proposed method can effectively segment touching characters. |
參考文獻 |
[1]. A. Zramdini, and R. Ingold, "Optical Font Recognition Using Typographical Features", IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 20, No. 8, August. 1998.
[2]. A. Nomura, K. Michishita, S. Uchida, and M. Suzuki, "Detection and Segmentation of Touching Characters in Mathematical", International Conference on Document Analysis and Recognition , 2003.
[3]. http://www.csie.ntu.edu.tw/~cjlin/libsvm/
[4]. K. Gebze and I. Bebek, "Survey over image thresholding techniques and quantitative performance evaluation", Journal of Electronic Imaging 13(1), 146–165, January 2004.
[5]. M.C. Jung, Y.C. Shin and S.N. Srihari, "Machine Printed Character Segmentation Method using Side Profiles", IEEE International Conference on Systems, Man and Cybernetics , 1999.
[6]. N. Otsu, " A threshold selection method from gray level histograms ", IEEE Transactions on Systems, Man, and Cybernetics, Vol. SMC-9, No. 1, pp.62-66, 1979.
[7]. R.C. Gonzalez , R.E. Woods , "Digital Image Processing, 2nd ed".
[8]. S. Watcharabutsarakham, "Using Projection and Loop for Segmentation of Touching Thai Type", International Symposium on Communications and Information Technologies, 2004.
[9]. S. Kahan, T. Pavlidis, H.S. Baird, "On the Recognition of Printed Characters of Any Font and Size", IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. PAMI-9, pp. 274-287, March 1987.
[10]. S. Liang , M. Ahmadi, M. Shridhar, "Segmentation of Touching Characters in Printed Document Recognition", International Conference on Document Analysis and Recognition, 1993.
[11]. U.K.S. Jayarathna and G.E.M.D.C. Bandara, "A Junction Based Segmentation Algorithm for Offline Handwritten Connected Character Segmentation", Computational Intelligence for Modelling, Control and Automation, 2006.
[12]. Y.K. Chen and J.F. Wang, "Segmentation of Single- or Multiple-Touching Handwritten Numeral String Using Background and Foreground Analysis", IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 22, No. 11, November 2000.
[13]. Y. Lu, "On the Segmentation of Touching Characters", International Conference on Document Analysis and Recognition, 1993.
[14]. 維基百科 , http://en.wikipedia.org/wiki/RGB_color_space
[15]. 維基百科 , http://en.wikipedia.org/wiki/HSL_color_space
[16]. 維基百科 , http://en.wikipedia.org/wiki/Support_vector_machine |