摘要(英) |
Due to the rapid development of scientific technology, electronic products becomes more advanced functions. It used to use scanner to get high resolution 2D image, but now we can do that by using ordinary digital cameras. Portability and convenience of using digital cameras. Character recognition is no longer limited by recognizing 2D characters. It can also be extended to recognize 3D characters because of the inherent characteristic of digital cameras.
Since the images and texts are no longer planes, the recognition of them has brings in many problems. The major problem is that the plane to which the texts belong causes the sphere change of texts. When it formed the images of cylinder, the texts locate on both sides of the cylinder will slant because the camera is not totally horizontal in the images. In addition to the change of texts, the texts horizontal will be transformed into a curve which increases the difficulty in recognizing the texts.
In this thesis, we present an effective method to correct the text images of cylinder sphere. Firstly, image preprocessing is performed including global binarization and connected-component labeling to extract the image information. Next, a modified connected-component labeling is employed to will correct the labeling characters, and link these components to a curved text line. After the using of regression analysis to analyze the curve function, we will correct the curved text line to a horizontal line. The result can be used to facilitate later segmentation and recognition system.
|
參考文獻 |
[1]. H. Yan, “Fuzzy Curve-Tracing Algorithm”, IEEE Transactions on Systems, Msn, and Cybernetics-PartB: CyberNetics, Vol.31, No.5, October 2001.
[2]. H. Yan, “Detection of Curved Text Path Based on the Fuzzy Curve-tracing (FCT) Algorithm”, International Journal on Document Analysis and Recognition, 2001.
[3]. B. S. Y Lum and H. Yan, “Complex Curve Tracing Based on a Minimum Spanning Tree Model and Regulrized Fuzzy Clustering”, International Conference on Image Processing, 2004.
[4] G. Toussaint, “Solving Geometric Problems with the Rotating Calipers”, IEEE proceedings In MELECON, 1983.
[5]. A. Zramdini and R. Ingold, “Optical Font Recognition Using Typographical Features”, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 20, No. 8, August 1998.
[6]. T. Yamaguchi, Y. Nakano, M. Maruyama, H. Miyao and T. Hananoi, “Digit Classification on Signboards for Telephone Number Recognition”, conference on the Seventh International Document Analysis and Recognition, 2003.
[7]. H. Goto, K. Aoba, H. Aso, “A Framework for Detecting and Selecting Text Line Candidates of Correct Orientation”, IEEE conference on 14th International Pattern Recognition, icpr, vol. 2, pp.1074, 1998.
[8]. H. Goto, H. Aso, “Extracting curved text lines using local linearity of the text line”, International Journal on Document Analysis and Recognition, 1999.
[9]. Z. Zhang, C. L. Tan, "Correcting Document Image Warping Based on Regression of Curve Text Lines ", IEEE Conference on the Seventh International Document Analysis and Recognition, 2003.
[10]. H. Hase, M. Yoneda, T. Shinokawa, C. Y. Suen, “Alignment of Free Layout Color Texts for Character Recognition”, Sixth International Conference on Document Analysis and Recognition, pp.0932, 2001.
[11]. Din-Chang Tseng, “Image Processing”, Institute of Compute. Sciences & Information Engineering National Central University.
[12]. N. Otsu, "A Threshold Selection Method from gray-level Histograms." IEEE International Conference on System, Man, and Cybernetics. vol. 9, pp. 62–66, 1979..
[13]. J. T. Tou and R. C. Gonzalez, “Pattern Recognition Principles.” Addision-Wesley Publishing Company, 1974.
[14]. S. S. Khan and A. Ahmad, “Cluster center initialization algorithm for K-means Clustering.” IEEE Pattern Recognition Letters, vol. 25, pp. 1293-1302, 2004..
|