摘要(英) |
Business cards usually convey two kinds of information. One is the personal information, such as holder’s name, address, telephone number, e-mail address, etc. The other is the information of the company, such as company name, logo, address, telephone number, etc. In business cards, the relationship between company name and logo is one-to-one mapping. Thus, if we can recognize the logo, we can also know the company name of the business card. The goal of this thesis is to acquire the company name of any business card by means of extraction and recognition of its corresponding logo. Once we have acquired the company name of the business card from its corresponding logo, we can use it to correct the OCR result of company name in the business card. Besides, the formats of business cards are usually the same for the same companies. Thus, we can use the layouts that are extracted from the business cards to classify them.
Extraction and recognition of logo are the focus of this thesis. In the extraction stage, an input business card is first segmented into many homogeneous blocks and each block is then given some attributes. Some rules are constructed based on the extracted attributes to extract the logo. In the recognition stage, the extracted logo is first normalized into and then transformed into matrix of wavelet coefficients. In our work, we use the coefficients of index [0,0] and 40 largest-magnitude as the features. Finally, the features of the considered logo are compared with those stored in logo database to obtain the recognition result.
In our experiments, 90 business cards are tested. Some are Chinese formats and some are English ones. Experimental results reveal the feasibility and validity of our proposed method. |
參考文獻 |
[1] Alghoniemy M. and A. H. Tewfik, “Geometric distortion correction through image normalization,” IEEE International Conference on Multimedia and Expo, Vol. 3, pp. 1291 -1294 , 2000.
[2] Bunke H. and J. Csirik, “Inference of edit costs using parametric string matching,” Pattern Recognition, Vol. 2, pp. 549 - 552, 1992.
[3] Chiou Y. H. and H. J. Lee, “Recognition of Chinese business cards,” IPPR Conference on Computer Vision, Graphics and Image Processing, pp. 438-447, 1995.
[4] Fu H. C., C. S. Chen and K. T. Sun, “Recognition of Chinese business cards,” Proc. of the 5th OCR & DA Conference, Hshinchu, Taiwan, R. O. C., pp. 169-175, 1996.
[5] Ha Jaekyu, R. M. Haralick and I. T. Phillips, “Recursive X-Y cut using bounding boxes of connected components,” Proc. of the 3th Int. Conf. on Document Analysis and Recognition, Vol. 2, pp. 952 -955, Aug., 1995.
[6] J. Bernsen, “Dynamic thresholding of grey-level images,” Proceedings of the 8th International Conference on Pattern Recognition, pp. 1251-1255, 1986.
[7] Marzal A. and E. Vidal, “Computation of normalized edit distance and applications,” IEEE Trans. on Pattern Analysis and Machine Intelligence, Vol. 15, pp. 926 -932, Sept., 1993.
[8] Shih F. Y., S. S. Chen, D. C. D. Hung and P. A. Ng, “A document segmentation, classification and recognition system,” Proc. of 2nd Int. Conf. Systems Integration, pp. 258-267, June, 1992.
[9] Shapiro V. A., P. K. Veleva and V. S. Sgurev, “An adaptive method for image thresholding,” Pattern Recognition, Vol.3, pp. 696 - 699, 1992.
[10] Saiga H., Y. Nakamura, Y. Kitamura, T. Morita, “An OCR System for Business Cards,” Proc. of the 2nd Int. Conf. on Document Anal. and Recognition, pp.802-805, Oct., 1993.
[11] Stollnitz E. J., T. D. DeRose and D. H. Salesin, Wavelets for computer graphics Theory and application, Morgan Kaufmann publishers Inc., San Francisco, California, 1996, Ch5, pp. 43-55.
[12] Sauvola J., T. Seppanen, S. Haapakoski and M. Pietikainen, “Adaptive document binarization,” Proc. of the 4th Int. Conf. on Document Analysis and Recognition, Vol. 1, pp. 147 -152, 1997.
[13] Wagner R. A. and M. J. Fischer, “The string-to-string correction problem,” Journal of the ACM, Vol. 21, No. 1, pp. 168 -173, 1974.
[14] Weigel A. and F. Fein, “Normalizing the weighted edit distance,” Pattern Recognition, Vol. 2, pp. 399 -402, 1994.
[15] Wang D. T., Yi Zhao, V. Beskin, D. D. C. Hung and D. Y. Chao, “Bandwidth Algorithm for String Matching,” IEEE International Conference on Systems, Man and Cybernetics, Vol. 1, pp. 328, Oct., 1995.
[16] Watanabe T. and Xiaoou Huang, “Automatic Acquisition of Layout Knowledge for Understanding Business Cards,” Proc. of the 4th Int. Conf. on Document Anal. and Recognition, Vol. 1, pp. 216 -220, Aug., 1997.
[17] 賴逸嶺, “中文名片處理系統”, 國立中央大學電機工程研究所碩士論文, 中華民國87年六月. |