參考文獻 |
[1] Q. Huang, Z. Liu, A. Rosenberg, D. Gibbon, and B. Shahraray, “Automated generation of news content hierarchy by integrating audio, video, and text information,” in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., vol. 6, 1999, pp. 3025–3028.
[2] W. Qi, L. Gu, H. Jiang, X.-R. Chen, and H.-J. Zhang, “Integrating visual, audio and text analysis for news video,” in Proc. Int. Conf. Image Process., vol. 3, 2000, pp. 520–523.
[3] MPEG-7 Description Schemes, ISO/IEC/JTC1/SC29/WG11/N2844, July 1999.
[4] MPEG Requirements Group, MPEG-7 Requirements Document, Doc. ISO/MPEG N2461, MPEG Atlantic City Meeting, October 1998
[5] MPEG-7 Description Schemes (V0.6), ISO/IEC/JTC1/SC29/WG11/M5040, Version 0.6-a, September 1999.
[6] C. Dorai, R. Bolle, N. Dimitrova, L. Agnihotri, “MPEG-7 Videotext Description Scheme, Doc. ISO/MPEG M5206, MPEG Melbourne Meeting”, October 1999.
[7] H. Li, D. Doermann, and O. Kia, “Automatic text detection and tracking in digital video,” IEEE Trans. Image Process., vol. 9, no. 1, Jan. 2000, pp. 147–156.
[8] Y. Zhong, H.-J. Zhang, and A. K. Jain, “Automatic caption localization in compressed video,” in Proc. Int. Conf. Image Process., vol. 2, 1999, pp. 96–100.
[9] R. Lienhart and A. Wernicke, “Localizing and segmenting text in images, videos and web pages,” IEEE Trans. Circuits Syst. Video Technol., vol. 12, no. 4, Apr. 2002, pp. 256–268.
[10] I. Sobel, “An isotropic 3_3 image gradient operator,” in Machine Vision for Three-Dimensional Scenes, H. Freeman, Ed. New York: Academic, 1990, pp. 376–379.
[11] N. Otsu, “A threshold selection method from gray-level histograms,” IEEE Trans. Syst., Man, Cybernet., vol. SMC-9, no. 1, Jan. 1979, pp. 62–66.
[12] N. Dimitrova, L. Agnihotri, C. Dorai, and R. Bolle, “MPEG-7 Videotext Descriptor for Superimposed Text in Images and Video”, Signal Processing: Image Communication, 16 (2000), October 2000, pp. 137-155.
[13] T. Sato, T. Kanade, E. K. Hughes, and M. A. Smith, “Video OCR for digital news archive,” in Proc. IEEE Workshop Content-Based Access Image Video Database, 1998, pp. 52–60.
[14] A. K. Jain and B. Yu, “Automatic text location in images and video frames,” Pattern Recognit., vol. 31, no. 12, 1998, pp. 2055–2076.
[15] L. Agnihotri and N. Dimitrova, “Text detection for video analysis,” in Proc. IEEE Workshop Content-Based Access Image Video Libraries, 1999, pp. 109–113.
[16] V. Y. Mariano and R. Kasturi, “Locating uniform-colored text in video frames,” in Proc. 15th Int. Conf. Pattern Recognit., vol. 4, 2000, pp. 539–542.
[17] D. Chen, K. Shearer, and H. Bourlard, “Text enhancement with asymmetric filter for video OCR,” in Proc. 11th Int. Conf. Image Anal. Process., 2001, pp. 192–197.
[18] B. T. Chun, Y. Bae, and T.-Y. Kim, “Text extraction in videos using topographical features of characters,” in Proc. IEEE Int. Fuzzy Syst. Conf., vol. 2, 1999, pp. 1126–1130.
[19] X. Gao and X. Tang et al., “Automatic news video caption extraction and recognition,” in Proc. LNCS 1983: 2nd Int. Conf. Intell. Data Eng. Automated Learning Data Mining, Financial Eng., Intell. Agents, K. S. Leung et al., Eds., Hong Kong, 2000, pp. 425–430.
[20] V. Wu, R. Manmatha, and E. M. Riseman, “Textfinder: An automatic system to detect and recognize text in images,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 21, no. 11, Nov. 1999, pp. 1224–1229.
[21] A. Wernicke and R. Lienhart, “On the segmentation of text in videos,” in Proc. IEEE Int. Conf. Multimedia Expo, vol. 3, Jul. 2000, pp. 1511–1514.
[22] M. Cai, J. Song, and M. R. Lyu, “A new approach for video text detection,” in Proc. Int. Conf. Image Process., Rochester, NY, Sep. 2002, pp. 117–120.
[23] C. Wolf, J.-M. Jolion, F. Chassaing, “Text localization, enhancement and binarization in multimedia documents” Pattern Recognition, 2002. Proceedings. 16th International Conference on, Volume 2, 11-15 Aug. 2002, pp. 1037 – 1040.
[24] S. Antani, D. Crandall, and R. Kasturi, “Robust extraction of text in video,” in Proc. 15th Int. Conf. Pattern Recognit., vol. 1, 2000, pp. 831–834.
[25] Lyu, M.R., Jiqiang Song, Min Cai, “A comprehensive method for multilingual video text detection, localization, and extraction”, IEEE Trans. Circuits Syst. Video Technol., Volume 15, Issue 2, Feb. 2005, pp. 243 – 255.
[26] S. Kwak, K. Chung, Y. Choi, “Video Caption Image Enhancement for an Efficient Character Recognition”, in Proc. 15th Int. Conf. Pattern Recognit., vol. 2, 2000, pp. 2606–2609. |