參考文獻 |
[1] Idcar 2015 incidental scene text-task 4.1: Text localization. https://rrc.cvc.uab.es/?ch=4&com=tasks. Accessed: 2020-03-17.
[2] Kolourpaint. https://kde.org/applications/en/graphics/org.kde. kolourpaint. Accessed: 2020-07-11.
[3] Ministry of transportation and communications. https://www.motc.gov.tw/ch/ index.jsp. Accessed: 2020-07-11.
[4] Motor vehicle office https://www.mvdis.gov.tw/. Accessed: 2020-07-11.
[5] Fred L. Bookstein. Principal warps: Thin-plate splines and the decomposition of deformations. IEEE Transactions on pattern analysis and machine intelligence, 11(6):567–585, 1989.
[6] Corinna Cortes and Vladimir Vapnik. Support-vector networks. Machine learning, 20(3):273–297, 1995.
[7] Ivan Culjak, David Abram, Tomislav Pribanic, Hrvoje Dzapo, and Mario Cifrek. A brief introduction to opencv. In 2012 proceedings of the 35th international convention MIPRO, pages 1725–1730. IEEE, 2012.
[8] Boris Epshtein, Eyal Ofek, and Yonatan Wexler. Detecting text in natural scenes with stroke width transform. In 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pages 2963–2970. IEEE, 2010.
[9] Naresh Garg and N Garg. Binarization techniques used for grey scale images. Inter- national Journal of Computer Applications, 71(1):8–11, 2013.
[10] google. Google vision ocr. https://cloud.google.com/vision. Accessed: 2020-07- 11.
[11] Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 770–778, 2016.
[12] Tin Kam Ho. The random subspace method for constructing decision forests. IEEE transactions on pattern analysis and machine intelligence, 20(8):832–844, 1998.
[13] Max Jaderberg, Karen Simonyan, Andrew Zisserman, et al. Spatial transformer networks. In Advances in neural information processing systems, pages 2017–2025, 2015.
[14] Alex Krizhevsky, Ilya Sutskever, and Geo↵rey E Hinton. Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems, pages 1097–1105, 2012.
[15] Yann LeCun, L ́eon Bottou, Yoshua Bengio, and Patrick Ha↵ner. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86(11):2278– 2324, 1998.
[16] Vladimir I Levenshtein. Binary codes capable of correcting deletions, insertions, and reversals. In Soviet physics doklady, volume 10, pages 707–710, 1966.
[17] Hui Li, Peng Wang, and Chunhua Shen. Towards end-to-end text spotting with convolutional recurrent neural networks. In Proceedings of the IEEE International Conference on Computer Vision, pages 5238–5246, 2017.
[18] Minghui Liao, Baoguang Shi, Xiang Bai, Xinggang Wang, and Wenyu Liu. Textboxes: A fast text detector with a single deep neural network. In Thirty-First AAAI Con- ference on Artificial Intelligence, 2017.
[19] Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott Reed, Cheng- Yang Fu, and Alexander C Berg. Ssd: Single shot multibox detector. In European conference on computer vision, pages 21–37. Springer, 2016.
[20] Jonathan Long, Evan Shelhamer, and Trevor Darrell. Fully convolutional networks for semantic segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 3431–3440, 2015.
[21] Simon M Lucas, Alex Panaretos, Luis Sosa, Anthony Tang, Shirley Wong, Robert Young, Kazuki Ashida, Hiroki Nagai, Masayuki Okamoto, Hiroaki Yamamoto, et al. Icdar 2003 robust reading competitions: entries, results, and future directions. Inter- national Journal of Document Analysis and Recognition (IJDAR), 7(2-3):105–122, 2005.
[22] Jiri Matas, Ondrej Chum, Martin Urban, and Toma ́s Pajdla. Robust wide- baseline stereo from maximally stable extremal regions. Image and vision computing, 22(10):761–767, 2004.
[23] Anand Mishra, Karteek Alahari, and CV Jawahar. Scene text recognition using higher order language priors. 2012.
[24] photock.jp. Free background dataset. https://www.photock.jp/. Accessed: 2020- 07-11.
[25] Olaf Ronneberger, Philipp Fischer, and Thomas Brox. U-net: Convolutional net- 60
works for biomedical image segmentation. In International Conference on Medical image computing and computer-assisted intervention, pages 234–241. Springer, 2015.
[26] Baoguang Shi, Xiang Bai, and Serge Belongie. Detecting oriented text in natural images by linking segments. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 2550–2558, 2017.
[27] Baoguang Shi, Xiang Bai, and Cong Yao. An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition. IEEE transactions on pattern analysis and machine intelligence, 39(11):2298–2304, 2016.
[28] Baoguang Shi, Mingkun Yang, Xinggang Wang, Pengyuan Lyu, Cong Yao, and Xiang Bai. Aster: An attentional scene text recognizer with flexible rectification. IEEE transactions on pattern analysis and machine intelligence, 41(9):2035–2048, 2018.
[29] Karen Simonyan and Andrew Zisserman. Very deep convolutional networks for large- scale image recognition. arXiv preprint arXiv:1409.1556, 2014.
[30] Ray Smith. An overview of the tesseract ocr engine. In Ninth international confer- ence on document analysis and recognition (ICDAR 2007), volume 2, pages 629–633. IEEE, 2007.
[31] Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, and Andrew Rabinovich. Going deeper with convolutions. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 1–9, 2015.
[32] Zhi Tian, Weilin Huang, Tong He, Pan He, and Yu Qiao. Detecting text in natural 61
image with connectionist text proposal network. In European conference on computer vision, pages 56–72. Springer, 2016.
[33] Kai Wang, Boris Babenko, and Serge Belongie. End-to-end scene text recognition. In 2011 International Conference on Computer Vision, pages 1457–1464. IEEE, 2011.
[34] Song Yuheng and Yan Hao. Image segmentation algorithms overview. arXiv preprint arXiv:1707.02051, 2017. |