參考文獻 |
[1] J. J. Lee, P. H. Lee, S. W. Lee, A. Yuille, C. Koch, “AdaBoost for text detection in natural scene.” IEEE International Conference on Document Analysis and Recognition(ICDAR), pp. 429-434, 2011.
[2] R. Minetto, N. Thomeb, M. Cord, “T-HOG: an effective gradient-based descriptor for single line text regions.” Pattern Recognition, vol.46(3), pp. 1078-1090, 2013.
[3] A. Bissacco, M. Cummins, Y. Netzer, H. Neven, “PhotoOCR: Reading Text in Uncontrolled Conditions.” IEEE International Conference on Computer Vision(ICCV), 2013.
[4] A. Mishra, K. Alahari, C. V. Jawahar, “Top-down and bottom-up cues for scene text recognition.” IEEE International Conference Computer Vision and Pattern Recognition (CVPR), 2012.
[5] K. Wang, B. Babenko, S. Belongie, “End-to-end scene text recognition.” IEEE International Conference on Computer Vision(ICCV), 2011.
[6] T. Wang, D. J. Wu, A. Coates, A. Y. Ng, “End-to-end text recognition with convolutional neural network.” IEEE International Conference on Pattern Recognition (ICPR), 2012.
[7] N. Dalal, B. Triggs, “Histograms of oriented gradients for human detection.” IEEE International Conference Computer Vision and Pattern Recognition (CVPR), 2005.
[8] A. Coates, B. Carpenter, C. Case, S. Satheesh, B. Suresh, T. Wang, D. J. Wu, A. Y. Ng, “Text detection and character recognition in scene images with unsupervised feature learning.” IEEE International Conference on Document Analysis and Recognition, pp. 440–445, 2011.
[9] Y. F. Pan, X. Hou, C. L. Liu, "Text localization in natural scene images based on conditional random field." IEEE International Conference on Document Analysis and Recognition(ICDAR), 2009.
[10] J. Matas, O. Chum, M. Urban, T. Pajdla, "Robust wide-baseline stereo from maximally stable extremal regions." Image and Vision Computing vol.22(10), pp.761-767, 2004.
[11] B. Epshtein, O. Eyal, W. Yonatan, "Detecting text in natural scenes with stroke width transform." IEEE International Conference Computer Vision and Pattern Recognition (CVPR), 2010.
[12] C. Yao, X. Bai, W. Liu, Y. Ma, Z. Tu, “Detecting texts of arbitrary orientations in natural images.” IEEE International Conference Computer Vision and Pattern Recognition (CVPR), 2012.
[13] W. Huang, Z. Lin, J. Yang, J. Wang, “Text localization in natural images usingstroke feature transform and text covariance descriptors.” IEEE International Conference on Computer Vision (ICCV), 2013.
[14] L. Neumann, K. Matas, “Text localization in real-world images using eficiently pruned exhaustive search.” IEEE International Conference on Document Analysis and Recognition (ICDAR), 2011.
[15] L. Neumann, K. Matas, “Real-time scene text localization and recognition.” IEEE International Conference Computer Vision and Pattern Recognition (CVPR), 2012.
[16] W. Huang, Q. Yu, X. Tang, "Robust scene text detection with convolution neural network induced mser trees." European Conference on Computer Vision (ECCV), 2014.
[17] M. Jaderberg, K. Simonyan, A. Vedaldi, A. Zisserman, "Reading text in the wild with convolutional neural networks." International Journal of Computer Vision (IJCV), vol.116(1), pp.1-20, 2016.
[18] Z. Zhang, C. Zhang, W. Shen, C. Yao, W. Liu, X. BaiZhang, "Multi-oriented text detection with fully convolutional networks." IEEE International Conference Computer Vision and Pattern Recognition (CVPR), 2016.
[19] T. He, W. Huang, Y. Qiao, J. Yao, "Text-attentional convolutional neural network for scene text detection." IEEE Transactions on Image Processing, vol.25(6), pp.2529-2541, 2016.
[20] 葉怡成, ”類神經網路模式應用與實作.” 儒林圖書有限公司, 2004.
[21] W.S. McCulloch, W. Pitts, "A logical Calculus of the Ideas Immanent in Nervous activity." Bulletin of Mathematical Biophysics, vol.5, pp.115-133,1943.
[22] D.O. Hebb, “Organization of Behavior.” Wiley, 1949.
[23] F. Rosenblatt, "The perception: A probabilistic model for information storage and organization in the brain." Psychological Review, vol. 65, pp.386-408,1958.
[24] M. L. Minsky, S. Papert, “An Introduction to Computational Geometry.” MIT Press, 1969.
[25] D. E. Rumelhart, G. E. Hinton, R. J. Williams, “Learning representations by back-propagating errors.” Nature, vol.323 (6088), pp.533- 536, 1986.
[26] Y. LeCun, B. Boser, J. S. Denker, D. Henderson, R. E. Howard, W. Hubbard, L. D. Jackel, “Backpropagation applied to handwritten zip code recognition.” Neural computation, pp.541-551, 1989.
[27] Y. Sun, X. Wang, X. Tang, “Deep learning face representation from predicting 10,000 classes.” IEEE International Conference on Computer Vision and Pattern Recognition(ICPR), pp.1891-1898, 2014.
[28] Y. Taigman, M. Yang, M. Ranzato, L. Wolf, “Deepface: closing the gap to human-level performance in face verification.” IEEE International Conference on Computer Vision and Pattern Recognition(ICPR), pp.1701-1708, 2014.
[29] D. Silver, A. Huang, C. J. Maddison, A. Guez, L. Sifre, G. Driessche, J. Schrittwieser, I. Antonoglou, V. Panneershelvam, M. Lanctot, S. Dieleman, D. Grewe, J. Nham, N. Kalchbrenner, I. Sutskever, T. Lillicrap, M. Leach, K. Kavukcuoglu, T. Graepel, D. Hassabis, "Mastering the game of Go with deep neural networks and tree search," Nature,vol. 529(7587), pp.484-489, 2016.
[30] J. Deng, W. Dong, R. Socher, “Imagenet: A large-scale hierarchical image database.” IEEE International Conference on Computer Vision and Pattern Recognition(ICPR), 2009.
[31] W. Wang, B. C. Ooi, W. Y. Yang, “Effective multimodal retrieval based on stacked auto-encoders.” The Proceedings of the VLDB Endowment (PVLDB), vol.7 (8), pp.649 – 660, 2014.
[32] L. Yandong, H. Zongbo, L. Hang, “Survey of convolutional neural network.” Journal of Computer Applications, vol.36(9), pp.8-2515, 2016.
[33] D. H. Hubel, T. N. Wiesel, “Receptive fields, binocular interaction, and functional architecture in the cat′s visual cortex.” Journal of Physiology, vol.160, pp.106-154, 1962.
[34] K. Fukushima1, “A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position.” Biological Cybernetics, vol.36(4), pp.193-202, 1980.
[35] Y. LeCun, B. Boser, J. S. Denker, D. Henderson, R. E. Howard, W. Hubbard, L. D. Jackel, “Backpropagation Applied to Handwritten Zip Code Recognition.” Neural Computation, vol.1(4), pp.541-551, 1989.
[36] Y. LeCun, L. Bottou, Y. Bengio, P. Haffner, “Gradient-Based Learning Applied to Document Recognition.” Proceedings of the IEEE, vol. 86(11), pp. 2278-2324,1998.
[37] A. Krizhevsky, I. Sutskever, G. E. Hinton, “ImageNet classification with deep convolutional neural networks.” Advances in Neural Information Processing Systems 25, pp.1097-1105, 2012.
[38] K. Simonyan, A. Zisserman, “Very deep convolutional networks for large-scale image recognition.” IEEE International Conference on Computer Vision and Pattern Recognition(ICPR), 2015.
[39] C. Szegedy, W. Liu, Y. Jia, “Going deeper with convolutions.” IEEE Conference on Computer Vision and Pattern Recognition(ICPR), pp.01-08, 2015.
[40] K. He, X. Zhang, S. Ren, “Deep residual learning for image recognition.” IEEE Conference on Computer Vision and Pattern Recognition(ICPR), 2016.
[41] “Deep Learning Tutorial”, Chapter 6.
[42] K. L. Bouman, G. Abdollahian, M. Boutin, E. J. Delp, "A low complexity sign detection and text localization method for mobile applications." IEEE Transactions on multimedia: 922-934, 2011
[43] Signs N800 dataset: http://cobweb.ecn.purdue.edu/~ace/kbsigns/
[44] V. Dumoulin, F. Visin, “A guide to convolution arithmetic for deep learning.” arXiv preprint arXiv:1603.07285, 2016
|