References |
[1] J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei, “ImageNet: a large-scale hierarchical image database,” in Proc. IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), Miami, FL, Jun.20-25, 2009, pp.248-255.
[2] A. Krizhevsky, I. Sutskever, and G. E. Hinton, “ImageNet classification with deep convolutional neural networks,” in Proc. of Neural Information Processing Systems 2012 (NIPS 2012), Advances in Neural Information Processing Systems 25, Lake Tahoe, Nevada, Dec.3-8, 2012, pp.1-9.
[3] B. M. Lake, R. Salakhutdinov, and J. B. Tenenbaum, “Human-level concept learning through probabilistic program induction,” Science, vol.350, pp.1332-1338, 2015.
[4] L. A. Schmidt, Meaning and Compositionality as Statistical Induction of Categories and Constraints, Ph.D. dissertation, Dept. of Brain and Cognitive Sciences, Massachusetts Institute of Technology, MA, 2009.
[5] B. Lake, R. Salakhutdinov, J. Gross, and J. B. Tenenbaum, “One shot learning of simple visual concepts,” in Proc. Conf. on the Cognitive Science Society, Boston, MA, Jul.20-23, 2011, pp. 2-7.
[6] G. Hinton, L. Deng, D. Yu, G. Dahl, A. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. Sainath, and B. Kingsbury, “Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups,” IEEE Signal Processing Magazine, vol.29, Is.6, pp.82-97, 2012.
[7] T. Mikolov, M. Karafiat, L. Burget, J. Cernocky, and S. Khudanpur, “Recurrent neural network based language model,” in Interspeech Conf., Makuhari, Japan, Sep.26-30, 2010, pp.1045-1048.
[8] L. Fei-Fei, R. Fergus, and P. Perona, “One-shot learning of object categories,” IEEE Trans. on Pattern Analysis and Machine Intelligence, vol.28, Is.4, pp.594-611, 2006.
[9] S. Ren, K. He, R. Girshick, and J. Sun, “Faster R-CNN: Towards real-time object detection with region proposal networks,” IEEE Trans. on Pattern Analysis and Machine Intelligence, vol.39, Is.6, pp.1137-1149, 2017.
[10] J. Long, E. Shelhamer, and T. Darrell, “Fully convolutional networks for semantic segmentation,” in Proc. of IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), Boston, MA, Jun.7-12, 2015, pp.3431-3440.
[11] P. Pinheiro and R. Collobert, “From image-level to pixel-level labeling with convolutional networks,” in Proc. of IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), Boston, MA, Jun.7-12, 2015, pp.1713-1721.
[12] J. Wang, Y. Yang, J. Mao, Z. Huang, C. Huang, and W. Xu, “CNN-RNN: A unified framework for multi-label image classification,” in Proc. of IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, Jun.26 - Jul.1, 2016, pp.2285-2294.
[13] L. Wang, W. Ouyang, X. Wang, and H. Lu, “Visual tracking with fully convolutional networks,” in Proc. of IEEE Int. Conf. on Computer Vision (ICCV), Santiago, Chile, Dec.11-18, 2015, pp.3119-3127.
[14] N. Chawla, K. Bowyer, L. Hall, and W. Kegelmeyer, “SMOTE: synthetic minority over-sampling technique,” Journal of Artificial Intelligence Research (JAIR), vol.16, Is.1, pp.321-357, 2002.
[15] D. Tax, One-class Classification: Concept-learning in the Absence of Counter-examples, Ph.D. dissertation, Delft University of Technology, Netherlands, 2001.
[16] G. Koch, R. Zemel, and R. Salakhutdinov, Siamese Neural Networks for One-Shot Image Recognition, M.Sc. thesis, Graduate Dept. of Computer Science, Univ. of Toronto, Canada, 2015.
[17] O. Vinyals, C. Blundell, T. Lillicrap, K. Kavukcuoglu, and D. Wierstra, “Matching networks for one shot learning,” in Proc. of Conf. on Neural Information Processing Systems (NIPS), Barcelona, Spain, Dec. 5-10, 2016.
[18] B. Hariharan and R. Girshick, “Low-shot visual recognition by shrinking and hallucinating features,” in Proc. of IEEE Int. Conf. on Computer Vision (ICCV), Venice, Italy, Oct.22-29, 2017.
[19] R. Girshick, J. Donahue, T. Darrell, and J. Malik, “Rich feature hierarchies for accurate object detection and semantic segmentation,” in Proc. IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), Columbus, OH, Jun.23-28, 2014, pp.580-587.
[20] J. Uijlings, K. van de Sande, T. Gevers, and A. Smeulders, “Selective search for object recognition,” Int. Journal of Computer Vision (IJCV), vol.104, Is.2, pp.154-171, 2013.
[21] R. Girshick, “Fast R-CNN,” in Proc. of IEEE Int. Conf. on Computer Vision (ICCV), Santiago, Chile, Dec.11-18, 2015, pp.1440-1448.
[22] K. Simonyan and A. Zisserman, “Very deep convolutional networks for large-scale image recognition,” in Proc. Int. Conf. on Learning Representations (ICLR), San Diego, CA, May 7-9, 2015, pp.1150-1210.
[23] C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, and A. Rabinovich, “Going deeper with convolutions,” in Proc. of IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), Boston, MA, Jun.7-12, 2015, pp.1-9.
[24] K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for image recognition,” in Proc. of IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, Jun.27-30, 2016, pp.770-778.
[25] Y. Bengio, P. Simard, and P. Frasconi, “Learning long-term dependencies with gradient descent is difficult,” IEEE Trans. on Neural Networks, vol.5, Is.2, pp.157-166, 1994.
[26] S. Ioffe and C. Szegedy, “Batch normalization: accelerating deep network training by reducing internal covariate shift,” in Proc. Int. Conf. on Machine Learning (ICML), Lille, France, Jul.5-11, 2015, pp.448-456.
[27] K. He and J. Sun, “Convolutional neural networks at constrained time cost,” in Proc. of IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), Boston, MA, Jun.7-12, 2015, pp.5353-5360.
[28] K. He, X. Zhang, S. Ren, and J. Sun, “Spatial pyramid pooling in deep convolutional networks for visual recognition,” IEEE Trans. on Pattern Analysis and Machine Intelligence, vol.37, Is.9, pp.1904-1916, 2015.
[29] S. Zagoruyko and N. Komodakis, “Learning to compare image patches via convolutional neural networks,” in Proc. of IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), Boston, MA, Jun.7-12, 2015, pp.4353-4361.
[30] Y. Jia, E. Shelhamer, J. Donahue, S. Karayev, J. Long, R. Girshick, S. Guadarrama, and T. Darrell, “Caffe: Convolutional architecture for fast feature embedding,” in Proc. of the 22nd ACM Int. Conf. on Multimedia, Orlando, FL, 2014, pp.675-678.
[31] T. Beier and S. Neely, “Feature-based image metamorphosis,” in Proc. of the 19th Annual Conf. on Computer Graphics and Interactive Techniques, New York, NY, Jul. 1992, pp.35-42.
[32] I. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair, A. Courville, and Y. Bengio, “Generative adversarial nets,” in Proc. of Conf. on Neural Information Processing Systems (NIPS), Montréal, Canada, Dec.8-13, 2014, pp.2672-2680. |