參考文獻 |
[1] A. Krizhevsky, I. Sutskever, and G. Hinton. “Imagenet classification with deep convolutional neural networks.” In NIPS, 2012.
[2] O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma, Z. Huang, A. Karpathy, A. Khosla, M. Bernstein, et al. “Imagenet large scale visual recognition challenge.” arXiv:1409.0575, 2014.
[3] K. He, X. Zhang, S. Ren, and J. Sun. “Identity mappings in deep residual networks.” In ECCV, 2016
[4] K. He, X. Zhang, S. Ren, and J. Sun. “Deep residual learning for image recognition.” In CVPR, 2016.
[5] S. Zagoruyko and N. Komodakis. “Wide residual networks.” arXiv:1605.07146, 2016.
[6] S. Xie, R. Girshick, P. Dollar, Z. Tu, and K. He. “Aggregated residual transformations for deep neural networks.” In CVPR, 2017.
[7] G. Huang, Z. Liu, K. Q. Weinberger, and L. Maaten. “Densely connected convolutional networks.”, In CVPR, 2017.
[8] C. Szegedy, S. Ioffe, V. Vanhoucke, and A. Alemi. “Inceptionv4, inception-resnet and the impact of residual connections on learning.”, In ICLR Workshop, 2016.
[9] X. Zhang, X. Zhou, M. Lin, and J. Sun. “Shufflenet: An extremely efficient convolutional neural network for mobile devices.”, arXiv:1707.01083, 2017
[10] A. G. Howard, M. Zhu, B. Chen, D. Kalenichenko, W. Wang, T. Weyand, M. Andreetto, and H. Adam. “Mobilenets: Efficient convolutional neural networks for mobile vision applications.”, arXiv:1704.04861, 2017.
[11] D. Eigen, C. Puhrsch, and R. Fergus. “Depth map prediction from a single image using a multi-scale deep network.” arXiv:1406.2283, 2014.
[12] D. Eigen and R. Fergus, “Predicting depth, surface normals and semantic labels with a common multi-scale convolutional architecture,” arXiv:1411.4734, 2014
[13] Kelvin Xu, Jimmy Ba, Ryan Kiros, Kyunghyun Cho, Aaron C. Courville, Ruslan Salakhutdinov, Richard S. Zemel, and Yoshua Bengio, “Show, attend and tell: Neural image caption generation with visual attention.” In ICML, 2015.
[14] Q. Zhang, Y. N. Wu, and S.-C. Zhu. “Interpretable convolutional neural networks.” In CVPR, 2018.
[15] K. Simonyan and A. Zisserman. “Very deep convolutional networks for large-scale image recognition.” In ICLR, 2015.
[16] C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, and A. Rabinovich. “Going deeper with convolutions.” In CVPR, 2015
[17] O. Ronneberger, P. Fischer, and T. Brox, “U-Net: Convolutional Networks for Biomedical Image Segmentation,” Medical Image Computing and Computer-Assisted Intervention – MICCAI 2015, Cham, pp. 234-241, 2015.
[18] Girshick, R. B., Donahue, J., Darrell, T., and Malik, J. Rich “Feature hierarchies for accurate object detection and semantic segmentation.”, CVPR, 2014.
[19] R. Girshick. “Fast R-CNN. “In ICCV, 2015.
[20] W. Liu, D. Anguelov, D. Erhan, C. Szegedy, and S. Reed, “SSD: Single shot multibox detector,” arXiv:1512.02325, 2015.
[21] M. Liang and X. Hu. “Recurrent convolutional neural network for object recognition.” In CVPR, 2015.
[22] T.-Y. Lin, P. Dollar, R. Girshick, K. He, B. Hariharan, and S. Belongie. “Feature pyramid networks for object detection.” In CVPR, 2017.
[23] H. Zheng, J. Fu, T. Mei, and J. Luo. “Learning multi-attention convolutional neural network for fine-grained image recognition.” In ICCV, 2017.
[24] Brazil, G., Yin, X., Liu, X.: “Illuminating Pedestrians via Simultaneous Detection & Segmentation.” In ICCV, 2017.
[25] F. Wang, M. Jiang, C. Qian, S. Yang, C. Li, H. Zhang, X. Wang, and X. Tang. “Residual attention network for image classification.” In CVPR, 2017.
[26] C. Wah, S. Branson, P. Welinder, P. Perona, and S. Belongie. “The Caltech-UCSD Birds-200-2011 Dataset.” Technical Report CNS-TR-2011-001, California Institute of Technology, 2011
[27] Jun Fu, Jing Liu, Haijie Tian, Zhiwei Fang, and Hanqing Lu. “Dual attention network for scene segmentation.” arXiv:1809.02983, 2018.
[28] Aditya Khosla, Nityananda Jayadevaprakash, Bangpeng Yao and Li Fei-Fei. “Novel dataset for Fine-Grained Image Categorization. First Workshop on Fine-Grained Visual Categorization (FGVC)”, In CVPR, 2011.
[29] J. Hu, L. Shen, and G. Sun. “Squeeze-and-excitation networks.” arXiv:1709.01507, 2017.
[30] J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei, “ImageNet: A Large-Scale Hierarchical Image Database.” In CVPR, 2009.
[31] S. Ren, K. He, R. Girshick, and J. Sun. “Faster R-CNN: Towards real-time object detection with region proposal networks.”, In NIPS, 2015.
[32] T.-Y. Lin, P. Goyal, R. Girshick, K. He, and P. Dollar. “Focal loss for dense object detection.”, In ICCV, 2017
[33] Everingham, M., Van Gool, L., Williams, C. K. I., Winn, J. and Zisserman, A. “The PASCAL Visual Object Classes Challenge: A Retrospective”, International Journal of Computer Vision, 88(2), 303-338, 2010
|