參考文獻 |
[1] V. Badrinarayanan, A. Kendall and R. Cipolla, "SegNet: A Deep convolutional encoder-decoder architecture for image segmentation," in IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 39, no. 12, pp. 2481-2495, 2017.
[2] J. Long, E. Shelhamer and T. Darrell, "Fully convolutional networks for semantic segmentation." 2015 The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, U.S.A., 2015, pp. 3431-3440.
[3] O. Ronneberger, P. Fischer and T. Brox, "U-net: Convolutional networks for biomedical image segmentation." Medical Image Computing and Computer-Assisted Intervention(MICCAI), Munich, Germany, 2015, pp. 234-241.
[4] G. Lin, A. Milan, C. Shen and I. Reid, "RefineNet: multi-path refinement networks for high-resolution semantic segmentation," 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, U.S.A., 2017, pp. 5168-5177.
[5] Y. Lecun, L. Bottou, Y. Bengio and P. Haffner, "Gradient-based learning applied to document recognition," in Proceedings of the IEEE, vol. 86, no. 11, pp. 2278-2324, 1 Nov. 1998.
[6] L.-C. Chen, G. Papandreou, I. Kokkinos, K. Murphy and A. L. Yuille, "Semantic image segmentation with deep convolutional nets and fully connected CRFs," International Conference on Learning Representations (ICLR), San Diego, U.S.A., 2015, pp. 1-14.
[7] L. Chen, G. Papandreou, I. Kokkinos, K. Murphy and A. L. Yuille, "DeepLab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs," in IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 40, no. 4, pp. 834-848, 1 April 2018.
[8] F. Yu and V. Koltun, "Multi-scale context aggregation by dilated convolutions," International Conference on Learning Representations (ILCR), San Juan, U.S.A., 2016, pp. 1-13.
[9] D. Eigen and R. Fergus, "Predicting depth, surface normals and semantic labels with a common multi-scale convolutional architecture," The IEEE International Conference on Computer Vision (ICCV), Santiago, Chile, 2015, pp. 2650-2658.
[10] L. Chen, Y. Yang, J. Wang, W. Xu and A. L. Yuille, "Attention to scale: Scale-aware semantic image segmentation," The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, U.S.A., 2016, pp. 3640-3649.
[11] L. Chen, G. Papandreou, F. Schroff and H. Adam, "Rethinking atrous convolution for semantic image segmentation," arXiv preprint arXiv:1706.05587, 2017.
[12] H. Zhao, J. Shi, X. Qi, X. Wang and J. Jia, "Pyramid scene parsing network," 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, U.S.A., 2017, pp. 6230-6239.
[13] H. Noh, S. Hong and B. Han, "Learning deconvolution network for semantic segmentation," The IEEE International Conference on Computer Vision (ICCV), Santiago, Chile, 2015, pp. 1520-1528.
[14] O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma, Z. Huang, A. Karpathy, A. Khosla, M. Bernstein, A. C. Berg and F. Li, "ImageNet large scale visual recognition challenge," in International Journal of Computer Vision, vol. 115, no. 3, pp. 211-252, 1 Dec. 2015.
[15] H. Zhao, X. Qi, X. Shen, J. Shi and J. Jia, “ICNet for real-time semantic segmentation on high-resolution images,” arXiv preprint arxiv:1704.08545, 2018.
[16] A. Krizhevsky, I. Sutskever and G. E. Hinton, "ImageNet classification with deep convolutional neural networks," The 25th International Conference on Neural Information Processing Systems(NIPS′12), Lake Tahoe, U.S.A., 2012, vol.1, pp. 1097-1105.
[17] K. He, X. Zhang, S. Ren and J. Sun, "Deep residual learning for image recognition," 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, U.S.A., 2016, pp. 770-778.
[18] M. Cordts, M. Omran, S. Ramos, T. Rehfeld, M. Enzweiler, R. Benenson, U. Franke, S. Roth and B. Schiele, “The cityscapes dataset for semantic urban scene understanding,” 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, U.S.A., 2016, pp. 3213-3223.
[19] L. Chen, Y. Zhu, G. Papandreou, F. Schroff and H. Adam, “Encoder-decoder with atrous separable convolution for semantic image segmentation,” European Conference on Computer Vision (ECCV), Munich, Germany, 2018, pp. 833-851.
[20] K. Simonyan, A. Zisserman, “Very deep convolutional networks for large-scale image recognition.” arXiv preprint arXiv:1409.1556.
[21] Mollahosseini, Ali, David Chan, and Mohammad H. Mahoor. "Going deeper in facial expression recognition using deep neural networks." 2016 IEEE Winter conference on applications of computer vision (WACV). IEEE, 2016.
[22] Hu, Jie, Li Shen, and Gang Sun. "Squeeze-and-excitation networks." Proceedings of the IEEE conference on computer vision and pattern recognition. 2018.
[23] X. Wang, R. Girshick, A. Gupta and K. He, "Non-local neural networks." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2018.
[24] J. Fu, J. Liu, H. Tian, Z. Fang, H. Lu, “Dual attention network for scene segmentation.” arXiv preprint arXiv:1809.02983, 2018.
[25] F. Chollet. "Xception: Deep learning with depthwise separable convolutions." Proceedings of the IEEE conference on computer vision and pattern recognition. 2017.
[26] M. Yang, K. Yu, C. Zhang, Z. Li and K. Yang, "Denseaspp for semantic segmentation in street scenes." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2018.
[27] H. Zhang, K. Dana, J. Shi, Z. Zhang, X. Wang, A. Tyagi and A. Agrawal, “Context encoding for semantic segmentation.” Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2018.
[28] S. Woo, J. Park, L. Joon-Young and I. So Kweon, "Cbam: Convolutional block attention module." Proceedings of the European Conference on Computer Vision (ECCV). 2018.
|