參考文獻 |
[1] Y. LeCun, L. Bottou, Y. Bengio and P. Haffner, “Gradient-based learning applied to document recognition”, Proc. IEEE, vol.86, no.11, pp.2278-2324, 1998.
[2] J. Long, E. Shelhamer, and T. Darrell, “Fully convolutional networks for semantic segmentation,” in Proc. of IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), Boston, MA, Jun.7-12, 2015, pp.3431-3440.
[3] V. Badrinarayanan, A. Kendall, and R. Cipolla, "Segnet: a deep convolutional encoder-decoder architecture for image segmentation," IEEE Trans. on Pattern Analysis and Machine Intelligence, vol.39, no.12, pp.2481-2495, 2017.
[4] C. Peng, X. Zhang, G. Yu, G. Luo, and J. Sun, “Large kernel matters - improve semantic segmentation by global convolutional network,” in Proc. of IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, Jul.21-26, 2017, pp.4353-4361.
[5] P. Fischer, O. Ronneberger, and T. Brox, “U-Net: convolutional networks for biomedical image segmentation”, Medical Image Computing and Computer-Assisted Intervention MICCAI 2015, vol. 9351, pp.234-241, 2015.
[6] C. Yu, J. Wang, C. Peng, C. Gao, G. Yu, and N. Sang, “Learning a discriminative feature network for semantic segmentation,” in Proc. of IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, Jun.18-23, 2018, pp.1857-1866.
[7] L. Chen, Y. Zhu, G. Papandreou, and F. Schroff, H. Adam, “Encoder-decoder with atrous separable convolution for semantic image segmentation,” in Proc. of the European Conf. on Computer Vision (ECCV), Munich, DE, Sep.8-14, 2018, pp.801-818.
[8] K. Simonyan and A. Zisserman, “Very deep convolutional networks for large-scale image recognition,” arXiv:1409.1556v6.
[9] K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for image recognition,” in Proc. of IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, Jun.27-30, 2016, pp.770-778.
[10] X. Wang, R. Girshick, A. Gupta, and K. Hein, “Non-local neural networks,” in Proc. of IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, Jun.18-23, 2018, pp.7794-7803.
[11] J. Hu, L. Shen, and G . Sun, “Squeeze-and-excitation networks,” in Proc. of IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, Jun.18-23, 2018, pp.7132-7141.
[12] S. Woo, J. Park, J. Lee, and I. Kweon, “CBAM: convolutional block attention module,” in Proc. of the European Conf. on Computer Vision (ECCV), Munich, DE, Sep.8-14, 2018, pp.3-19.
[13] J. Fu, J. Liu, H. Tian, Y. Li, Y. Bao, Z. Fang, and H . Lu, “Dual attention network for scene segmentation,” in Proc. of IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, Jun.16-20, 2019, pp.3146-3154.
[14] H. Zhang, I. Goodfellow, D. Metaxas, and A. Odena, “Self-attention generative adversarial networks,” arXiv:1805.08318 [stat.ML].
[15] L. Chen, G. Papandreou, I. Kokkinos, K. Murphy, and A. Yuille, “Deeplab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs,” IEEE Trans. on Pattern Analysis and Machine Intelligence, vol.40, no.4, pp.834-848, 2018.
[16] L. Chen, G. Papandreou, F. Schroff, and H. Adam, “Rethinking atrous convolution for semantic image segmentation,” arXiv:1706.05587.
[17] A. Giusti, D. Ciresan, J. Masci, L. Gambardella, and J. Schmidhuber, “Fast image scanning with deep max-pooling convolutional neural networks,” in Proc. of IEEE Int. Conf. on Image Processing (ICIP), Melbourne, AU, Sep.15-18, 2013, pp.4034-4038.
[18] G. Papandreou, I. Kokkinos, and P.-A. Savalle, “Modeling local and global deformations in deep learning: epitomic convolution, multiple instance learning, and sliding window detection,” in Proc. of IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), Boston, MA, Jun.7-12, 2015, pp.390-399.
[19] P. Sermanet, D. Eigen, X. Zhang, M. Mathieu, R. Fergus, and Y. Le-Cun, “Overfeat: Integrated recognition, localization and detection using convolutional networks,” in Proc. of International Conference on Learning Representations Conf. (ICLR), Banff, CA, Apr.14-16, 2014.
[20] H. Zhao, J. Shi, X. Qi, X. Wang, and J. Jia, “Pyramid scene parsing network,” in Proc. of IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, Jul.21-26, 2017, pp.6230-6239.
[21] K. He, X. Zhang, S. Ren, and J. Sun, “Spatial pyramid pooling in deep convolutional networks for visual recognition,” arXiv:1406.4729v4.
[22] A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. Gomez, L. Kaiser, and I. Polosukhin, “Attention is all you need,” in Advances in Neural Information Processing Systems 30 (NIPS), Long Beach, CA, Dec.4-9, 2017, pp.6000-6010.
[23] B. Jimmy, M. Volodymyr, and K. Koray, “Multiple object recognition with visual attention,” arXiv:1412.7755.
[24] V. Mnih, N. Heess, and A. Graves et al., “Recurrent models of visual attention,” in Proc. of Neural Information Processing Systems (NIPS), Montreal, CA, Dec.8-13, 2014, pp.2204-2212.
[25] D. Wang, Z. Shen, J. Shao, W. Zhang, X. Xue, and Z. Zhang, “Multiple granularity descriptors for fine-grained categorization,” in Proc. of IEEE Conf. on International Conference on Computer Vision (ICCV), Santiago, Chile, Dec.11-18, 2015, pp.2399-2406.
[26] J. Fu, H. Zheng, and T. Mei, ‘‘Look closer to see better: recurrent attention convolutional neural network for fine-grained image recognition,’’ in Proc. of IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), Honolulu, Hawaii, Jul.21-26, 2017, pp. 4476–4484.
[27] S. Jetley, N. A. Lord, N. Lee, and P. H. Torr, ‘‘Learn to pay attention,’’ in Proc. Int. Conf. Learn. Representations (ICLR), Vancouver, CA, Apr.30-May.3, 2018, pp.1-14.
[28] H. Zhao, Y. Zhang, S. Liu, J. Shi, C. C. Loy, D. Lin, and J. Jia, “Psanet: point-wise spatial attention network for scene parsing,” in Proc. of the European Conf. on Computer Vision (ECCV), Munich, DE, Sep.8-14, 2018, pp.267-283.
[29] Y. Yuan and J. Wang, “Ocnet: object context network for scene parsing,” arXiv:1809.00916.
[30] Y. Du, C. Yuan, B. Li, L. Zhao, Y. Li, and W. Hu, ”Interaction-aware spatio-temporal pyramid attention networks for action classification,” in Proc. of the European Conf. on Computer Vision (ECCV), Amsterdam, Netherlands, Oct.8-16, 2016, pp.388-404.
[31] H. Zhang, K. Dana, J. Shi, Z. Zhang, X. Wang, A. Tyagi, and A. Agrawal, “Context encoding for semantic segmentation,” in Proc. of IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, Jun.18-23, 2018, pp.7151-7160.
[32] H. Qin, W. Chihao, X. Chunyang, W. Ye, Kuo, and C.-C. Jay, “Semantic segmentation with reverse attention,” in Proc. of the British Machine Vision Conference (BMVC), London, UK, Sep.4-7, 2017, pp.1-13.
[33] S. Ioffe and C. Szegedy, “Batch normalization: accelerating deep network training by reducing internal covariate shift,” in Proc. of ICML Conf. , Lille, France, Jul.7-9, 2015, vol.37, pp.448-456.
[34] Bing Xu, Naiyan Wang, Tianqi Chen, and Mu Li, “Empirical evaluation of rectified activations in convolutional network,” arXiv:1505.00853.
[35] X. Xiao, et al., “Weighted res-unet for high-quality retina vessel segmentation,” in Proc. of IEEE Int. Conf. on Information Technology in Medicine and Education (ITME), Hangzhou, PRC, Oct.19-21, 2018, pp. 327-331.
[36] O. Oktay, J. Schlemper, L. L. Folgoc, et al., “Attention u-net: learning where to look for the pancreas,” arXiv:1804.03999.
[37] J. He, Z. Deng, L. Zhou, Y. Wang, and Y. Qiao, “Adaptive pyramid context network for semantic segmentation,” in Proc. of IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, Jun.16-20, 2019, pp.7519-7528.
[38] J. Konig, M. D. Jenkins, P. Barrie, M. Mannion, and G. Morison, ‘‘A convolutional neural network for pavement surface crack segmentation using residual connections and attention gating,’’ in Proc. of IEEE Int. Conf. on Image Processing (ICIP), Taipei, ROC, Sep.22-25, 2019, pp. 1460-1464.
[39] C. Kaul, S. Manandhar, and N. Pears, ‘‘FocusNet: an attention-based fully convolutional network for medical image segmentation,’’ in Proc. IEEE 16th Int. Symposium on Biomedical Imaging (ISBI), Hilton Molino Stucky, Venice, Italy, Apr.8-11, 2019, pp.455-458.
[40] H. Li, P. Xiong, J. An, and L. Wang, “Pyramid attention network for semantic segmentation,” arXiv:1805.10180.
[41] D. P. Kingma and J. Ba, “Adam: a method for stochastic optimization,” arXiv:1412.6980. |