參考文獻 |
[1] L. Wang et al., "UNetFormer: A UNet-like transformer for efficient semantic segmentation of remote sensing urban scene imagery," ISPRS Journal of Photogrammetry and Remote Sensing, vol. 190, pp. 196-214, 2022.
[2] Y. Bengio, "Deep learning of representations: Looking forward," in International conference on statistical language and speech processing, 2013: Springer, pp. 1-37.
[3] I. Goodfellow et al., "Generative adversarial nets," Advances in neural information processing systems, vol. 27, 2014.
[4] J. Long, E. Shelhamer, and T. Darrell, "Fully convolutional networks for semantic segmentation," in Proceedings of the IEEE conference on computer vision and pattern recognition, 2015, pp. 3431-3440.
[5] R. Kemker, C. Salvaggio, and C. Kanan, "Algorithms for semantic segmentation of multispectral remote sensing imagery using deep learning," ISPRS journal of photogrammetry and remote sensing, vol. 145, pp. 60-77, 2018.
[6] I. Kotaridis and M. Lazaridou, "Remote sensing image segmentation advances: A meta-analysis," ISPRS Journal of Photogrammetry and Remote Sensing, vol. 173, pp. 309-322, 2021.
[7] X.-Y. Tong et al., "Land-cover classification with high-resolution remote sensing images using transferable deep models," Remote Sensing of Environment, vol. 237, p. 111322, 2020.
[8] W. Zhao and S. Du, "Learning multiscale and deep representations for classifying remotely sensed imagery," ISPRS Journal of Photogrammetry and Remote Sensing, vol. 113, pp. 155-165, 2016.
[9] X. X. Zhu et al., "Deep learning in remote sensing: A comprehensive review and list of resources," IEEE geoscience and remote sensing magazine, vol. 5, no. 4, pp. 8-36, 2017.
[10] O. Ronneberger, P. Fischer, and T. Brox, "U-net: Convolutional networks for biomedical image segmentation," in Medical image computing and computer-assisted intervention–MICCAI 2015: 18th international conference, Munich, Germany, October 5-9, 2015, proceedings, part III 18, 2015: Springer, pp. 234-241.
[11] V. Badrinarayanan, A. Kendall, and R. Cipolla, "Segnet: A deep convolutional encoder-decoder architecture for image segmentation," IEEE transactions on pattern analysis and machine intelligence, vol. 39, no. 12, pp. 2481-2495, 2017.
[12] F. I. Diakogiannis, F. Waldner, P. Caccetta, and C. Wu, "ResUNet-a: A deep learning framework for semantic segmentation of remotely sensed data," ISPRS Journal of Photogrammetry and Remote Sensing, vol. 162, pp. 94-114, 2020.
[13] K. Yue, L. Yang, R. Li, W. Hu, F. Zhang, and W. Li, "TreeUNet: Adaptive tree convolutional neural networks for subdecimeter aerial image segmentation," ISPRS Journal of Photogrammetry and Remote Sensing, vol. 156, pp. 1-13, 2019.
[14] Z. Zhou, M. M. Rahman Siddiquee, N. Tajbakhsh, and J. Liang, "Unet++: A nested u-net architecture for medical image segmentation," in Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support: 4th International Workshop, DLMIA 2018, and 8th International Workshop, ML-CDS 2018, Held in Conjunction with MICCAI 2018, Granada, Spain, September 20, 2018, Proceedings 4, 2018: Springer, pp. 3-11.
[15] Q. Liu, M. Kampffmeyer, R. Jenssen, and A.-B. Salberg, "Dense dilated convolutions’ merging network for land cover classification," IEEE Transactions on Geoscience and Remote Sensing, vol. 58, no. 9, pp. 6309-6320, 2020.
[16] W. Zhao, S. Du, Q. Wang, and W. J. Emery, "Contextually guided very-high-resolution imagery classification with semantic segments," ISPRS journal of Photogrammetry and Remote Sensing, vol. 132, pp. 48-60, 2017.
[17] D. Marmanis, K. Schindler, J. D. Wegner, S. Galliani, M. Datcu, and U. Stilla, "Classification with an edge: Improving semantic image segmentation with boundary detection," ISPRS Journal of Photogrammetry and Remote Sensing, vol. 135, pp. 158-172, 2018.
[18] K. Nogueira, M. Dalla Mura, J. Chanussot, W. R. Schwartz, and J. A. Dos Santos, "Dynamic multicontext segmentation of remote sensing images based on convolutional networks," IEEE Transactions on Geoscience and Remote Sensing, vol. 57, no. 10, pp. 7503-7520, 2019.
[19] J. Sherrah, "Fully convolutional networks for dense semantic labelling of high-resolution aerial imagery," arXiv preprint arXiv:1606.02585, 2016.
[20] L. Wang, R. Li, C. Duan, C. Zhang, X. Meng, and S. Fang, "A novel transformer based semantic segmentation scheme for fine-resolution remote sensing images," IEEE Geoscience and Remote Sensing Letters, vol. 19, pp. 1-5, 2022.
[21] J. Fu et al., "Dual attention network for scene segmentation," in Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2019, pp. 3146-3154.
[22] Z. Huang, X. Wang, L. Huang, C. Huang, Y. Wei, and W. Liu, "Ccnet: Criss-cross attention for semantic segmentation," in Proceedings of the IEEE/CVF international conference on computer vision, 2019, pp. 603-612.
[23] Y. Yuan, X. Chen, and J. Wang, "Object-contextual representations for semantic segmentation," in Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part VI 16, 2020: Springer, pp. 173-190.
[24] X. Yang et al., "An attention-fused network for semantic segmentation of very-high-resolution remote sensing imagery," ISPRS Journal of Photogrammetry and Remote Sensing, vol. 177, pp. 238-262, 2021.
[25] H. Li, K. Qiu, L. Chen, X. Mei, L. Hong, and C. Tao, "SCAttNet: Semantic segmentation network with spatial and channel attention mechanism for high-resolution remote sensing images," IEEE Geoscience and Remote Sensing Letters, vol. 18, no. 5, pp. 905-909, 2020.
[26] A. Vaswani et al., "Attention is all you need," Advances in neural information processing systems, vol. 30, 2017.
[27] N. He, X. Qu, Z. Yang, L. Xu, and F. Gurkalo, "Disaster Mechanism and Evolution Characteristics of Landslide–Debris-Flow Geohazard Chain Due to Strong Earthquake—A Case Study of Niumian Gully," Water, vol. 15, no. 6, p. 1218, 2023.
[28] A. Dosovitskiy et al., "An image is worth 16x16 words: Transformers for image recognition at scale," arXiv preprint arXiv:2010.11929, 2020.
[29] X. Zhu, W. Su, L. Lu, B. Li, X. Wang, and J. Dai, "Deformable detr: Deformable transformers for end-to-end object detection," arXiv preprint arXiv:2010.04159, 2020.
[30] S. Zheng et al., "Rethinking semantic segmentation from a sequence-to-sequence perspective with transformers," in Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2021, pp. 6881-6890.
[31] Y. Bazi, L. Bashmal, M. M. A. Rahhal, R. A. Dayil, and N. A. Ajlan, "Vision transformers for remote sensing image classification," Remote Sensing, vol. 13, no. 3, p. 516, 2021.
[32] D. Hong et al., "SpectralFormer: Rethinking hyperspectral image classification with transformers," IEEE Transactions on Geoscience and Remote Sensing, vol. 60, pp. 1-15, 2021.
[33] R. Li, S. Zheng, C. Duan, J. Su, and C. Zhang, "Multistage attention ResU-Net for semantic segmentation of fine-resolution remote sensing images," IEEE Geoscience and Remote Sensing Letters, vol. 19, pp. 1-5, 2021.
[34] H. Chen, Z. Qi, and Z. Shi, "Remote sensing image change detection with transformers," IEEE Transactions on Geoscience and Remote Sensing, vol. 60, pp. 1-14, 2021.
[35] L. Wang, R. Li, D. Wang, C. Duan, T. Wang, and X. Meng, "Transformer meets convolution: A bilateral awareness network for semantic segmentation of very fine resolution urban scene images," Remote Sensing, vol. 13, no. 16, p. 3065, 2021.
[36] E. Xie, W. Wang, Z. Yu, A. Anandkumar, J. M. Alvarez, and P. Luo, "SegFormer: Simple and efficient design for semantic segmentation with transformers," Advances in neural information processing systems, vol. 34, pp. 12077-12090, 2021.
[37] H. Cao et al., "Swin-unet: Unet-like pure transformer for medical image segmentation," in European conference on computer vision, 2022: Springer, pp. 205-218.
[38] J. Chen et al., "Transunet: Transformers make strong encoders for medical image segmentation," arXiv preprint arXiv:2102.04306, 2021.
[39] Z. Liu et al., "Swin transformer: Hierarchical vision transformer using shifted windows," in Proceedings of the IEEE/CVF international conference on computer vision, 2021, pp. 10012-10022.
[40] T. Panboonyuen, K. Jitkajornwanich, S. Lawawirojwong, P. Srestasathiern, and P. Vateekul, "Transformer-based decoder designs for semantic segmentation on remotely sensed images," Remote Sensing, vol. 13, no. 24, p. 5100, 2021.
[41] J. Naidoo, N. Bates, T. Gee, and M. Nejati, "Pallet Detection from Synthetic Data Using Game Engines," arXiv preprint arXiv:2304.03602, 2023.
[42] H. Zhao, J. Shi, X. Qi, X. Wang, and J. Jia, "Pyramid scene parsing network," in Proceedings of the IEEE conference on computer vision and pattern recognition, 2017, pp. 2881-2890.
[43] X. Tang, Z. Tu, Y. Wang, M. Liu, D. Li, and X. Fan, "Automatic detection of coseismic landslides using a new transformer method," Remote Sensing, vol. 14, no. 12, p. 2884, 2022.
[44] K. He, X. Zhang, S. Ren, and J. Sun, "Deep residual learning for image recognition," in Proceedings of the IEEE conference on computer vision and pattern recognition, 2016, pp. 770-778.
[45] X. Wang, R. Girshick, A. Gupta, and K. He, "Non-local neural networks," in Proceedings of the IEEE conference on computer vision and pattern recognition, 2018, pp. 7794-7803.
[46] S. Ioffe and C. Szegedy, "Batch normalization: Accelerating deep network training by reducing internal covariate shift," in International conference on machine learning, 2015: pmlr, pp. 448-456.
[47] L.-C. Chen, Y. Zhu, G. Papandreou, F. Schroff, and H. Adam, "Encoder-decoder with atrous separable convolution for semantic image segmentation," in Proceedings of the European conference on computer vision (ECCV), 2018, pp. 801-818. |