參考文獻 |
[1] L. Jiao, F. Zhang, F. Liu, S. Yang, L. Li, Z. Feng, and R. Qu, “A Survey of Deep Learning-based Object Detection,” IEEE Access, Vol. 7, pp. 128837-128868, 2019.
[2] B. Meena, K. V. Rao, and S. Chittineni, “A Survey On Deep Learning Methods and Tools in Image Processing,” INTERNATIONAL JOURNAL OF SCIENTIFIC & TECHNOLOGY RESEARCH, Vol. 9, Issue 2, pp. 1057-1062, 2020
[3] A. Dosovitskiy, L. Beyer, A. Kolesnikov, D. Weissenborn, X. Zhai, T. Unterthiner, M. Dehghani, M. Minderer, G. Heigold, S. Gelly, J. Uszkoreit, and N. Houlsby, “An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale,” ICLR, 2021
[4] I. Tolstikhin, N. Houlsby, A. Kolesnikov, L. Beyer, X. Zhai, T. Unterthiner, J. Yung, A. Steiner, D. Keysers, J. Uszkoreit, M. Lucic, and A. Dosovitskiy, “MLP-Mixer: An all-MLP Architecture for Vision,” arXiv preprint arXiv:2105.01601, 2021.
[5] A. Kolesnikov, L. Beyer, X. Zhai, J. Puigcerver, J. Yung, S. Gelly, and N. Houlsby, “Big Transfer (BiT): General Visual Representation Learning,” ECCV, Vol. 5, pp. 491-507, 2020.
[6] A. Vaswani, P. Ramachandran, A. Srinivas, N. Parmar, B. Hechtman, and J. Shlens, “Scaling Local Self-Attention for Parameter Efficient Visual Backbones,” CVPR, pp. 12894-12904, 2021.
[7] A. Brock, S. De, S. L. Smith, and K. Simonyan, “High-Performance Large-Scale Image Recognition Without Normalization,” arXiv preprint arXiv:2102.06171, 2021.
[8] C.-H. Chen, C.-M. Kuo, C.-Y. Chen, and J.-H. Dai, "The Design and Synthesis Using Hierarchical Robotic Discrete-Event Modeling," Journal of Vibration and Control, vol. 19, pp. 1603-1613, 2013.
[9] Y. LeCun, Y. Bengio, and G. Hinton, “Deep Learning,” Nature, Vol. 521, No. 7553, pp. 436-444, 2015.
[10] Sonali, B. Maind, and P. Wankar, “Research Paper on Basic of Artificial Neural Network,” International Journal on Recent and Innovation Trends in Computing and Communication, Vol. 2, Issue. 1, pp. 96-101, 2014.
[11] W. Wang, Y. Yang, X. Wang, W. Wang, and J. Li, “Development of Convolutional Neural Network and Its Application in Image Classification: A Survey,” OPTICAL ENGINEERING, Vol.58, No. 4, Article ID 040901, 2019.
[12] D. H. Hubel, and T. N. Wiesel, “Receptive Fields, Binocular Interaction and Functional Architecture in the Cat′s Visual Cortex,” The Journal of Physiology, Vol. 160, No. 1, pp.106-154, 1962
[13] K. Fukushima, “Neocognitron: A Self-Organizing Neural Network Model for a Mechanism of Pattern Recognition Unaffected by Shift in Position,” Biological Cybernetics, Vol. 36, pp.193-202, 1980
[14] D. E. Rumelhart, G. E. Hinton, and R. J. Williams, “Learning Internal Representations by Error Propagation,” Parallel distributed processing: explorations in the microstructure of cognition, Vol. 1, pp.318-362, 1986
[15] Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, “Gradient-Based Learning Applied to Document Recognition,” Proceedings of the IEEE, Vol. 86, Issue. 11, pp.2278-2324, 1998
[16] G. E. Hinton, S. Osindero, and Y. W. Teh, “A Fast Learning Algorithm for Deep Belief Nets,” Neural Computation, Vol. 18, No. 7, pp.1527-1554, 2006
[17] A. Krizhevsky, I. Sutskever, and G. E. Hinton, “ImageNet Classification with Deep Convolutional Neural Networks,” Communications of the ACM, Vol. 60, Issue. 6, pp.84-90, 2017
[18] K. Simonyan, and A. Zisserman, “Very Deep Convolutional Networks for Large-Scale Image Recognition,” Proceedings of the IEEE, Vol. 86, Issue. 11, pp.2278-2324, 1998
[19] C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, and A. Rabinovich, “Going Deeper with Convolutions,” CVPR, pp.1-9, 2015
[20] A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, L. Kaiser, and I. Polosukhin, “Attention Is All You Need,” Advances in Neural Information Processing Systems, pp.6000-6010, 2017
[21] D. Hendrycks, and K. Gimpel, “Gaussian Error Linear Units (GELUs),” arXiv preprint arXiv:1606.08415, 2016
[22] F. Chollet, “Xception: Deep Learning With Depthwise Separable Convolutions,” CVPR, pp.1800-1807, 2017
[23] L. Sifre, “Rigid-Motion Scattering For Image Classification,” PhD thesis, Ecole Polytechnique, 2014
[24] J. L. Ba, J. R. Kiros, and G. E. Hinton, “Layer Normalization,” arXiv preprint arXiv:1607.06450, 2016
[25] R. Wightman, PyTorchImageModels(timm), 2020 [Online]. Available: https://github.com/rwightman/pytorch-image-models. |