References
[1] Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, “Gradient-based learning applied to document recognition,” Proceedings of the IEEE, 86(11), pp. 2278-2324, 1998.
[2] A. Krizhevsky, I. Sutskever, and G. E. Hinton, “ImageNet classification with deep convolutional neural networks,” in Advances in Neural Information Processing Systems, pp. 1097-1105, 2012.
[3] ImageNet Large Scale Visual Recognition Competition: http://www.image-net.org/challenges/LSVRC/
[4] K. Simonyan and A. Zisserman, “Very deep convolutional networks for large-scale image recognition,” in International Conference on Learning Representations (ICLR), 2015.
[5] C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, and A. Rabinovich, “Going deeper with convolutions,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1-9, 2015.
[6] 華文戒菸網 (Chinese-language smoking cessation website), Tobacco Hazards Prevention Act: https://www.e-quit.org/CustomPage/HtmlEditorPage.aspx?MId=242&ML=3
[7] H. Kuehne, H. Jhuang, E. Garrote, T. Poggio, and T. Serre, “HMDB: a large video database for human motion recognition,” in 2011 International Conference on Computer Vision, pp. 2556-2563, IEEE, 2011.
[8] F. Caba Heilbron, V. Escorcia, B. Ghanem, and J. Carlos Niebles, “ActivityNet: A large-scale video benchmark for human activity understanding,” in Computer Vision and Pattern Recognition (CVPR), pp. 961-970, 2015.
[9] C. Gu, C. Sun, D. A. Ross, C. Vondrick, C. Pantofaru, Y. Li, ... and C. Schmid, “AVA: A video dataset of spatio-temporally localized atomic visual actions,” arXiv preprint arXiv:1705.08421, 2017.
[10] Y. Jia, et al., “Caffe: Convolutional architecture for fast feature embedding,” ACM International Conference on Multimedia, 2014.
[11] H. Wang and C. Schmid, “Action recognition with improved trajectories,” in IEEE International Conference on Computer Vision (ICCV), pp. 3551-3558, 2013.
[12] H. Wang, A. Kläser, C. Schmid, and C.-L. Liu, “Dense trajectories and motion boundary descriptors for action recognition,” International Journal of Computer Vision, 103(1), pp. 60-79, 2013.
[13] K. Simonyan and A. Zisserman, “Two-stream convolutional networks for action recognition in videos,” Advances in Neural Information Processing Systems, pp. 568-576, 2014.
[14] D. Tran, L. Bourdev, R. Fergus, L. Torresani, and M. Paluri, “Learning spatiotemporal features with 3D convolutional networks,” in IEEE International Conference on Computer Vision (ICCV), pp. 4489-4497, 2015.
[15] L. Wang, Y. Xiong, Z. Wang, Y. Qiao, D. Lin, X. Tang, and L. Van Gool, “Temporal segment networks: Towards good practices for deep action recognition,” in European Conference on Computer Vision, pp. 20-36, 2016.
[16] OpenCV: Open Source Computer Vision Library, https://opencv.org/
[17] K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for image recognition,” Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770-778, 2016.
[18] R. Girshick, J. Donahue, T. Darrell, and J. Malik, “Rich feature hierarchies for accurate object detection and semantic segmentation,” Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 580-587, 2014.
[19] S. Ren, K. He, R. Girshick, and J. Sun, “Faster R-CNN: Towards real-time object detection with region proposal networks,” Advances in Neural Information Processing Systems, pp. 91-99, 2015.
[20] A. Karpathy, G. Toderici, S. Shetty, T. Leung, R. Sukthankar, and L. Fei-Fei, “Large-scale video classification with convolutional neural networks,” Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1725-1732, 2014.
[21] C. Feichtenhofer, A. Pinz, and A. Zisserman, “Convolutional two-stream network fusion for video action recognition,” Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016.
[22] C. Szegedy, V. Vanhoucke, S. Ioffe, J. Shlens, and Z. Wojna, “Rethinking the Inception architecture for computer vision,” Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016.
[23] W. S. McCulloch and W. Pitts, “A Logical Calculus of the Ideas Immanent in Nervous Activity,” Bulletin of Mathematical Biophysics, vol. 5, no. 4, pp. 115-133, Dec. 1943.
[24] D. O. Hebb, “The Organization of Behavior,” New York: Wiley & Sons, 1949.
[25] F. Rosenblatt, “The perceptron: a probabilistic model for information storage and organization in the brain,” Psychological Review, 65(6), p. 386, 1958.
[26] M. Minsky and S. Papert, “Perceptrons,” Cambridge, MA: MIT Press, 1969.
[27] D. E. Rumelhart, G. E. Hinton, and R. J. Williams, “Learning internal representations by error propagation,” Tech. Rep. ICS-8506, Institute for Cognitive Science, University of California, San Diego, La Jolla, 1985.
[28] D. E. Rumelhart, G. E. Hinton, and R. J. Williams, “Learning representations by back-propagating errors,” Nature, vol. 323, pp. 533–536, Oct. 1986.
[29] G. E. Hinton, S. Osindero, and Y. W. Teh, “A fast learning algorithm for deep belief nets,” Neural Computation, 18(7), pp. 1527-1554, 2006.
[30] S. Ioffe and C. Szegedy, “Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift,” in International Conference on Machine Learning, pp. 448-456, 2015.
[31] N. Tajbakhsh, J. Y. Shin, S. R. Gurudu, R. T. Hurst, C. B. Kendall, M. B. Gotway, and J. Liang, “Convolutional neural networks for medical image analysis: Full training or fine tuning?,” IEEE Transactions on Medical Imaging, 35(5), pp. 1299-1312, 2016.
[32] M. Lin, Q. Chen, and S. Yan, “Network in network,” arXiv preprint arXiv:1312.4400, 2013.