參考文獻 |
[1] T. Sakai, M. Nagao and Takeo Kanade, “Computer Analysis and Classification of Photographs of Human Faces”, Proceedings of Proc. First USA-JAPAN Computer Conference, pp. 55-62, January, 1972
[2] N. Dalal and B. Triggs, “Histograms of Oriented Gradients for Human Detection,” IEEE Conf. Computer Vision and Pattern Recognition, San Diego, CA, USA, June 2005
[3] DG. Lowe.: “Object Recognition from Local Scale-Invariant Features.” Proceedings of the International Conference on Computer Vision, Kerkyra, Corfu, Greece, September 20-25, 1999. pp.1150–1157
[4] R. Girshick, J. Donahue, T. Darrell, and J. Malik. “Rich feature hierarchies for accurate object detection and semantic segmentation.” In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2014
[5] R. B. Girshick, "Fast R-CNN," In International Conference on Computer Vision, 2015.
[6] W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C.-Y. Fu, and A. C. Berg. “SSD: Single shot multibox detector.” In ECCV, pages 21–37, 2016.
[7] J. Redmon, S. Divvala, R. Girshick, and A. Farhadi. “You Only Look Once: Unified, real-time object detection.” In IEEE Conference Computer Vision and Pattern Recognition (CVPR), 2016.
[8] H. Li, Z. Lin, X. Shen, J. Brandt, and G. Hua. “A convolutional neural network cascade for face detection.” In CVPR, pages 5325–5334, 2015.
[9] K. Zhang, Z. Zhang, Z. Li, and Y. Qiao, "Joint face detection and alignment using multitask cascaded convolutional networks," IEEE Signal Processing Letters, vol.23, no.10, pp.1499-1503, 2016.
[10] J. Deng, J. Guo, Y. Zhou, J. Yu, I. Kotsia, and S. Zafeiriou. “Retinaface: Single-stage dense face localisation in the wild.” arXiv preprint arXiv:1905.00641, 2019
[11] S. Yang, P. Luo, C. C. Loy and X. Tang, "WIDER FACE: A Face Detection Benchmark," 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, 2016, pp. 5525-5533, doi: 10.1109/CVPR.2016.596.
[12] T.-Y. Lin, P. Goyal, R. Girshick, K. He, and P. Dollar. “Focal loss for dense object detection.” In ICCV, 2017
[13] Y. Taigman, M. Yang, M. Ranzato, and L. Wolf. “Deepface: Closing the gap to human-level performance in face verification.” In Conference on Computer Vision and Pattern Recognition, 2014
[14] G. B. Huang, M. Ramesh, T. Berg, and E. L. Miller. “Labeled Faces in the Wild: A Database for Studying Face Recognition in Unconstrained Environments.” TR of University of Massachusetts, Amherst, Oct, 2007.
[15] F. Schroff, D. Kalenichenko and J. Philbin, "FaceNet: A unified embedding for face recognition and clustering," 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, 2015, pp. 815-823, doi: 10.1109/CVPR.2015.7298682.
[16] A. Dadashzadeh, A. T. Targhi, M. Tahmasbi, M. Mirmehdi, “HGR-Net: A Fusion Network for Hand Gesture Segmentation and Recognition,” arXiv:1806.05653, 2018.
[17] Ruder, S.: An overview of multi-task learning in deep neural networks. arXiv preprint arXiv:1706.05098 (2017)
[18] R. Ranjan, S. Sankaranarayanan, C. D. Castillo, and R. Chellappa, “An all-in-one convolutional neural network for face analysis,” in Automatic Face & Gesture Recognition (FG 2017), 2017 12th IEEE International Conference on. IEEE, 2017, pp. 17–24.
[19] Z. Liao, P. Zhou, Q. Wu and B. Ni, "Uniface: A Unified Network for Face Detection and Recognition," 2018 24th International Conference on Pattern Recognition (ICPR), Beijing, 2018, pp. 3531-3536, doi: 10.1109/ICPR.2018.8545051.
[20] J. Long, E. Shelhamer, and T. Darrell. “Fully convolutional networks for semantic segmentation,” In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 3431– 3440, 2015.
[21] L.-C. Chen, G. Papandreou, F. Schroff, and H. Adam. “Rethinking atrous convolution for semantic image segmentation,” arXiv:1706.05587, 2017.
[22] L.-C. Chen, Y. Zhu, G. Papandreou, F. Schroff, and H. Adam, “Encoder-decoder with atrous separable convolution for semantic image segmentation,” arXiv:1802.02611, 2018.
[23] C. Szegedy, V. Vanhoucke, S. Ioffe, J. Shlens, and Z. Wojna, “Rethinking the inception architecture for computer vision,” in CVPR, 2016, pp. 2818–2826.
[24] V. Jain and E. Learned-Miller. “FDDB: a benchmark for face detection in unconstrained settings.” Technical Report UMCS-2010-009, University of Massachusetts, Amherst, 2010
[25] Q. Cao, L. Shen, W. Xie, O. M. Parkhi and A. Zisserman, "VGGFace2: A Dataset for Recognising Faces across Pose and Age," 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018), Xi′an, 2018, pp. 67-74, doi: 10.1109/FG.2018.00020.
[26] D. Yi, Z. Lei, S. Liao, and S. Z. Li. Learning face representation from scratch. arXiv preprint arXiv:1411.7923, 2014.
[27] V. Badrinarayanan, A. Kendall and R. Cipolla, "SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation," in IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 39, no. 12, pp. 2481-2495, 1 Dec. 2017, doi: 10.1109/TPAMI.2016.2644615.
[28] K. He, X. Zhang, S. Ren and J. Sun, "Deep Residual Learning for Image Recognition," 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, 2016, pp. 770-778, doi: 10.1109/CVPR.2016.90.
[29] S. Zhang, X. Zhu, Z. Lei, H. Shi, X. Wang and S. Z. Li, "S^3FD: Single Shot Scale-Invariant Face Detector," 2017 IEEE International Conference on Computer Vision (ICCV), Venice, 2017, pp. 192-201, doi: 10.1109/ICCV.2017.30. |