參考文獻 |
[1] Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., and Rabinovich, A. "Going deeper with convolutions", 2014.
[2] Espinace, P., Kollar, T., Roy, N., Soto, A., "Indoor Scene Recognition Through Object Detection Using Adaptive Objects Search ", 2010.
[3] Bolei Zhou, Agata Lapedriza, Aditya Khosla, Aude Oliva, and Antonio Torralba, "Places: A 10 million Image Database for Scene Recognition", 2017.
[4] Shuang Bai 1 ·Zhaohong Li 1 ·Jianjun Hou, "Learning two-pathway convolutional neural networks for categorizing scene images", 2016
[5] Szummer M, Picard RW, “Indoor-outdoor image classification.”, 1998
[6] Quattoni A, Torralba A, “Recognizing indoor scenes”, 2009.
[7] Li L, Su H, Xing EP, Fei-Fei L , “Object bank: a high-level image representation for scene classification and semantic feature sparsification.”, 2010.
[8] Pandey M, Lazebnik S, “Scene recognition and weakly supervised object localization with deformable part-based models”, 2011.
[9] Singh AAES, Gupta A, “Unsupervised discovery of mid-level discriminative patches”, 2012.
[10] Sadeghi F, Tappen MF, “Latent pyramidal regions for recognizing scenes.”, 2012.
[11] Zhou B, Lapedriza A, Xiao J, Torralba A, Oliva A, “Learning deep features for scene recognition using places database”, 2014.
[12] Ranzato M, Susskind J, Mnih V, Hinton G, “On deep generative models with applications to recognition.”, 2011.
[13] Lowe DG, “Distinctive image features from scale-invariant keypoints.”, 2004.
[14] Dalal N, Triggs B, “Histograms of oriented gradients for human detection”, 2005.
[15] Bay H, Tuytelaars T, Gool LV, “Surf: speeded up robust features”, 2006.
[16] Krizhevsky A, Sutskever I, Hinton G, “Imagenet classification with deep convolutional neural networks”, 2012.
[17] R. Girshick, J. Donahue, T. Darrell, and J. Malik, “Rich feature hierarchies for accurate object detection and semantic segmentation”, 2014.
[18] R. B. Girshick, “Fast R-CNN”, 2015.
.
[19] Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun, "Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks", 2016.
[20] J. Redmon, S. Divvala, R. Girshick, and A. Farhadi., "You only look once: Unified, real-time object detection.", 2015.
[21] J. Redmon and A. Farhadi. "YOLO9000: Better, faster, stronger", InCVPR, 2017.
[22] J. Redmon and A. Farhadi., “Yolov3: An incremental improvement.”, 2018.
[23] Luis Herranz, Shuqiang Jiang, Xiangyang Li, “Scene recognition with CNNs: objects, scales and dataset bias”, 2016.
[24] Zhang L, Zhen X, Shao L, “Learning object-to-class kernels for scene classification.”, 2014.
[25] Long, J., Shelhamer, E., and Darrell, T., "Fully convolutional networks for semantic segmentation.", 2014.
[26] Kaiming He Georgia Gkioxari Piotr Doll ́ar Ross Girshick, "Mask R-CNN", 2018.
[27] Simonyan, K. & Zisserman, A., "Very deep convolutional networks for large-scale image recognition", 2014,
[28] S. Ioffe and C. Szegedy., “Batch normalization: Accelerating deep network training by reducing internal covariate shift”, 2015.
[29] Christian Szegedy, Vincent Vanhoucke, Sergey Ioffe, Jonathon Shlens, Zbigniew Wojna, “Rethinking the Inception Architecture for Computer Vision”, 2015.
[30] Christian Szegedy, Sergey Ioffe, Vincent Vanhoucke, Alex Alemi., “Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning”, 2016.
[31] Ruder, S. "An overview of gradient descent optimization algorithms", 2016.
[32] Ning Qian., “On the momentum term in gradient descent learning algorithms. Neural networks : the official journal of the International Neural Network Society”, 1999.
[33] Diederik P. Kingma, Jimmy Ba, "Adam: A Method for Stochastic Optimization", 2017.
[34] Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I. & Salakhutdinov, R., "Dropout: A Simple Way to Prevent Neural Networks from Overfitting", 2014.
[35] waleedka, "Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow", 2018, from https://github.com/matterport/Mask_RCNN.
|