參考文獻 |
[1] J. Redmon, S. Divvala, R. Girshick and A. Farhadi, "You Only Look Once: Unified, Real-Time Object Detection," Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Jun. 2016, pp. 779-788, doi: 10.1109/CVPR.2016.91.
[2] R. Girshick, "Fast R-CNN," Proceedings of the IEEE International Conference on Computer Vision (ICCV), Dec. 2015, pp. 1440-1448, doi: 10.1109/ICCV.2015.169.
[3] X. Zhou, D. Wang and P. Krähenbühl, "Objects as Points," arXiv:1904.07850 [cs], Apr. 2019, Accessed: Apr. 20, 2021. [Online]. Available: https://arxiv.org/abs/1904.07850.
[4] A. Rosebrock, "Simple object tracking with OpenCV," PyImageSearch. Accessed: May 5, 2021. [Online]. Available: https://www.pyimagesearch.com/2018/07/23/simple-object-tracking-with-opencv/.
[5] A. Bewley, Z. Ge, L. Ott, F. Ramos and B. Upcroft, "Simple Online and Realtime Tracking," IEEE International Conference on Image Processing (ICIP), pp. 3464-3468, 2016, doi: 10.1109/ICIP.2016.7533003.
[6] S. Ren, K. He, R. Girshick and J. Sun, "Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks," Advances in Neural Information Processing Systems (NIPS), 2015.
[7] N. Wojke, A. Bewley and D. Paulus, "Simple Online and Realtime Tracking with a Deep Association Metric," IEEE International Conference on Image Processing (ICIP), pp. 3645-3649, 2017, doi: 10.1109/ICIP.2017.8296962.
[8] Z. Wang, L. Zheng, Y. Liu, Y. Li and S. Wang, "Towards Real-Time Multi-Object Tracking," European Conference on Computer Vision (ECCV), pp. 107-122, Nov. 2020.
[9] Y. Zhang, C. Wang, X. Wang, W. Zeng and W. Liu, "FairMOT: On the Fairness of Detection and Re-Identification in Multiple Object Tracking," arXiv:2004.01888 [cs], Apr. 2020, Accessed: Apr. 20, 2021. [Online]. Available: https://arxiv.org/abs/2004.01888.
[10] C. Liang, Z. Zhang, Y. Lu, X. Zhou, B. Li, X. Ye and J. Zou, "Rethinking the competition between detection and ReID in Multi-Object Tracking," arXiv:2010.12138 [cs], Oct. 2020, Accessed: Apr. 20, 2021. [Online]. Available: https://arxiv.org/abs/2010.12138.
[11] S. Woo, J. Park, J. Lee and I. S. Kweon, "CBAM: Convolutional Block Attention Module," Proceedings of the European Conference on Computer Vision (ECCV), Sep. 2018.
[12] Y. Lecun, L. Bottou, Y.Bengio and P.Haffner, "Gradient-Based Applied to Document Recognition," Proceedings of the IEEE, vol. 86, no. 11, pp. 2278-2234, Nov. 1998, doi: 10.1109/5.726791.
[13] A. Krizhevky, I. Sutskever and G. E. Hinton, "ImageNet Classification with Deep Convolutional Neural Networks," Advances in Neural Information Processing System 25(NIPS 2012), pp. 1097-1105, 2012.
[14] M. D. Zeiler and R. Fergus, "Visualizing and Understanding Convolutional Networks," arXiv:1311.2901 [cs], Nov. 2013, Accessed: Apr. 20, 2021. [Online]. Available: https://arxiv.org/abs/1311.2901.
[15] K. Simonyan and A. Zisserman, "Very Deep Convolutional Networks for Large-Scale Image Recognition," arXiv:1409.1556 [cs], Sep. 2014, Accessed: Apr. 20, 2021. [Online]. Available: https://arxiv.org/abs/1409.1556.
[16] C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke and A. Rabinovich, "Going Deeper with Convolutions," Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Jun. 2015, pp. 1-9, doi: 10.1109/CVPR.2015.7298594.
[17] K. He, X. Zhang, S. Ren and J. Sun, "Deep Residual Learning for Image Recognition," Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Jun. 2016, pp. 770-778, doi: 10.1109/CVPR.2016.90.
[18] S. Ioffe and C. Szegedy, "Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift," arXiv:1502.03167v3 [cs], Feb. 2015, Accessed: Apr. 25, 2021. [Online]. Available: https://arxiv.org/abs/1502.03167.
[19] X. Glorot, A. Bordes and Y. Bengio, "Deep Sparse Rectifier Neural Networks," in Proceedings of the International Conference on Artificial Intelligence and Statistics (AISTATS), vol. 15, 2011.
[20] G. Huang, Z. Liu, L. van der Maaten and K. Q. Weinberger, "Densely Connected Convolutional Networks," Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Jul. 2017, pp. 2261-2269, doi: 10.1109/CVPR.2017.243.
[21] J. Hu, L. Shen, S. Albanie, G. Sun and E. Wu, "Squeeze-and-Excitation Networks," Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Jun. 2018, pp. 7132-7141, doi: 10.1109/CVPR.2018.00745.
[22] J. Long, E. Shelhamer and T. Darrell, "Fully Convolutional Networks for Semantic Segmentation," Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Jun. 2015, pp. 3431-3440, doi: 10.1109/CVPR.2015.7298965.
[23] T. Lin, P. Dollár, R. Girshick, K. He, B. Hariharan and S. Belongie, "Feature Pyramid Networks for Object Detection," Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Jul. 2017, pp. 936-944, doi: 10.1109/CVPR.2017.106.
[24] F. Yu and V. Koltun, "Multi-Scale Context Aggregation by Dilated Convolutions," International Conference on Learning Representations (ICLR), 2016.
[25] J. Dai, H. Qi, Y. Xiong, Y. Li, G. Zhang, H. Hu and Y. Wei, "Deformable Convolutional Networks," Proceedings of the IEEE International Conference on Computer Vision (ICCV), Oct. 2017, pp. 764-773, doi: 10.1109/ICCV.2017.89.
[26] F. Yu, D. Wang, E. Shelhamer and T. Darrell, "Deep Layer Aggregation," Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Jun. 2018, pp. 2403-2412, doi: 10.1109/CVPR.2018.00255.
[27] J. Redmon and A. Farhadi, "YOLO9000: Better, Faster, Stronger," Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Jul. 2017, pp. 6517-6525, doi: 10.1109/CVPR.2017.690.
[28] J. Deng, W. Dong, R. Socher, L. Li, K. Li and L. Fei-Fei, "ImageNet: A large-scale hierarchical image database," 2009 IEEE Conference on Computer Vision and Pattern Recognition, Jun. 2009, pp. 248-255, doi: 10.1109/CVPR.2009.5206848.
[29] J. Redmon and A. Farhadi, "YOLOv3: An Incremental Improvement," arXiv:1804.02767 [cs], Apr. 2018, Accessed: Apr. 25, 2021. [Online]. Available: https://arxiv.org/abs/1804.02767.
[30] R. Girshick, J. Donahue, T. Darrell and J. Malik, "Rich feature hierarchies for accurate object detection and semantic segmentation," Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Jun. 2014, pp. 580-587, doi: 10.1109/CVPR.2014.81.
[31] T. Lin, P. Goyal, R. Girshick, K. He and P. Dollár, "Focal Loss for Dense Object Detection," Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Oct. 2017, pp. 2999-3007, doi: 10.1109/ICCV.2017.324.
[32] A. Newell, K. Yang and J. Deng, "Stacked Hourglass Network for Human Pose Estimation," arXiv:1603.06937 [cs], Mar. 2016, Accessed: Apr. 20, 2021. [Online]. Available: https://arxiv.org/abs/1603.06937.
[33] R. Kalman, "A New Approach to Linear Filtering and Prediction Problems," Journal of Basic Engineering, vol. 82, no. Series D, pp. 35-45, 1960.
[34] H. W. Kuhn, "The Hungarian method for the assignment problem," Naval Research Logistics Quarterly, vol. 2, pp. 83-97, 1955.
[35] F. Pickett and T. O′Neal, "Zynq-7000," Xilinx Wiki. Accessed: May 10, 2021. [Online]. Available: https://xilinx-wiki.atlassian.net/wiki/spaces/A/pages/189530183/Zynq-7000.
[36] "Vivado Simulation Flow," Xilinx. Accessed: May 10, 2021. [Online]. Available: https://www.xilinx.com/products/design-tools/vivado/simulation.html.
[37] "PetaLinux Tools," Xilinx. Accessed: May 10, 2021. [Online]. Available: https://www.xilinx.com/products/design-tools/embedded-software/petalinux-sdk.html.
[38] T. Lin, M. Maire, S. Belongie, L. Bourdev, R. Girshick, J. Hays, P. Perona, D. Ramanan, C. L. Zitnick and P. Dollár, "Microsoft COCO: Common Objects in Context," arXiv:1405.0312 [cs], May 2014, Accessed: Apr. 27, 2021. [Online]. Available: https://arxiv.org/abs/1405.0312.
[39] S. Shao, Z. Zhao, B. Li, T. Xiao, G. Yu, X. Zhang and J. Sun, "Crowdhuman: A Benchmark for Detecting Human in a Crowd," arXiv:1805.00123 [cs], Apr. 2018, Accessed: Apr. 27, 2021. [Online]. Available: https://arxiv.org/abs/1805.00123.
[40] A. Kendall, Y. Gal and R. Cipolla, "Multi-Task Learning Using Uncertainty to Weigh Losses for Scene Geometry and Semantics," Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Jun. 2018, pp. 7482-7491, doi: 10.1109/CVPR.2018.00781.
[41] Szymon Migacz, "8-bit Inference with TensorRT," NVIDIA, 8 May 2017. Accessed: May 10, 2021. [Online]. Available: https://on-demand.gputechconf.com/gtc/2017/presentation/s7310-8-bit-inference-with-tensorrt.pdf.
[42] L. Leal-Taixé, A. Milan, I. Reid, S. Roth and K. Schindler, "MOTChallenge 2015: Towards a Benchmark for Multi-Target Tracking," arXiv:1504.01942 [cs], Apr. 2015, Accessed: Apr. 27, 2021. [Online]. Available: https://arxiv.org/abs/1504.01942.
[43] A. Milan, L. Leal-Taxie, I. Reid, S. Roth and K. Schindler, "MOT16: A Benchmark for Multi-Object Tracking," arXiv:1603.00831 [cs], Mar. 2016, Accessed: Apr. 27, 2021. [Online]. Available: https://arxiv.org/abs/1603.00831.
[44] P. Dendorfer, H. Rezatofighi, A. Milan, J. Shi, D. Cremers, I. Reid, S. Roth, K. Schindler and L. Leal-Taixé, "MOT20: A benchmark for multi object tracking in crowded scenes," arXiv:2003.09003 [cs], Mar. 2020, Accessed: Apr. 27, 2021. [Online]. Available: https://arxiv.org/abs/2003.09003.
[45] P. Dollar, C. Wojek, B. Schiele and P. Perona, "Pedestrian Detection: An Evaluation of the State of the Art," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 34, no. 4, pp. 743-761, Apr. 2012, doi: 10.1109/TPAMI.2011.155.
[46] S. Zhang, R. Benenson, B. Schiele, "CityPersons: A Diverse Dataset for Pedestrian Detection," Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Jul. 2017, pp. 4457-4465, doi: 10.1109/CVPR.2017.474.
[47] T. Xiao, S. Li, B. Wang, L. Lin and X. Wang, "End-to-End Deep Learning for Person Search," arXiv:1604.01850v1 [cs], Apr. 2016, Accessed: Apr. 27, 2021. [Online]. Available: https://arxiv.org/abs/1604.01850v1.
[48] L. Zheng, H. Zhang, S. Sun, M. Chandraker, Y. Yang and Q. Tian, "Person Re-identification in the Wild," Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Jul. 2017, pp. 3346-3355, doi: 10.1109/CVPR.2017.357.
[49] A. Ess and B. Leibe and K. Schindler and and L. van Gool, "A Mobile Vision System for Robust Multi-Person Tracking," IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Jun. 2008, pp. 1-8, doi: 10.1109/CVPR.2008.4587581. |