參考文獻 |
[1]Y. Huang, X. Liu, X. Zhang and L. Jin, "A pointing gesture based egocentric interaction system: dataset, approach and application," 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Las Vegas, NV, 2016, pp. 370-377, doi: 10.1109/CVPRW.2016.53.
[2]林士筆,基於RGB無深度影像之中文空中手寫辨識,國立中央大學資訊工程學系碩士論文,2019。
[3]T. Simon et al., "Hand keypoint detection in single images using multiview bootstrapping," Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017.
[4]S. E. Wei, V. Ramakrishna, T. Kanade and Y. Sheikh, "Convolutional pose machines," In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4724-4732, 2016.
[5]W. W. Mayol, A. J. Davison, B. J. Tordoff, N. D. Molton and D. W. Murray, "Interaction between hand and wearable camera in 2D and 3D environments," In Proc. British Machine Vision Conference, 2004.
[6]T. Kurata, T. Okuma, M. Kourogi and K. Sakaue, "The hand mouse: GMM hand-color classification and mean shift tracking," In Proceedings IEEE ICCV Workshop on Recognition, Analysis, and Tracking of Faces and Gestures in Real-Time Systems, pp. 119-124, IEEE, 2001.
[7]N. Wang, J. P. Shi, D. Y. Yeung and J. Jia, "Understanding and diagnosing visual tracking systems," In Proceedings of the IEEE International Conference on Computer Vision, pp. 3101-3109, 2015.
[8]J. F. Henriques, R. Caseiro, P. Martins and J. Batista, "High-speed tracking with kernelized correlation filters," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 37, no. 3, pp. 583-596, 2014.
[9]Z. Kalal, K. Mikolajczyk and J. Matas, "Tracking-Learning-Detection," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 34, no. 7, pp. 1409-1422, 2012.
[10]J. Tompson, M. Stein, Y. Lecun and K. Perlin, "Real-time continuous pose recovery of human hands using convolutional networks," ACM Transactions on Graphics (ToG), vol. 33, no. 5, pp. 1-10, 2014.
[11]L. Baraldi, F. Paci, G. Serra, L. Benini and R. Cucchiara, "Gesture recognition in ego-centric videos using dense trajectories and hand segmentation," In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 688-693, 2014.
[12]J. S. Supancic, G. Rogez, Y. Yang, J. Shotton and D. Ramanan, "Depth-based hand pose estimation: data, methods, and challenges," In Proceedings of the IEEE International Conference on Computer Vision, pp. 1868-1876, 2015.
[13]鄒佩珊,空中手寫中文字辨識,國立中央大學資訊工程學系碩士論文,2018。
[14]Y. Wu, J. W. Lim and M. H. Yang, "Online object tracking: A benchmark," In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2411-2418, 2013.
[15]A. Dutta and A. Zisserman, “The VIA annotation software for images, audio and video,” In Proceedings of the 27th ACM International Conference on Multimedia (MM ’19), October 21–25, 2019, Nice, France. ACM, New York, NY, USA, 4 pages, https://doi.org/10.1145/3343031.3350535.
[16]C. Li and K. M. Kitani, "Model recommendation with virtual probes for egocentric hand detection," In Proceedings of the IEEE International Conference on Computer Vision, pp. 2624-2631, 2013.
[17]C. Li and K. M. Kitani, "Pixel-level hand detection in ego-centric videos," In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3570-3577, 2013.
[18]S. Bambach, S. Lee, D. J. Crandall and C. Yu, "Lending a hand: Detecting hands and recognizing activities in complex egocentric interactions," In Proceedings of the IEEE International Conference on Computer Vision, pp. 1949-1957, 2015.
[19]A. Betancourt, P. Morerio, L. Marcenaro, M. Rauterberg and C. Regazzoni, "Filtering SVM frame-by-frame binary classification in a detection framework," In 2015 IEEE International Conference on Image Processing (ICIP), pp. 2552-2556, IEEE, 2015.
[20]W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C. Y. Fu and A. C. Berg, "Ssd: Single shot multibox detector." In European Conference on Computer Vision, pp. 21-37, Springer, Cham, 2016.
[21]J. Redmon, S. Divvala, R. Girshick and A. Farhadi, "You only look once: Unified, real-time object detection," In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779-788, 2016.
[22]S. Karen and A. Zisserman, "Very deep convolutional networks for large-scale image recognition," arXiv preprint arXiv:1409.1556, 2014.
[23]M. Andriluka, L. Pishchulin, P. Gehler and B. Schiele, "2d human pose estimation: New benchmark and state of the art analysis," In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3686-3693, 2014.
[24]S. Johnson and M. Everingham, "Learning effective human pose estimation from inaccurate annotation," In CVPR 2011, pp. 1465-1472, IEEE, 2011.
[25]B. Sapp and B. Taskar, "Modec: Multimodal decomposable models for human pose estimation," In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3674-3681, 2013.
[26]Hand Keypoint Dataset. [Accessed: 08-Apr-2018]. Available from: http://domedb.perception.cs.cmu.edu/handdb.html.
[27]Z. Cao, G. Hidalgo, T. Simon, S. E. Wei and Y. Sheikh, "OpenPose: realtime multi-person 2D pose estimation using Part Affinity Fields," arXiv preprint arXiv:1812.08008, 2018.
[28]R. Girshick, J. Donahue, T. Darrell and J. Malik, "Rich feature hierarchies for accurate object detection and semantic segmentation," In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 580-587, 2014.
[29]R. Girshick, "Fast R-CNN," In Proceedings of the IEEE International Conference on Computer Vision, pp. 1440-1448, 2015.
[30]S. Ren, K. He, R. Girshick and J. Sun, "Faster R-CNN: Towards real-time object detection with region proposal networks," In Advances in Neural Information Processing Systems, pp. 91-99, 2015. |