摘要(英) |
Medical imaging technology, the most effective method for detecting cell tissue or representations of the interior of a body, including radiography, ultrasound, computerized tomography (CT), magnetic resonance imaging (MRI), cardiac angiography and optical microscopy, etc. plays a vital role in clinical analysis.
However, the analysis of images mostly regarded as an operator-dependent method is time-consuming and has narrow limitations when it is necessary to process complex and large amounts of medical images for medical diagnosis.
Recently, the rapid development of technology such as artificial intelligence and big data in imaging has triggered a vigorous growth in medical imaging applications that assist doctors in making initial interpretation of barely imperceptible symptoms and even primary diagnosis for the relevant symptoms from those complicated and large data.
In this paper, Mask Region-based Convolutional Neural Network (Mask RCNN) applied a series of biomedical image object detection for establishing the detection system and the deep learning model for blood cells and chest X-rays respectively.
The model also equipped with a faster convolutional neural network (Faster RCNN) which fulfills the goal of target object identification and coordinate calibration.
Simultaneously, integrated with the method of instance segmentation for automatic segmentation and the construction of pixel-level foreground and background for each target object, the pixel-level mask model system was established successfully to visualize the object counter in the bounding box to make a medical judgment by blood cells and symptoms of the lung.
Moreover, the study organized a new database for blood cells and pulmonary nodules of chest X-ray, then fine-tuned the model and applied with transfer learning to accelerate the training speed, and replaced traditional selective search with Region Proposal Network (RPN) to enhance the recognition rate and the speed of selecting of regions of interest. In this study, the frame per second can be as high as 15 images per second, and the mean average precision calculated in the classification of blood cells is 0.931 and that of pulmonary nodules is 0.911, respectively. While in instance segmentation, intersection over union was applied to calculate the accuracy of the symptom mask of blood cells and chest X-ray which could reach 0.961 and 0.928, respectively. Eventually, the result provided the necessary empirical evidence that the detection system established by the study will realize the vision to assist health professionals with the rapid and effective clinical diagnosis of clinical application. |
參考文獻 |
[1] Deng, Jia, et al. "Imagenet: A large-scale hierarchical image database." 2009 IEEE conference on computer vision and pattern recognition. Ieee, 2009.
[2] Krizhevsky, Alex, Ilya Sutskever, and Geoffrey E. Hinton. "Imagenet classification with deep convolutional neural networks." Advances in neural information processing systems. 2012.
[3] Zhuang, Fuzhen, et al. "A comprehensive survey on transfer learning." arXiv preprint arXiv:1911.02685 (2019).
[4] Bushberg, Jerrold T., and John M. Boone. The essential physics of medical imaging. Lippincott Williams & Wilkins, 2011.
[5] de Assuncao, Marcos Dias, Alexandre da Silva Veith, and Rajkumar Buyya. "Distributed data stream processing and edge computing: A survey on resource elasticity and future directions." Journal of Network and Computer Applications 103 (2018): 1-17.
[6] Russell, Stuart, and Peter Norvig. "Artificial intelligence: a modern approach." (2002).
[7] Dreiseitl, Stephan, and Lucila Ohno-Machado. "Logistic regression and artificial neural network classification models: a methodology review." Journal of biomedical informatics 35.5-6 (2002): 352-359.
[8] Flowers, Johnathan Charles. "Strong and Weak AI: Deweyan Considerations." AAAI Spring Symposium: Towards Conscious AI Systems. 2019.
[9] Kaelbling, Leslie Pack, Michael L. Littman, and Andrew W. Moore. "Reinforcement learning: A survey." Journal of artificial intelligence research 4 (1996): 237-285.
[10] Michie, Donald, David J. Spiegelhalter, and C. C. Taylor. "Machine learning." Neural and Statistical Classification 13.1994 (1994): 1-298.
[11] LeCun, Yann, Yoshua Bengio, and Geoffrey Hinton. "Deep learning." nature 521.7553 (2015): 436-444.
[12] Granter, Scott R., Andrew H. Beck, and David J. Papke Jr. "AlphaGo, deep learning, and the future of the human microscopist." Archives of pathology & laboratory medicine 141.5 (2017): 619-621.
[13] Suykens, Johan AK, and Joos Vandewalle. "Least squares support vector machine classifiers." Neural processing letters 9.3 (1999): 293-300.
[14] Wold, Svante, Kim Esbensen, and Paul Geladi. "Principal component analysis." Chemometrics and intelligent laboratory systems 2.1-3 (1987): 37-52.
[15] Rich Caruana, Alexandru Niculescu-Mizil. (2006) An Empirical Comparison of Supervised Learning Algorithms. In Proceeding ICML ′06 Proceedings of the 23rd international conference on Machine learning, Pittsburgh, Pennsylvania, USA.
[16] Girshick, Ross, et al. "Region-based convolutional networks for accurate object detection and segmentation." IEEE transactions on pattern analysis and machine intelligence 38.1 (2015): 142-158.
[17] Girshick, Ross. "Fast r-cnn." Proceedings of the IEEE international conference on computer vision. 2015.
[18] Ren, Shaoqing, et al. "Faster r-cnn: Towards real-time object detection with region proposal networks." Advances in neural information processing systems. 2015.
[19] He, Kaiming, et al. "Mask r-cnn." Proceedings of the IEEE international conference on computer vision. 2017.
[20] Lin, Tsung-Yi, et al. "Feature pyramid networks for object detection." Proceedings of the IEEE conference on computer vision and pattern recognition. 2017.
[21] Fu, Jun, et al. "Stacked deconvolutional network for semantic segmentation." IEEE Transactions on Image Processing (2019).
[22] Moon, Gyeongsik, Ju Yong Chang, and Kyoung Mu Lee. "Posefix: Model-agnostic general human pose refinement network." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2019.
[23] Lin, Tsung-Yi, et al. "Feature pyramid networks for object detection." Proceedings of the IEEE conference on computer vision and pattern recognition. 2017.
[24] He, Kaiming, et al. "Deep residual learning for image recognition." Proceedings of the IEEE conference on computer vision and pattern recognition. 2016.
[25] Hochreiter, Sepp. "The vanishing gradient problem during learning recurrent neural nets and problem solutions." International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems 6.02 (1998): 107-116.
[26] Uijlings, Jasper RR, et al. "Selective search for object recognition." International journal of computer vision 104.2 (2013): 154-171.
[27] Ren, Shaoqing, et al. "Faster r-cnn: Towards real-time object detection with region proposal networks." Advances in neural information processing systems. 2015.
[28] Rothe, Rasmus, Matthieu Guillaumin, and Luc Van Gool. "Non-maximum suppression for object detection by passing messages between windows." Asian conference on computer vision. Springer, Cham, 2014.
[29] Rezatofighi, Hamid, et al. "Generalized intersection over union: A metric and a loss for bounding box regression." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2019. |