運用VGG網絡對靜息態功能性磁振造影成分圖進行區分

、線上人數：18

、訪客IP：18.118.226.34

姓名	劉孟儒(Meng-Ru Liu) 查詢紙本館藏	畢業系所	認知與神經科學研究所
論文名稱	運用VGG網絡對靜息態功能性磁振造影成分圖進行區分 (Classification of RS-fMRI component maps using Visual Geometry Group network)
檔案	[Endnote RIS 格式] [Bibtex 格式] [相關文章] [文章引用] [完整記錄] [館藏目錄] 至系統瀏覽論文 (2025-7-17以後開放)
摘要(中)	獨立成分分析作為眾多分析靜息態功能性磁振造影的方式之一，被廣泛應用在各樣的研究當中，但獨立成分分析所產出的結果 – 成分圖(component map) 並非全部來源於腦部活化，更多的是由儀器雜訊、頭部晃動或心臟跳動所引起。為了將成分圖區分成腦部活化以及非腦部活化，目前最常使用的方式為人工判別，但隨著科技的發達，我們應追求客觀的判別方式，因此本實驗希望能透過卷積神經網絡的其中一個經典模型—VGG(Visual Geometry Group)的架構作為參考，透過監督式學習的方法，訓練出一個最適合的模型來幫助醫師，能夠對大量的成分圖進行初次的篩選，將屬於腦部活化的成分圖給找出來。在這篇論文中，我們針對 4 項模型建構前非常重要的參數進行測試，包含 Epochs、模型層數、學習率大小以及卷積核大小，找出各樣參數該如何設定，才能使模型的表現最佳化。此外，由於硬體設備的不足，我們必須降低輸入成分圖的解析度，因此我們也對 180x180 以及 50x50 這兩個降低後的解析度進行模型的訓練，並找出兩者間的模型表現的差異。本實驗的資料為實驗室先前進行其他實驗收取資料的二次使用，取當中 10 位健康受試者 6 分鐘的靜息態磁振造影經過獨立成分分析後的成分圖，經過前處理將其進行空間的正規化，對齊並疊套在膨脹處理後的標準腦圖譜上，並經由專家對所有的成分圖進行標記區分後放入模型當中訓練。結果發現，在最佳化模型參數的狀況下，180x180 所訓練出來的 VGG 模型在 Test AUC 上顯著的高於 50x50 所訓練出來的 VGG 模型，並且當我們將預測錯誤的成分圖放大後，我們發現在 50x50 以及 180x180 的成分圖上都有特徵丟失以及模糊的狀況，但 50x50 的情況更為嚴重，因此可以得知降低圖片的解析度確實會影響模型的判斷，因此在硬體設備許可的狀況下，我們應該將完整圖片輸入，才能使模型的表現最佳化。
摘要(英)	Independent Component Analysis (ICA) is widely used as one of the methods for analyzing resting-state functional magnetic resonance imaging (rsfMRI) data in various research studies. However, the component maps generated by ICA do not solely originate from brain activation but are often influenced by instrument noise, head motion, or cardiac activity. To distinguish brain-activated component maps from non-brain-activated ones, manual inspection is commonly employed. However, with the advancement of technology, there is a need for objective discrimination methods. Therefore, in this experiment, we aimed to utilize the architecture of one of the commonly used Convolutional Neural Networks (CNN) models, VGG, as a classification model. Through supervised learning, we trained the VGG model to best sort out the brain activation independent components from a large number of component maps. In this work, we conducted tests on four crucial parameters for constructing the models, including the number of epochs, the number of model layers, learning rate, and convolutional kernel size, to determine the optimal settings for achieving the best model performance. The tested images were constructed by combining the four views (left and right lateral views and left and right medial views) of the component maps, spatially normalized and overlaid on the inflated Montreal Neurological Institute (MNI) standard brain. In addition, due to hardware limitations, we had to reduce the resolution of the tested images of the component maps. As a result, we trained the model to differentiate tested images with two different resolution, 180x180 and 50x50 from the original 520x370, and examine the model performance, respectively. The data used in this experiment were obtained as secondary usage from previously conducted experiments in the laboratory. Component maps from resting-state fMRI scans of 10 healthy subjects, collected over a 6-minute period, were preprocessed, classified, and utilized for training the model. The results show that, under optimized model parameters, the VGG model trained with 180x180 resolution significantly outperforms the one trained with 50x50 resolution in terms of Test AUC. Additionally, when we magnify the misclassified component maps, we observe feature loss and blurriness in both the 50x50 and 180x180 maps, with the 50x50 resolution exhibiting more severe issues. This indicates that reducing image resolution does affect the model′s judgment, suggesting that, whenever possible within the constraints of hardware resources, inputting the complete images would optimize the model′s performance.
關鍵字(中)	★ 功能性磁振造影 ★ VGG網絡 ★ 獨立成分分析 ★ 圖像識別	關鍵字(英)
論文目次	摘要……………………………………………………………………………..……...Ⅰ Abstract…………………………………………………………………………..……Ⅱ 目錄…………………………………………………………………………………..Ⅳ 圖目錄………………………………………………………………………………..Ⅵ 表目錄………………………………………………………………………….......Ⅷ 第一章緒論…………………………………………………………………………1 1.1 靜息態功能性磁振造影(resting state fMRI)…...…………………….1 1.1.1 磁振造影(Magnetic Resonance Imaging, MRI)...…..………………...1 1.1.2 功能性磁振造影(functional Magnetic Resonance Imaging, fMRI)….2 1.1.3 統計參數映射分析(Statistical Parametric Mapping, SPM)...…..…….4 1.1.3.1 一般線性模型(General Linear Model, GLM)...…..…………………..5 1.1.4 靜息態功能性磁振造影(resting state fMRI)...…..…………………...8 1.1.5 獨立成分分析(Independent Component Analysis, ICA)……………10 1.2 視覺幾何組(Visual Geometry Group, VGG)……………...………...13 1.2.1 深度神經網絡(Deep Neural Network)……………………..………..13 1.2.2 圖像識別………………..…………………………………..…..……14 1.2.3 卷積神經網路(Convolutional Neural Network, CNN)……..……….16 1.2.4 視覺幾何組(Visual Geometry Group, VGG)…..………….......…….17 第二章材料與方法…………………………………………....…………………..20 2.1 研究目的….………………………………………………………….20 2.2 研究問題…………………………………………..………………....20 2.3 研究步驟……………………………………………..……………....21 2.4 輸入資料……………………………………………………………..22 2.5 資料前處理…………………………………………………………..22 2.5.1 fMRI 資料處理..……………………………………………………..22 2.5.1.1 切片時序調整………………………………………………………..23 2.5.1.2 運動校正……………………………………………………………..23 2.5.2 成分圖資料處理……………………………………………………..23 2.5.3 圖片合併…………………………………………………………..…26 2.5.4 圖片裁切……………………………………………………..………26 2.5.5 圖片降低解析度……………………………………………………..28 2.5.6 資料集區分…………………………………………………………..30 2.6 模型參數測試………………………………………………………..24 2.7 硬體設備……………………………………………………………..36 第三章結果………………………………………………………………………..37 3.1 Epochs………………………………………………………………..37 3.2 模型層數…………………………………………………………..…40 3.3 學習率………………………………………………………………..42 3.4 卷積核大小(kernel size)……………………………………………..43 3.5 參數總結……………………………………………………………..45 3.6 實際應用……………………………………………………………..46 3.6.1 預測錯誤情況一：影像縮放過程中所造成活化區域的變形……..47 3.6.2 預測錯誤情況二：原始活化影像區域太小………………………..49 第四章討論………………………………………………………………………..52 4.1 Epochs………………………………………………………………..52 4.2 模型層數……………………………………………………………..53 4.3 學習率………………………………………………………………..54 4.4 卷積核大小…………………………………………………………..54 4.5 第一種被預測錯誤的腦部活化成分圖……………………………..55 4.6 第二種被預測錯誤的腦部活化成分圖……………………………..57 第五章結論………………………………………………………………………..58 參考文獻……………………………………………………………………………..60 附錄…………………………………………………………………………………..63
參考文獻	Agrawal, A., & Mittal, N. J. T. V. C. (2020). Using CNN for facial expression recognition: a study of the effects of kernel size and number of filters on accuracy. 36(2), 405-412. Bell, A. J., & Sejnowski, T. J. J. N. c. (1995). An information-maximization approach to blind separation and blind deconvolution. 7(6), 1129-1159. Biswal, B., Zerrin Yetkin, F., Haughton, V. M., & Hyde, J. S. J. M. r. i. m. (1995). Functional connectivity in the motor cortex of resting human brain using echo‐ planar MRI. 34(4), 537-541. Biswal, B. B., Kylen, J. V., & Hyde, J. S. J. N. i. B. (1997). Simultaneous assessment of flow and BOLD signals in resting‐state functional connectivity maps. 10(4‐ 5), 165-170. Brett, M., Penny, W., & Kiebel, S. J. H. b. f. (2003). Introduction to random field theory. 2, 867-879. Burman, P. J. B. (1989). A comparative study of ordinary cross-validation, v-fold cross-validation and the repeated learning-testing methods. 76(3), 503-514. Chansong, D., & Supratid, S. (2021). Impacts of kernel size on different resized images in object recognition based on convolutional neural network. Paper presented at the 2021 9th International Electrical Engineering Congress (iEECON). Flandin, G., Novak, M. J. J. f. B., & Applications, C. (2020). fMRI data analysis using SPM. 89-116. Friston, K. J., Frith, C., Liddle, P., Frackowiak, R. J. J. o. C. B. F., & Metabolism. (1991). Comparing functional (PET) images: the assessment of significant change. 11(4), 690-699. Glover, G. H. J. N. C. (2011). Overview of functional magnetic resonance imaging. 22(2), 133-139. Hinton, G., Deng, L., Yu, D., Dahl, G. E., Mohamed, A.-r., Jaitly, N., . . . Sainath, T. N. J. I. S. p. m. (2012). Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups. 29(6), 82-97. Hung, C.-C., Liu, Y.-H., Huang, C.-C., Chou, C.-Y., Chen, C.-M., Duann, J.-R., . . . Lin, C.-P. J. S. r. (2020). Effects of early ketamine exposure on cerebral gray matter volume and functional connectivity. 10(1), 1-13. Katti, G., Ara, S. A., & Shireen, A. J. I. j. o. d. c. (2011). Magnetic resonance imaging (MRI)–A review. 3(1), 65-70 LeCun, Y., Bottou, L., Bengio, Y., & Haffner, P. J. P. o. t. I. (1998). Gradient-based learning applied to document recognition. 86(11), 2278-2324. Lee, M. H., Smyser, C. D., & Shimony, J. S. J. A. J. o. n. (2013). Resting-state fMRI: a review of methods and clinical applications. 34(10), 1866-1872. Li, X., Wang, W., Hu, X., & Yang, J. (2019). Selective kernel networks. Paper presented at the Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. McCullagh, P., & Nelder, J. (1989). Generalized Linear Models Second edition Chapman & Hall: London. McCulloch, W. S., & Pitts, W. J. T. b. o. m. b. (1943). A logical calculus of the ideas immanent in nervous activity. 5, 115-133. McKeown, M. J., Jung, T.-P., Makeig, S., Brown, G., Kindermann, S. S., Lee, T.-W., & Sejnowski, T. J. J. P. o. t. N. A. o. S. (1998). Spatially independent activity patterns in functional MRI data during the Stroop color-naming task. 95(3), 803-810. Mittal, V., Gangodkar, D., & Pant, B. (2020). Exploring The Dimension of DNN Techniques For Text Categorization Using NLP. Paper presented at the 2020 6th International Conference on Advanced Computing and Communication Systems (ICACCS). Ogawa, S., Lee, T.-M., Kay, A. R., & Tank, D. W. J. p. o. t. N. A. o. S. (1990). Brain magnetic resonance imaging with contrast dependent on blood oxygenation. 87(24), 9868-9872. Raichle, M. E., MacLeod, A. M., Snyder, A. Z., Powers, W. J., Gusnard, D. A., & Shulman, G. L. (2001). A default mode of brain function. 98(2), 676-682. doi: doi:10.1073/pnas.98.2.676 Rutt, B. K., & Lee, D. H. J. J. o. M. R. I. (1996). The impact of field strength on image quality in MRI. 6(1), 57-62. Sa, I., Ge, Z., Dayoub, F., Upcroft, B., Perez, T., & McCool, C. J. s. (2016). Deepfruits: A fruit detection system using deep neural networks. 16(8), 1222. Siddique, F., Sakib, S., & Siddique, M. A. B. (2019). Handwritten digit recognition using convolutional neural network in Python with tensorflow and observe the variation of accuracies for various hidden layers: Preprints. Simonyan, K., & Zisserman, A. J. a. p. a. (2014). Very deep convolutional networks for large-scale image recognition. Stork, D. (2001). Foundations of Occam’s razor and parsimony in learning. Paper presented at the NIPS 2001 Workshop. Sun, Y., Chen, Y., Wang, X., & Tang, X. J. A. i. n. i. p. s. (2014). Deep learning face representation by joint identification-verification. 27. Thulborn, K. R., Waterton, J. C., Matthews, P. M., & Radda, G. K. J. B. e. B. A.-G. S. (1982). Oxygenation dependence of the transverse relaxation time of water protons in whole blood at high field. 714(2), 265-270. Van Den Heuvel, M., Mandl, R., & Hulshoff Pol, H. J. P. o. (2008). Normalized cut group clustering of resting-state FMRI data. 3(4), e2001. Ying, X. (2019). An overview of overfitting and its solutions. Paper presented at the Journal of physics: Conference series
指導教授	段正仁張智宏(Jeng-Ren Duann Chih-Hung Chang)	審核日期	2023-7-20
推文	facebook plurk twitter funp google live udn HD myshare reddit netvibes friend youpush delicious baidu
網路書籤	Google bookmarks del.icio.us hemidemi myshare

博碩士論文 108825007 詳細資訊