Prediction Models for Diabetic Nephropathy based on laboratory tests

DC field	Value	Language
DC.contributor	資訊工程學系	zh_TW
DC.creator	方詩匀	zh_TW
DC.creator	Shih-Yun Fang	en_US
dc.date.accessioned	2022-08-20T07:39:07Z
dc.date.available	2022-08-20T07:39:07Z
dc.date.issued	2022
dc.identifier.uri	http://ir.lib.ncu.edu.tw:88/thesis/view_etd.asp?URN=108522095
dc.contributor.department	資訊工程學系	zh_TW
DC.description	國立中央大學	zh_TW
DC.description	National Central University	en_US
dc.description.abstract	糖尿病為國人最常見慢性病之一，且時常伴隨其他疾病發生。其中，糖尿病腎病變便是最常見的併發症中的一種，同時也是高發病率與高死亡率的疾病。由於腎臟相關疾病在早期不易察覺，等到患者意識到腎功能衰退時，通常都已經需要依靠血液透析維生。如果能在尚未發病的時期，就告知患者未來患病的可能性，或許能讓患者多加留意自己健康狀況。對預測結果提供有效的時間資訊是在研究縱向資料很重要的影響因子，因此本研究會在現有的實驗室資料上探討不同的時間序資料切割方式對於結果的影響。本研究在生化檢測資料上訓練不同架構的機器學習模型，包含以樹狀結構為基底的學習模型XGBoost、以TensorFlow構造的多層感知機，以及先以分群演算法來分群各資料點，再利用泰勒展開式去逼近資料點的雅各比矩陣學習模型。此外，本研究比較多種特徵選取方法並分析特徵對於結果的影響。最終，以多層感知機與自選特徵在交叉驗證上的效果最好，準確率與靈敏度分別達到85.7%與85.4%。	zh_TW
dc.description.abstract	Diabetes is one of the most common chronic diseases in Taiwan and is often accompanied by complications. Diabetic nephropathy is among the most frequent of these and carries high morbidity and mortality. Because kidney-related diseases are hard to detect at an early stage, most patients are unaware of the condition until it has progressed; by the time they notice declining kidney function, they usually already depend on hemodialysis to survive. Informing patients of their future risk before onset may help them pay closer attention to their health. Providing effective temporal information for prediction is an important factor in studying longitudinal data, so this study examines how different ways of segmenting time-series data affect the results, using existing laboratory data. Machine learning models with different architectures are trained on biochemical test data: the tree-based model XGBoost, a multilayer perceptron built with TensorFlow, and the Jacobian matrix learning model (JMLM). JMLM is generally more interpretable than the other models because it first clusters the data points and then approximates them with a Taylor series expansion. The study also compares several feature selection methods and analyzes the impact of features on the results. In cross-validation, the multilayer perceptron with self-selected features performed best, reaching an accuracy of 0.857 and a sensitivity of 0.854.	en_US
DC.subject	糖尿病腎病變	zh_TW
DC.subject	慢性腎臟病	zh_TW
DC.subject	深度學習	zh_TW
DC.subject	疾病預測模型	zh_TW
DC.title	基於檢驗數值的糖尿病腎病變預測模型	zh_TW
dc.language.iso	zh-TW	zh-TW
DC.title	Prediction Models for Diabetic Nephropathy based on laboratory tests	en_US
DC.type	博碩士論文	zh_TW
DC.type	thesis	en_US
DC.publisher	National Central University	en_US

Thesis 108522095: full metadata record