以接受者操作特徵曲面下的體積探討不同生物指標的組合對疾病的預測能力

以作者查詢圖書館館藏

、以作者查詢臺灣博碩士

、以作者查詢全國書目

、勘誤回報

、線上人數：12

、訪客IP：3.16.75.169

姓名

洪培紋(Pei-wen Hung) 查詢紙本館藏

畢業系所

統計研究所

論文名稱

以接受者操作特徵曲面下的體積探討不同生物指標的組合對疾病的預測能力
(Use VUS to evaluate the disease prediction under different biomarker combinations)

相關論文

★ 長期與存活資料之聯合模型-新方法和數值方法的改進	★ 復發事件存活分析的共享廣義伽瑪脆弱因子之半母數聯合模型
★ 加乘法風險模型結合長期追蹤資料之聯合模型	★ 有序雙重事件時間分析使用與時間相關的共變數－邊際方法的比較
★ 存活與長期追蹤資料之聯合模型－台灣愛滋病實例研究	★ 以聯合模型探討地中海果蠅繁殖力與老化之關係
★ 聯合模型在雞尾酒療法療效評估之應用—利用CD4/CD8比值探討台灣愛滋病資料	★ 時間相依共變數之雙重存活時間分析—台灣愛滋病病患存活時間與 CD4 / CD8 比值關係之案例研究
★ Cox比例風險模型之參數估計─比較部分概似法與聯合模型	★ 復發事件存活時間分析-丙型干擾素對慢性肉芽病患復發療效之案例研究
★ Cox 比例風險假設之探討與擴充風險模型之應用	★ 以聯合模型探討原發性膽汁性肝硬化
★ 聯合長期追蹤與存活資料分析－肝硬化病患之實例研究	★ 復發事件存活時間分析-rhDNase對囊狀纖維化病患復發療效之案例研究
★ 聯合長期追蹤與存活資料分析-原發性膽汁性肝硬化病患之實例研究	★ 復發事件存活時間分析-Thiotepa對膀胱癌病患復發療效之案例研究

檔案

[Endnote RIS 格式]

[Bibtex 格式]

[相關文章]

[文章引用]

[完整記錄]

[館藏目錄]

[檢視]

[下載]

本電子論文使用權限為同意立即開放。
已達開放權限電子全文僅授權使用者為學術研究之目的，進行個人非營利性質之檢索、閱讀、列印。
請遵守中華民國著作權法之相關規定，切勿任意重製、散佈、改作、轉貼、播送，以免觸法。

摘要(中)

在本篇論文中主要探討如何利用接受者作業特徵曲線 (Receiver Operating Characteristic curve ; ROC curve)來判斷不同的生物指標(biomarker)對於疾病預測能力的好壞，在本篇論文中，所考慮的共變數數值和疾病狀態是會跟隨時間而改變的，一般地情況下都是使用接受者作業特徵曲線下面積 (Area under ROC curve ; AUC)來判斷，但由於是時間相依共變數的資料，所以在不同時間點所得到的接受者作業特徵曲線下的面積可能有大有小，無法明確判別出哪一個生物指標的預測能力較好，因此，就想到採用接受者作業特徵曲面下體積 (Volume under ROC surface ; VUS)來判斷，當體積越大代表預測能力越好。這邊使用二元分配最近鄰點估計法 (NNE)來估計ROC曲線。在模擬研究中，生成兩個生物指標，想要知道哪一個生物指標的預測能力較佳。由AUC得知，生物指標一和生物指標二的線性組合的預測能力較佳，又由VUS得知，生物指標一和生物指標二的線性組合的預測能力較佳。在本篇論文中，舉了兩個實例，第一個為探討CD4細胞數和病毒載量針對愛滋病的預測能力的好壞，由AUC可知，CD4細胞數對於愛滋病的預測能力是優於病毒載量；第二個為果蠅的資料，探討果蠅產卵量與老化狀況的關係，在這裡討論了三種不同的生物指標：總產卵量、最大產卵量的時間和每日產卵量，由AUC得知，總產卵量和每日產卵量對果蠅老化的影響較大，又由VUS得知，每日產卵量對果蠅老化的影響最大。

摘要(英)

In this paper, we are mainly interested in using the receiver operating characteristic (ROC) curve to determine which biomarker has better disease prediction. We consider the data that patient’s covariates and their disease status are both time dependent and, in general, this kind of data is justified by the Area under ROC curve (AUC). However, due to the time-dependent covariates, AUC values may vary (under different time points), which make us difficult to make inference (or decide which biomarker has better disease prediction). Thus, we adapt the volume under the ROC surface (VUS) approach instead－the larger the volume, the better the disease prediction. Here, we use the nearest neighbor estimation for a bivariate distribution to estimate the ROC curve. In simulation, we generate two biomarkers, and we are interested in which biomarker has better prediction. From the AUC values, we can know that the biomarker one is better than biomarker two, we compare biomarker one to the combination of biomarker one and biomarker two and by the AUC values, we can know that the linear combination of biomarkers has better prediction. We also use the VUS, we know the linear combination of biomarkers has better prediction. In the practical data analysis, two examples (cases) are given. First, we are interested in the biomarkers CD4 counts and viral load, which one has better prediction for the AIDS. From the AUC values, we can know that the CD4 counts is better than viral load. Second, we are interested in the biomarkers total number of eggs laid during lifetime, the time of maximum eggs laid and number of eggs laid daily, which one has more influence to medfly lifetime. From the AUC values, we can know that the total number of eggs laid during lifetime and number of eggs laid daily are better, but by volume under the ROC surface, number of eggs laid daily has more influence to medfly lifetime.

關鍵字(中)

★ 接受者作業特徵曲線
★ 接受者作業特徵曲線下面積
★ 接受者作業特徵曲面下體積
★ 二元分配最近鄰點估計法
★ 生物指標

關鍵字(英)

★ Receiver Operating Characteristic curve
★ Area under ROC curve
★ Volume under ROC surface
★ Nearest neighbor estimation of a bivariate distribution
★ biomarker

論文目次

目錄
摘要 i
Abstract ii
致謝辭 iv
目錄 v
圖目次 vii
表目次 viii
第一章緒論 1
1.1 診斷準確性的測量. . . . . . . . . . . . . . . . . . 1
1.1.1 敏感度和專一性. . . . . . . . . . . . . . . . 2
1.1.2 準確性. . . . . . . . . . . . . . . . . . . . 4
1.1.3 勝算比. . . . . . . . . . . . . . . . . . . . 4
1.1.4 尤登指標. . . . . . . . . . . . . . . . . . . 4
1.1.5 接受者作業特徵曲線和接受者作業特徵曲線下面積. 5
1.1.6 概似比. . . . . . . . . . . . . . . . . . . . 6
1.2 單一樣本的估計與假設檢定 7
1.3 接受者作業特徵曲線的估計 8
1.3.1 Kaplan-Meier估計法. . . . . . . . . . . . . . 8
1.3.2 二元分配的最近鄰點估計法. . . . . . . . . . . 9
1.3.3 標準差的估計. . . . . . . . . . . . . . . . . 10
第二章統計方法 12
2.1 長期追蹤資料. . . . . . . . . . . . . . . . . . . . 13
2.2 存活模型. . . . . . . . . . . . . . . . . . . . . . 14
2.3 接受者作業特徵曲線. . . . . . . . . . . . . . . . . 17
第三章統計模擬 20
3.1 模擬方法. . . . . . . . . . . . . . . . . . . . . . 20
3.2 模擬資料設計. . . . . . . . . . . . . . . . . . . . 22
3.3 模擬結果. . . . . . . . . . . . . . . . . . . . . . 22
第四章實例分析 29
4.1 愛滋病資料背景. . . . . . . . . . . . . . . . . . . 29
4.2 接受者作業特徵曲線. . . . . . . . . . . . . . . . . 30
4.3 果蠅資料背景. . . . . . . . . . . . . . . . . . . . 37
4.4 接受者作業特徵曲線. . . . . . . . . . . . . . . . . 38
第五章結論與探討 46
參考文獻 48

參考文獻

Akritas, M. G. (1994). “Nearest neighbor estimation of a bivariate distribution under random censoring.” Annals of Statistics, 22, 1299-1327.
Carey, J. R., Liedo, P., M ller, H. G., Wang, J. L. & Chiou, J. M. (1998). ”Relationship of age patterns of fecundity to mortality, longevity,and lifetime reproduction in a large cohort of Mediterranean fruit fly females.” J. of Gerontology : Biological Sciences 53, 245-251.
Cleveland, W. S. (1979). “Robust Locally Weighted Regression and Smoothing Scatterplots.”Journal of the American Statistical Associtatio, 74, 829-836.
Cox, D. R. and Oakes, D. (1984). Analysis of Survival Data, Chapman and Hall,London, New York.
Dempster, A. P., Laird,N. M. and Rubin, D. B. (1977). “Maximum Likelihood from Imcomplete Data via the EM Algorithm.” Journal of the Royal StatisticalSociety Series B (Methodological), 39, 1-38.
Satten, G. A., Datta, S. and Robins, J. (2001). “Estimating the marginal survival function in the presence of time dependent covariates. ” Statistics and Probability Letters, 54, 397-403.
Hanley, J. A. (1989). “Receiver operating characteristic (ROC) methodology:the state of the art.” Critical Reviews in Diagnostic Imaging, 29, 307-335.
Heagrty, P. J., Lumley, T. and Pepe, M. S (2000). “Time-dependent ROC curves for censored survival data and a diagnostic marker.” Biometrics, 56, 337-344.
Henderson, R., Diggle, P. and Dobson, A. (2000). “Joint modeling of longitudinal measurements and event time data.” Biostatistics, 4, 465-480.
Jones, M. C. (1990). “The performance of kernel density functions in kernel distribution function estimation.” Statistics and Probability Letters, 9, 129-132.
Jones, M. C. and Sheather, S.J. (1991). “Using non-stochastic terms to advantage in kernel-based estimation of integrated squared density derivatives.” Statistics and Prob-ability Letters, 11, 511-514.
Hsieh, F., Tseng, Y. K. and Wang, J. L. (2006). “Joint Modeling of Survival and Longitudinal Data: Likelihood Approach Revisited. ” Biometrics, 62.1037-1043.
Tseng, Y. K., Hsieh F. and Wang, J. L. (2005). “ Joint modeling of accelerated failure time and longitudinal data.” Biometrika, 92, 587-603.
Tsiatis, A. A., Degruttola, V. and Wulfsohn, M. S. (1995). “Modeling the Relationship of Survival to Longitudinal Data Measured with Error. Applications to Survival and CD4 Coutns in Patients with AIDS.” Journal of the American Statistical Association, 90, 27-37.
Wulfsohn, M. S. and Tsiatis, A. A. (1997). “A Joint Model for Survival and Longitudinal Data Measured with Error.” Biometrics, 53, 330-339.
Zeng, D. and Lin, D. Y. (2007a). “Maximum Likelihood Estimation in Semiparametric Regression Models with Censored Data (with Discussion).” Journal of the Royal Statistical Society, Series B, 69, 507-564.
Zweig, M. H. and Campbell, G. (1993). “Receiver-operator characteristic plots: a fundamental evaluation tool in clinical medicine.” Clinical Chemistry, 39, 561-577.

指導教授

曾議寬

審核日期

2013-6-28

推文