博碩士論文 962405004 詳細資訊




以作者查詢圖書館館藏 以作者查詢臺灣博碩士 以作者查詢全國書目 勘誤回報 、線上人數:17 、訪客IP:18.191.186.72
姓名 王婉倫(Wan-Lun Wang)  查詢紙本館藏   畢業系所 統計研究所
論文名稱 多元反應變數長期資料之多變量線性混合模型
(Analysis of Multi-outcome Longitudinal Data with Multivariate Linear Mixed Models)
相關論文
★ 具Box-Cox轉換之逐步加速壽命實驗的指數推論模型★ 多重型 I 設限下串聯系統之可靠度分析與最佳化設計
★ 應用累積暴露模式至單調過程之加速衰變模型★ 串聯系統加速壽命試驗之最佳樣本數配置
★ 破壞性加速衰變試驗之適合度檢定★ 串聯系統加速壽命試驗之最佳妥協設計
★ 加速破壞性衰變模型之貝氏適合度檢定★ 加速破壞性衰變模型之最佳實驗配置
★ 累積暴露模式之單調加速衰變試驗★ 具ED過程之兩因子加速衰退試驗建模研究
★ 逆高斯過程之完整貝氏衰變分析★ 加速不變原則之偏斜-t過程
★ 花蓮地區地震資料改變點之貝氏模型選擇★ 颱風降雨量之統計迴歸預測
★ 花蓮地區地震資料之長時期相關性及時間-空間模型之可行性★ 台灣地區地震資料之經驗貝氏分析
檔案 [Endnote RIS 格式]    [Bibtex 格式]    [相關文章]   [文章引用]   [完整記錄]   [館藏目錄]   至系統瀏覽論文 ( 永不開放)
摘要(中) 在生物醫學或臨床試驗中,每個實驗個體隨著時間被重複測量紀錄多元反應變數和其他解釋變數。涉及分析此類型資料之研究一般稱作「多變量長期追蹤研究」。多變量線性混合效應模型已成為聯合分析多元長期資料的一個普遍使用工具。其主要目的除了描述重複測量值和其他解釋變數之間的關係外,並調查反應變數之間隨時間演變的關係。本文主要目的是基於多變量常態分佈與多變量t分佈去架構具p階自我迴歸(AR(p))之多變量線性混合模型,並以最大概似及貝氏觀點去發展相關的統計理論和演算方法。
在本文的第一部分,我們致力於提供額外的工具來分析多變量線性混合模型,其誤差項假設為根據自我迴歸之序列相關。基於ECM演算法之最大概似估計和透過馬可夫鏈蒙地卡羅(MCMC)程序之貝氏方法將被提出來計算模型之參數估計值。分數檢定統計量被提出來檢定在個體內誤差項之自相關性是否存在。同時我們也討論如何估計隨機效應以及未來值預測等問題。
由於考量常態假設下之模型對於潛在離群值的敏感以及資料的厚尾現象,本文的第二部分提出一個具穩健性一般化的多變量線性混合模型,此模型假設隨機效應和具AR(p)的個體內誤差項之聯合分佈為多變量t分佈。在此模型下,分數檢定統計量也被提出來檢測自相關的存在性。有效的混合ECME-scoring演算法被提出來求得參數之最大概似估計及其標準差。在貝氏結構中,我們利用Gibbs抽樣方法和Metropolis-Hastings演算法來獲得後驗分析推論。
最後,我們將所提出之理論方法應用於後天免疫缺乏症候群(愛滋病;AIDS)臨床試驗研究資料上。除此之外,模擬結果顯示具AR(p)之多變量線性混合模型確實能提供更精準的預測;多變量t線性混合模型確實比傳統的多變量線性混合模型對於離群值在參數估計上更具穩健性。
摘要(英) In biomedical studies or clinical trials, repeated measures with multiple characteristics and a set of covariates are taken from each subject over a certain period of time. Research involving the analysis of such data is commonly referred to as multivariate longitudinal studies. The multivariate linear mixed model (MLMM) has become a frequently used tool for a joint analysis of more than one series of longitudinal data. The primary objective of the topic is not only to describe the relationship between repeated measures and other possible covariates, but also to investigate the association of the evolutions of characteristics. My dissertation mainly consists of two themes with emphases on maximum likelihood (ML) and Bayesian inferences for MLMMs and its robust t-based extension, called the multivariate t linear mixed model (MtLMM).
In the first theme, we are devoted to providing additional tools for MLMMs in which the errors are assumed to be serially correlated according to an autoregressive process. We present a computationally flexible ECM pocedure for obtaining the ML estimates of model parameters. Moreover, the approximate Bayesian method as well as a fully Bayesian via the Markov chain Monte Carlo (MCMC) sampler in this context are discussed. A score test statistic is derived for testing the existence of autocorrelation among within-subject errors. The techniques for estimation of the random effects and prediction of further responses given past repeated measures are also investigated.
In the second theme, motivated by a concern of sensitivity to potential outliers and data with longer-than-normal tails or possible serial correlation, we aim to develop a robust generalization of the MLMM, which is constructed by using the multivariate t distribution and a parsimonious AR(p) dependence structure for the within-subject errors. A score test for inspection of the existence of autocorrelation is also derived. An efficient hybrid ECME-scoring procedure is provided for computing the ML estimates with standard errors as a by-product. Similarly, a fully Bayesian estimation can then be conducted by utilizing the MCMC algorithm. An approximate Bayesian method is also described for drawing the large-sample Bayesian inference.
The proposed methodologies are demonstrated through an application of the ACTG 175 study. Several simulation studies are undertaken to clarify the applicability of the MLMM and the robust-to-outliers ability of the MtLMM.
關鍵字(中) ★ 經驗貝氏
★ 自我迴歸模型
★ 離群值
★ 預測
★ 分數檢定
★ 隨機效應
關鍵字(英) ★ AR(p)
★ Empirical Bayes estimates
★ Outliers
★ Prediction
★ Random effects
★ Score test
論文目次 1 Introduction 1
1.1 Background and Literature Review......................1
1.2 Motivation and Objective..............................3
1.3 Overview..............................................5
2 Multivariate Linear Mixed Models for Multiple Longitudinal Data 7
2.1 Notation and Setting..................................7
2.1.1 Model formulation...................................8
2.1.2 The score test for autocorrelation..................9
2.2 Maximum Likelihood Estimation........................12
2.2.1 Parameter estimation via the ECM algorithm.........12
2.2.2 Initialization.....................................16
2.3 Bayesian Approach....................................17
2.3.1 Asymptotic methods.................................19
2.3.2 The MCMC method....................................23
2.4 Inference for Random Effects and Prediction..........26
3 Multivariate t Linear Mixed Models with Autoregressive Errors 30
3.1 Model Specification..................................30
3.1.1 The likelihood.....................................30
3.1.2 Large-sample properties............................32
3.2 Maximum Likelihood Approach..........................34
3.2.1 Parameter estimation via the hybrid ECME-scoring procedure................................................34
3.2.2 Notes on implementation............................39
3.3 Bayesian Inference...................................40
3.3.1 Approximate Bayesian method........................41
3.3.2 Implementation of MCMC sampling....................43
3.4 Estimation of Random Effects and Prediction of Future Values...................................................46
4 An Application to the ACTG 175 Study 49
4.1 Description of the Data..............................49
4.2 Preliminary Analysis.................................50
4.3 Analytical Results Based on ML Estimation............52
4.3.1 Model fitting......................................53
4.3.2 Diagnostics........................................60
4.4 Posterior Inference Based on MCMC Outputs............61
4.4.1 Bayesian model selection...........................61
4.4.2 Outlier detection..................................68
5 Simulation Studies 69
5.1 Study I: Utilities of the Model......................69
5.2 Study II: Robustness of the MtLMM....................72
6 Discussion and Conclusion 75
References 78
Appendices 83
A. The Score Vector and the Fisher Information Matrix for Model (2.1)..............................................83
B. Proof of Theorem 2.1..................................83
C. Posterior Distribution of θ in the MLMM via Normal Approximation............................................84
D. Proof of Eqs. (2.24) and (2.25).......................86
E. Proof of (2.27) and (2.28)............................87
F. The Score Vector and the Fisher Information Matrix for Model (3.1)..............................................89
G. Proof of Theorem 3.3..................................91
H. Normal Approximation of the Posterior Distribution of θ in the MtLMM..........................................92
參考文獻 Barndorff-Nielsen, O. and Schou, G. (1973). On the parameterization of autoregressive models by partial autocorrelations. J. Multivariate Anal. 3, 408–419.
Berger, J. O. (1985). Statistical Decision Theory and Bayesian Analysis. New York: Springer.
B¨ohning, D., Dietz, E., Schaub, R., Schlattmann, P. and Lindsay, B. G. (1994). The distribution of the likelihood ratio for mixtures of densities from the oneparameter exponential family. Ann. Inst. Statist. Math. 46, 373–388.
Box, G. E. P., Jenkins, G. M. and Reinsel, G. C. (1994). Time Series Analysis Forecasting and Control. 3rd Edition, Holden-Day.
Carlin, B. P. and Louis, T. A. (2000). Bayes and Empirical Bayes Methods for Data Analysis. 2nd Edition, Chapman & Hall/CRC.
Chi, E. M. and Reinsel, G. C. (1989). Models for longitudinal data with random effects and AR(1) errors. J. Amer. Statist. Assoc. 84, 452–459.
Chi, E. M. and Reinsel, G. C. (1991). Asymptotic properties of the score test for autocorrelation in a random effects with AR(1) errors model. Statist. Probab. Lett. 11, 453–457.
Chib, S. and Carlin, E. (1999). On MCMC sampling in hierarchical models. Statist. Comput. 9, 17–26.
Chib, S. and Greenberg, E. (1995). Understanding the Metropolis-Hastings algorithm. Amer. Statist. 49, 327–335.
Dempster, A. P., Laird, N. M. and Rubin, D. B. (1977). Maximum likelihood estimation from incomplete data via the EM algorithm (with discussion). J. Roy. Statist. Soc. Ser. B 39, 1–38.
Diggle, P. J., Heagerty, P., Liang, K. Y. and Zeger, S. L. (2002). Analysis of Longitudinal Data. 2nd Edition, New York: Clarendon Press.
Fieuws, S. and Verbeke, G. (2004). Joint modelling of multivariate longitudinal profiles: pitfalls of the random-effects approach. Statist. Med. 23, 3093–3104.
Fitzmaurice, G. M., Laird, N. M. andWare, J. (2004). Applied Longitudinal Analysis.New York: Wiley.
Gelfand, A. E., Hills, S. E., Racine-Poon, A. and Smith, A. F. M. (1990). Illustration of Bayesian inference in normal data models using Gibbs sampling. J. Amer. Statist. Assoc. 85, 972–985.
Gelfand, A. E. and Smith, A. F. M. (1990). Sampling based approaches to calculate marginal densities. J. Amer. Statist. Assoc. 85, 398–409.
Gelman, A., Carlin, J. B., Stern, H. S. and Rubin, D. B. (2004). Bayesian Data Analysis. 2nd Edition, London, Boca Raton, FL: Chapman & Hall, CRC press.
Geman, S. and Geman, D. (1984). Stochastic relaxation, Gibbs distributions and the Bayesian restoration of images. IEEE Trans. Pattern Anal. Mach. Intell.6, 721–741.
Golub, G. H., Van Loan, C. F. (1989). Matrix Computations. 2nd Edition, Baltimore, MD.: Johns Hopkins University Press.
Hammer, S. M., Katzenstein, D. A., Hughes, M. D., Gundacker, H., Schooley, R. T., Haubrich, R. H., Henry, W. K., Lederman, M. M., Phair, J. P., Niu, M., Hirsch, M. S. and Merigan, T. C. (1996). A trial comparing nucleoside monotherapy with combination therapy in HIV-infected adults with CD4 cell counts from 200 to 500 per cubic millimeter. The New England Journal of Medicine 335,
1081–1090.
Hastings, W.K. (1970). Monte Carlo sampling methods using Markov chains and their applications. Biometrika 57, 97–109.
Healy, M. J. R. (1968). Multivariate normal plotting. Appl. Statist. 17, 157–161.
Hedeker, D. and Gibbons, R. D. (2006). Longitudinal Data Analysis. Hoboken, New Jersey: John Wiley & Sons.
Hogan, J. W. and Laird, N. M. (1997). Model-based approaches to analysing incomplete longitudinal and failure time data. Statist. Med. 16, 259–272.
Jennrich, R. I. and Schluchter, M. D. (1986). Unbalanced repeated-measures models with structured covariance matrices. Biometrics 38, 967–974.
Kotz, S. and Nadarajah S. (2004). Multivariate t Distributions and Their Applications. Cambridge University Press.
Laird, N. M., Lange, N. and Stram, D. (1987). Maximum likelihood computations with repeated measures: application of the EM algorithm. J. Amer. Statist. Assoc. 82, 97–105.
Laird, N. M. and Ware, J. H. (1982). Random effects models for longitudinal data. Biometrics 38, 963–974.
Lange, K. L., Little R. J. A. and Taylor J. M. G. (1989). Robust statistical modeling using the t distribution. J. Amer. Statist. Assoc. 84, 881–896.
Lee, J. C., Lin, T. I., Lee, K. J. and Hsu, Y. L. (2005). Bayesian analysis of Box-Cox transformed linear mixed models with ARMA(p, q) dependence. J. Statist. Plann. Inference 133, 435–451.
Lin, T. I. (2008). Longitudinal data analysis using t linear mixed models with autoregressive dependence structures. J. Data Sci. 6, 333–355.
Lin, T. I. and Ho, H. J. (2008). A simplified approach to inverting the autocovariance matrix of a general ARMA(p, q) process. Statist. Probab. Lett. 78, 36–41.
Lin, T. I. and Lee, J. C. (2006). A robust approach to t linear mixed models applied to multiple sclerosis data. Statist. Med. 25, 1397-1412.
Lin, T. I. and Lee, J. C. (2007). Bayesian analysis of hierarchical linear mixed modeling using the multivariate t distribution. J. Statist. Plann. Inference 137, 484–495.
Lin, T. I., Lee, J. C. (2008). Estimation and prediction in linear mixed models with skew-normal random effects for longitudinal data. Statist. Med. 27, 1490–1507.
Lindstrom, M. J. and Bates, D. M. (1988). Newton-Raphson and EM algorithms for linear mixed-effects models for repeated-measures data. J. Amer. Statist. Assoc. 83, 1014–1022.
Liu, C. and Rubin, D. B. (1994). The ECME algorithm: a simple extension of EM and ECM with faster monotone convergence. Biometrika 81, 633–648.
Ljung, G. M. and Box, G. E. P. (1980). Analysis of variance with autocorrelated observations. Scandinavian Journal of Statistics 7, 172–180.
McLachlan, G. J. and Krishnan, T. (2008). The EM Algorithm and Extensions. 2nd Edition, New York: John Wiley & Sons.
Meng, X. L. and Rubin, D. B. (1993). Maximum likelihood estimation via the ECM algorithm: a general framework. Biometrika 80, 267–278.
Nadarajah, S. and Kotz, S. (2005). Mathematical properties of the multivariate t distribution. Acta Applicandae Mathematicae 89, 53–84.
Nadarajah, S. and Kotz, S. (2008). Estimation Methods for the Multivariate t Distribution. Acta Applicandae Mathematicae 102, 99–118.
Pinheiro, J. C., Liu, C. H. and Wu, Y. N. (2001). Efficient algorithms for robust estimation in linear mixed-effects model using the multivariate t distribution. J. Comput. Graph. Statist. 10, 249–276.
R Development Core Team. (2008). R: A Language and environment for Statistical Computing. Vienna: R Foundation for Statistical Computing.
Rao, C. R. (1973). Linear Statistical Inference and Its Application. 2nd Edition, New York: John Wiley.
Reinsel, G. (1984). Estimation and prediction in a multivariate random-effects generalized linear model. J. Amer. Statist. Assoc. 79, 406–414.
Rochon, J. (1992). ARMA covariance structures with time heteroscedasticity for repeated measures experiments. J. Amer. Statist. Assoc. 87, 777–784.
Roy, A. (2006). Estimating correlation coefficient between two variables with repeated observations using mixed effects model. Biom. J. 48, 286–301.
Sammel, M., Lin, X. and Ryan, L. (1999). Multivariate linear mixed models for multiple outcomes. Statist. Med. 18, 2479–2492.
SAS. (2001). SAS/Stat User's Guide, Version 8.2. Cary, NC: SAS Institute Inc.
Schafer, J. L. and Yucel, R. M. (2002). Computational strategies for multivariate linear mixed-effects models with missing values. J. Comput. Graph. Statist. 11, 437–457.
Seber, G. A. F. (1984). Multivariate Observations. New York: John Wiley.
Shah, A., Laird, N. and Schoenfeld, D. (1997). A random-effects model for multiple characteristics with possibly missing data. J. Amer. Statist. Assoc. 92, 775–779.
Shao, J. (2003). Mathematical Statistics. 2nd Edition, New York: Springer-Verlag.
Silvey, S. D. (1959). The lagrangian multiplier test. Ann. Math. Statist. 30, 389–407.
Song, P. X. K., Zhang, P. and Qu, A. (2007). Maximum likelihood inference in robust linear mixed-effects models using multivariate t distributions. Statist. Sinica 17, 929–943.
Song, X., Davidian, M. and Tsiatis, A. A. (2002a). A semiparametric likelihood approach to joint modeling of longitudinal and time-to-event data. Biometrics 58, 742–753.
Song, X., Davidian, M. and Tsiatis, A. A. (2002b). An estimator for the proportional hazards model with multiple longitudinal covariates measured with error. Biostatistics 3, 511–528.
Spiegelhalter, D. J., Best, N. G., Carlin, B.P. and Van der Linde, A. (2002). Bayesian measures of model complexity and fit (with discussion). J. Roy. Statist. Soc. Ser B 64, 583–616.
Verbeke, G. and Molenberghs, G. (2000). Linear Mixed Models for Longitudinal Data. New York: Springer.
Wakefield, J. C., Smith, A. F. M., Racine-Poon, A. and Gelfand, A. E. (1994). Bayesian analysis of linear and non-linear model by using Gibbs sampler. Appl. Statist. 43, 201–221.
Zellner, A. (1976). Bayesian and non-Bayesian analysis of the regression model with multivariate student-t error terms. J. Amer. Statist. Assoc. 71, 400–405.
Zucker, D. M., Zerbe, G. O., Wu, M. C. (1995). Inference for the association between coefficients in a multivariate growth curve model. Biometrics 51, 413–424.
指導教授 樊采虹(Tsai-Hung Fan) 審核日期 2010-5-24
推文 facebook   plurk   twitter   funp   google   live   udn   HD   myshare   reddit   netvibes   friend   youpush   delicious   baidu   
網路書籤 Google bookmarks   del.icio.us   hemidemi   myshare   

若有論文相關問題,請聯絡國立中央大學圖書館推廣服務組 TEL:(03)422-7151轉57407,或E-mail聯絡  - 隱私權政策聲明