References |
Afshari Safavi, A., Kazemzadeh Gharechobogh, H., & Rezaei, M. (2015). Comparison of EM algorithm and standard imputation methods for missing data: a questionnaire study on diabetic patients. Iranian journal of epidemiology, 11(3), 43-51.
Arriagada, P., Karelovic, B., & Link, O. (2021). Automatic gap-filling of daily streamflow time series in data-scarce regions using a machine learning algorithm. Journal of Hydrology, 598, 126454.
Baneshi, M. R., & Talei, A. R. (2012). Does the missing data imputation method affect the composition and performance of prognostic models? Iranian Red Crescent Medical Journal, 14(1), 31.
Bania, R. K., & Halder, A. (2020). R-Ensembler: A greedy rough set based ensemble attribute selection algorithm with kNN imputation for classification of medical data. Computer methods and programs in biomedicine, 184, 105122.
Breiman, L. (2001). Random forests. Machine learning, 45(1), 5-32.
Burgette, L. F., & Reiter, J. P. (2010). Multiple imputation for missing data via sequential regression trees. American journal of epidemiology, 172(9), 1070-1076.
Cheng, C. H., Chang, J. R., & Huang, H. H. (2020). A novel weighted distance threshold method for handling medical missing values. Computers in Biology and Medicine, 122, 103824.
Chen, Q., Meng, Z., Liu, X., Jin, Q., & Su, R. (2018). Decision variants for the automatic determination of optimal feature subset in RF-RFE. Genes, 9(6), 301.
Debastiani, V. J., Bastazini, V. A., & Pillar, V. D. (2021). Using phylogenetic information to impute missing functional trait values in ecological databases. Ecological Informatics, 63, 101315.
Deng, Y., Chang, C., Ido, M. S., & Long, Q. (2016). Multiple imputation for general missing data patterns in the presence of high-dimensional data. Scientific reports, 6(1), 1-10.
De La Iglesia, B. (2013). Evolutionary computation for feature selection in classification problems. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 3(6), 381-407.
Doquire, G., & Verleysen, M. (2012). Feature selection with missing data using mutual information estimators. Neurocomputing, 90, 3-11.
Donders, A. R. T., Van Der Heijden, G. J., Stijnen, T., & Moons, K. G. (2006). A gentle introduction to imputation of missing values. Journal of clinical epidemiology, 59(10), 1087-1091.
Dzulkalnine, M. F., & Sallehuddin, R. (2019). Missing data imputation with fuzzy feature selection for diabetes dataset. SN Applied Sciences, 1(4), 1-12.
Fedushko, S., & Ustyianovych, T. (2019). Medical card data imputation and patient psychological and behavioral profile construction. Procedia Computer Science, 160, 354-361.
Fichman, M., & Cummings, J. N. (2003). Multiple imputation for missing data: Making the most of what you know. Organizational Research Methods, 6(3), 282-308.
García-Laencina, P. J., Sancho-Gómez, J. L., & Figueiras-Vidal, A. R. (2010). Pattern classification with missing data: a review. Neural Computing and Applications, 19(2), 263-282.
García-Laencina, P. J., Abreu, P. H., Abreu, M. H., & Afonso, N. (2015). Missing data imputation on the 5-year survival prediction of breast cancer patients with unknown discrete values. Computers in biology and medicine, 59, 125-133.
Granitto, P. M., Furlanello, C., Biasioli, F., & Gasperi, F. (2006). Recursive feature elimination with random forest for PTR-MS analysis of agroindustrial products. Chemometrics and intelligent laboratory systems, 83(2), 83-90.
Guyon, I., Weston, J., Barnhill, S., & Vapnik, V. (2002). Gene selection for cancer classification using support vector machines. Machine learning, 46(1), 389-422.
Guyon, I., & Elisseeff, A. (2003). An introduction to variable and feature selection. Journal of machine learning research, 3(Mar), 1157-1182.
Hariz, N. B., Khoufi, H., & Zagrouba, E. (2017, June). On Combining Imputation Methods for Handling Missing Data. In International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems (pp. 171-181). Springer, Cham.
Hall, M. A. (1999). Correlation-based feature selection for machine learning.
Huang, S. F., & Cheng, C. H. (2020). A Safe-Region Imputation Method for Handling Medical Data with Missing Values. Symmetry, 12(11), 1792.
Hong, S., & Lynn, H. S. (2020). Accuracy of random-forest-based imputation of missing data in the presence of non-normality, non-linearity, and interaction. BMC medical research methodology, 20(1), 1-12.
Huang, H. H., Liu, X. Y., & Liang, Y. (2016). Feature selection and cancer classification via sparse logistic regression with the hybrid L1/2 +2 regularization. PloS one, 11(5), e0149675.
Huang, C., Mezencev, R., McDonald, J. F., & Vannberg, F. (2017). Open source machine-learning algorithms for the prediction of optimal cancer drug therapies. PLoS One, 12(10), e0186906.
Jerez, J. M., Molina, I., García-Laencina, P. J., Alba, E., Ribelles, N., Martín, M., & Franco, L. (2010). Missing data imputation using statistical and machine learning methods in a real breast cancer problem. Artificial intelligence in medicine, 50(2), 105-115.
Knol, M. J., Janssen, K. J., Donders, A. R. T., Egberts, A. C., Heerdink, E. R., Grobbee, D. E., ... & Geerlings, M. I. (2010). Unpredictable bias when using the missing indicator method or complete case analysis for missing confounder values: an empirical example. Journal of clinical epidemiology, 63(7), 728-736.
Kohavi, R., & John, G. H. (1997). Wrappers for feature subset selection. Artificial intelligence, 97(1-2), 273-324.
Kwak, S. K., & Kim, J. H. (2017). Statistical data preparation: management of missing values and outliers. Korean journal of anesthesiology, 70(4), 407.
Li, X., Peng, S., Chen, J., Lü, B., Zhang, H., & Lai, M. (2012). SVM–T-RFE: A novel gene selection algorithm for identifying metastasis-related genes in colorectal cancer using gene expression profiles. Biochemical and biophysical research communications, 419(2), 148-153.
Li, X., Liu, T., Tao, P., Wang, C., & Chen, L. (2015). A highly accurate protein structural class prediction approach using auto cross covariance transformation and recursive feature elimination. Computational biology and chemistry, 59, 95-100.
Lin, X., Li, C., Zhang, Y., Su, B., Fan, M., & Wei, H. (2018). Selecting feature subsets based on SVM-RFE and the overlapping ratio with applications in bioinformatics. Molecules, 23(1), 52.
Liu, X. Y., Liang, Y., Wang, S., Yang, Z. Y., & Ye, H. S. (2018). A hybrid genetic algorithm with wrapper-embedded approaches for feature selection. IEEE Access, 6, 22863-22874.
Liu, H., & Yu, L. (2005). Toward integrating feature selection algorithms for classification and clustering. IEEE Transactions on knowledge and data engineering, 17(4), 491-502.
Mundra, P. A., & Rajapakse, J. C. (2009). SVM-RFE with MRMR filter for gene selection. IEEE transactions on nanobioscience, 9(1), 31-37.
Naghani, S. Y., Dara, R., Poljak, Z., & Sharif, S. (2019). A review of knowledge discovery process in control and mitigation of avian influenza. Animal health research reviews, 20(1), 61-71.
Pedersen, A. B., Mikkelsen, E. M., Cronin-Fenton, D., Kristensen, N. R., Pham, T. M., Pedersen, L., & Petersen, I. (2017). Missing data and multiple imputation in clinical epidemiological research. Clinical epidemiology, 9, 157.
Peng, H., Long, F., & Ding, C. (2005). Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy. IEEE Transactions on pattern analysis and machine intelligence, 27(8), 1226-1238.
Purwar, A., & Singh, S. K. (2015). Hybrid prediction model with missing value imputation for medical data. Expert Systems with Applications, 42(13), 5621-5631.
Rubin, D. B. (1976). Inference and missing data. Biometrika, 63(3), 581-592.
Sharafoddini, A., Dubin, J. A., Maslove, D. M., & Lee, J. (2019). A new insight into missing data in intensive care unit patient profiles: observational study. JMIR medical informatics, 7(1), e11605.
Sterne, J. A., White, I. R., Carlin, J. B., Spratt, M., Royston, P., Kenward, M. G., ... & Carpenter, J. R. (2009). Multiple imputation for missing data in epidemiological and clinical research: potential and pitfalls. BMJ, 338.
Stekhoven, D. J., & Bühlmann, P. (2012). MissForest—non-parametric missing value imputation for mixed-type data. Bioinformatics, 28(1), 112-118.
Shah, A. D., Bartlett, J. W., Carpenter, J., Nicholas, O., & Hemingway, H. (2014). Comparison of random forest and parametric imputation models for imputing missing data using MICE: a CALIBER study. American journal of epidemiology, 179(6), 764-774.
Su, R., Liu, X., & Wei, L. (2020). MinE-RFE: determine the optimal subset from RFE by minimizing the subset-accuracy–defined energy. Briefings in bioinformatics, 21(2), 687-698.
Su, R., Xiong, S., Zink, D., & Loo, L. H. (2016). High-throughput imaging-based nephrotoxicity prediction for xenobiotics with diverse chemical structures. Archives of toxicology, 90(11), 2793-2808.
Svetnik, V., Liaw, A., Tong, C., & Wang, T. (2004, June). Application of Breiman’s random forest to modeling structure-activity relationships of pharmaceutical molecules. In International Workshop on Multiple Classifier Systems (pp. 334-343). Springer, Berlin, Heidelberg.
Tang, Y., Zhang, Y. Q., & Huang, Z. (2007). Development of two-stage SVM-RFE gene selection strategy for microarray expression data analysis. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 4(3), 365-381.
Torres-Valencia, C., Álvarez-López, M., & Orozco-Gutiérrez, Á. (2017). SVM-based feature selection methods for emotion recognition from multimodal data. Journal on Multimodal User Interfaces, 11(1), 9-23.
Van Wolputte, E., & Blockeel, H. (2020, October). Missing Value Imputation with MERCS: A Faster Alternative to MissForest. In International Conference on Discovery Science (pp. 502-516). Springer, Cham.
Van Buuren, S. (2018). Flexible imputation of missing data. CRC press.
Voyle, N., Keohane, A., Newhouse, S., Lunnon, K., Johnston, C., Soininen, H., ... & Dobson, R. J. (2016). A pathway based classification method for analyzing gene expression for Alzheimer’s disease diagnosis. Journal of Alzheimer’s Disease, 49(3), 659-669.
Waljee, A. K., Mukherjee, A., Singal, A. G., Zhang, Y., Warren, J., Balis, U., ... & Higgins, P. D. (2013). Comparison of imputation methods for missing laboratory data in medicine. BMJ open, 3(8).
Xu, Z., Zhang, H., Wang, Y., Chang, X., & Liang, Y. (2010). L1/2 regularization. Science China Information Sciences, 53(6), 1159-1169.
Yang, Z., Zhuan, B., Yan, Y., Jiang, S., & Wang, T. (2016). Identification of gene markers in the development of smoking-induced lung cancer. Gene, 576(1), 451-457.
Zhang, Z. (2015). Missing values in big data research: some basic skills. Annals of translational medicine, 3(21).
Zhang, S., Gong, L., Zeng, Q., Li, W., Xiao, F., & Lei, J. (2021). Imputation of GPS Coordinate Time Series Using MissForest. Remote Sensing, 13(12), 2312.
Zhu, R., & Kosorok, M. R. (2012). Recursively imputed survival trees. Journal of the American Statistical Association, 107(497), 331-340.
Zhang, X., Yan, C., Gao, C., Malin, B. A., & Chen, Y. (2020). Predicting Missing Values in Medical Data Via XGBoost Regression. Journal of Healthcare Informatics Research, 4(4), 383-394. |