摘要(英) |
The protein thermostability information is closely related to production of many biomaterials. Recent developments in research on the proteins thermostability find out the significant features for thermal stability of protein according to comparisons between homologous proteins. The amino acid composition, special pattern in sequence information and hydrogen bond, disulfide bond, salt bridges and so on in protein structure are considered important for thermostability. In this study, we present a system to integrate various factors to predict protein thermostability. In our research, a large number of proteins are from PGTdb and PDB. To start with, fetch out various features form sequences and structures. Then, feature selection algorithm is used to filter the features that have higher linear correlation coefficient to thermostability. Lastly, we apply these features to machine learning approach to built a predict system. In this research we discover two features, i.e., (E+F+M+R)/residue and charged/noncharged have linear correlation to thermostability. We finally establish two predict systems, one can predict protein thermostability by inputting protein sequences only, and the other can get better performance if the protein structure is known. |
參考文獻 |
Baumgartner, C., C. Bohm, et al. (2004). "Supervised machine learning techniques for the classification of metabolic disorders in newborns." Bioinformatics 20(17): 2985-96.
Berman, H. M., J. Westbrook, et al. (2000). "The Protein Data Bank." Nucleic Acids Res 28(1): 235-42.
Brown, T. A. (2002). Genomes -2nd ed.
Chan, C. H., H. K. Liang, et al. (2004). "Relationship between local structural entropy and protein thermostabilty." Proteins 57(4): 684-91.
Dalton, J. A., I. Michalopoulos, et al. (2003). "Calculation of helix packing angles in protein structures." Bioinformatics 19(10): 1298-9.
Dominy, B. N., H. Minoux, et al. (2004). "An electrostatic basis for the stability of thermophilic proteins." Proteins 57(1): 128-41.
Farias, S. T., M. G. van der Linden, et al. (2004). "Thermo-search: lifestyle and thermostability analysis." In Silico Biol 4(3): 377-80.
Gianese, G., F. Bossa, et al. (2002). "Comparative structural analysis of psychrophilic and meso- and thermophilic enzymes." Proteins 47(2): 236-49.
Gromiha, M. M., M. Oobatake, et al. (1999). "Important amino acid properties for enhanced thermostability from mesophilic to thermophilic proteins." Biophys Chem 82(1): 51-67.
Haney, P. J., M. Stees, et al. (1999). "Analysis of thermal stabilizing interactions in mesophilic and thermophilic adenylate kinases from the genus Methanococcus." J Biol Chem 274(40): 28453-8.
Huang, S. L., L. C. Wu, et al. (2004). "PGTdb: a database providing growth temperatures of prokaryotes." Bioinformatics 20(2): 276-8.
Kannan, N. and S. Vishveshwara (2000). "Aromatic clusters: a determinant of thermal stability of thermophilic proteins." Protein Eng 13(11): 753-61.
Liang, H. K., C. M. Huang, et al. (2005). "Amino acid coupling patterns in thermophilic proteins." Proteins 59(1): 58-63.
M.Kamber, J. H. Dara-Mining Concepts and Techniques.
Matthews, X. Z. a. B. W. (1995). "EdPDB: A Multi-Functional Tool for Protein Structure Analysis." J. Appl. Cryst. 28: 624-630.
McDonald, I. K. and J. M. Thornton (1994). "Satisfying hydrogen bonding potential in proteins." J Mol Biol 238(5): 777-93.
Parthasarathy, S. and M. R. Murthy (2000). "Protein thermal stability: insights from atomic displacement parameters (B values)." Protein Eng 13(1): 9-13.
Petukhov, M., Y. Kil, et al. (1997). "Insights into thermal resistance of proteins from the intrinsic stability of their alpha-helices." Proteins 29(3): 309-20.
Ragone, R. (2001). "Hydrogen-bonding classes in proteins and their contribution to the unfolding reaction." Protein Sci 10(10): 2075-82.
Shir-Ly Huang, L.-C. W., Hsien-Da Huang, Han-Kuen Liang, Ming-Tat Ko, and Jorng-Tzong Horng (2004). "A Probabilistic Method to Correlate Ion-pairs to Protein Thermostability." Applied Bioformatics 3(1): 21-29.
Szilagyi, A. and P. Zavodszky (2000). "Structural differences between mesophilic, moderately thermophilic and extremely thermophilic protein subunits: results of a comprehensive survey." Structure Fold Des 8(5): 493-504.
Vieille, C. and G. J. Zeikus (2001). "Hyperthermophilic enzymes: sources, uses, and molecular mechanisms for thermostability." Microbiol Mol Biol Rev 65(1): 1-43.
Vogt, G., S. Woell, et al. (1997). "Protein thermal stability, hydrogen bonds, and ion pairs." J Mol Biol 269(4): 631-43. |