摘要(英) |
Motivation: There are a lot of gene sequence analyses, especially the time after human genome project. The proteomics becomes more and more attractive for biologists. It can bridge the gap between the genome sequence and the cellular behavior. We are concerned about the Mass spectrometry which is high throughput, fast, and accurate. Matrix assisted laser desorption ionization (MALDI) and surface-enhanced laser desorption ionization (SELDI) time of flight (TOF) are two popular technologies in the field of spectrometry. With the peaks detected in spectra, we can compare the normal group with disease. However, the spectrum is complicated and full of noise. Consequently, the preprocessing of the mass data plays an important role during our analysis.
Results: We provide a novel algorithm of preprocessing dealing with the MALDI and SELDI spectrum. The algorithm uses the Hilbert-Huang Transform mainly. We compare the performance of several famous algorithms including PROcess, SpecAlign, and MassSpecWavelet with ours called HHT. The main thought of performance is chiefly visual comparison. We pick the significant peaks and observe the results which the algorithm shows in figure. The results show that HHT for preprocessing is more fitness than others. Not only detecting the peaks, but HHT has the advantage of denoising the spectra, especially for the complex data.
|
參考文獻 |
1. Beyer, S., Y. Walter, et al. (2006). "Comparison of software tools to improve the detection of carcinogen induced changes in the rat liver proteome by analyzing SELDI-TOF-MS spectra." J Proteome Res 5(2): 254-61.
2. Coombes, K. R., J. S. Morris, et al. (2005). "Serum proteomics profiling--a young technology begins to mature." Nat Biotechnol 23(3): 291-2.
3. Cruz-Marcelo, A., R. Guerra, et al. (2008). "Comparison of algorithms for pre-processing of SELDI-TOF mass spectrometry data." Bioinformatics 24(19): 2129-36.
4. DiMagno, E. P., D. Corle, et al. (1989). "Effect of long-term freezer storage, thawing, and refreezing on selected constituents of serum." Mayo Clin Proc 64(10): 1226-34.
5. Du, P., W. A. Kibbe, et al. (2006). "Improved peak detection in mass spectrum by incorporating continuous wavelet transform-based pattern matching." Bioinformatics 22(17): 2059-65.
6. Fung, E. T. and C. Enderwick (2002). "ProteinChip clinical proteomics: computational challenges and solutions." Biotechniques Suppl: 34-8, 40-1.
7. Ge, G. and G. W. Wong (2008). "Classification of premalignant pancreatic cancer mass-spectrometry data using decision tree ensembles." BMC Bioinformatics 9: 275.
8. Hilario, M., A. Kalousis, et al. (2006). "Processing and classification of protein mass spectra." Mass Spectrom Rev 25(3): 409-49.
9. Kwon, D., M. Vannucci, et al. (2008). "A novel wavelet-based thresholding method for the pre-processing of mass spectrometry data that accounts for heterogeneous noise." Proteomics 8(15): 3019-29.
10. Li, X. e. a. (2005). Seldi-tof mass spectrometry protein data. Bioinformatics and Computational Biology Solutions Using R and Bioconductor. R. e. a. In Gentleman. New York, Springer: 99-109.
11. Malyarenko, D. I., W. E. Cooke, et al. (2005). "Enhancement of sensitivity and resolution of surface-enhanced laser desorption/ionization time-of-flight mass spectrometric records for serum peptides using time-series analysis techniques." Clin Chem 51(1): 65-74.
12. Meuleman, W., J. Y. Engwegen, et al. (2008). "Comparison of normalisation methods for surface-enhanced laser desorption and ionisation (SELDI) time-of-flight (TOF) mass spectrometry data." BMC Bioinformatics 9: 88.
13. Morris, J. S., K. R. Coombes, et al. (2005). "Feature extraction and quantification for mass spectrometry in biomedical applications using the mean spectrum." Bioinformatics 21(9): 1764-75.
14. Qu, Y., B. L. Adam, et al. (2003). "Data reduction using a discrete wavelet transform in discriminant analysis of very high dimensionality data." Biometrics 59(1): 143-51.
15. Randolph, T. W. and Y. Yasui (2006). "Multiscale processing of mass spectrometry data." Biometrics 62(2): 589-97.
16. Shin, H. and M. K. Markey (2006). "A machine learning perspective on the development of clinical decision support systems utilizing mass spectra of blood samples." J Biomed Inform 39(2): 227-48.
17. Wong, J. W., G. Cagney, et al. (2005). "SpecAlign--processing and alignment of mass spectra datasets." Bioinformatics 21(9): 2088-90.
|