dc.description.abstract | So far extensive studies are being performed to associate phenotypic differences、disease susceptibility and pathogenesis of cancer with genetic variation. And at a genome-wide scale, since copy number variation(CNV)regions cover more content of the genome, suggesting the importance of CNV in one form of genetic diversity. Hence, detection of CNV in whole-genome DNA become an important method for understanding genetics. However, owing to the difficulty in obtaining human tissue samples and more expensive costs, such that a global assessment of CNVs exists challenges. To overcome this limitation, we use a reliable technology, mouse whole genome CGH microarray, as our biological model source, and there are complete
information of mouse strains in databases. Additionally, compare occupied on overall less memory in previous data, we trend to execute appropriate algorithms towards even larger sample sizes and higher resolution microarrays in follow-up analysis. We compare two different algorithms-Pair-wised Gaussian Merging(PGM)and Hidden Markov Model(HMM)-to detect copy number variations of mouse genome. In addition to results of two algorithms are significantly different, we further analyze advantages and defections of two algorithms, and we try to choose some more appropriate parameters in PGM. Although, those are two different kinds of theory to support two algorithms, result in different strategies of detections. Specifically compared to HMM, results of PGM exist wider distribution of size of CNV-regions, however, counts of CNV-regions are lower. We suggest one of reasons is, HMM is a stochastic generative model for time series defined by a finite set of states, and the probabilities depend on training(past).Therefore, the segmental lengths of predicted CNVs of HMM are similar. In other words, compare to PGM, prediction of HMM is a narrower distribution of segmental lengths of CNVs. We could say, this is a random variable; however, we know the spatial distributions of biology are the non-random association. On the other hand, we don’’t make a random assumption in PGM. Based on the view, we suggest that the results of PGM are more precise. This inference with before, compared PGM to other algorithms(include HMM)are coincident. Finally, in this research, we conclude much divergent in results of these two methods, but for ultimately evaluation still depends on some experiments in the future.
| en_US |