Faculty & Staff Scholarship

A Novel Model for DNA Sequence Similarity Analysis Based on Graph Theory

Xingqin Qi, Shandong University
Qin Wu, West Virginia University
Yusen Zhang, Shandong University
Eddie Fuller, West Virginia University
Cun-Quan Zhang, West Virginia University

Document Type

Article

Publication Date

2011

College/Unit

Eberly College of Arts and Sciences

Department/Program/Center

Mathematics

Abstract

Determination of sequence similarity is one of the major steps in computational phylogenetic studies. As we know, during evolutionary history, not only DNA mutations for individual nucleotide but also subsequent rearrangements occurred. It has been one of major tasks of computational biologists to develop novel mathematical descriptors for similarity analysis such that various mutation phenomena information would be involved simultaneously. In this paper, different from traditional methods (eg, nucleotide frequency, geometric representations) as bases for construction of mathematical descriptors, we construct novel mathematical descriptors based on graph theory. In particular, for each DNA sequence, we will set up a weighted directed graph. The adjacency matrix of the directed graph will be used to induce a representative vector for DNA sequence. This new approach measures similarity based on both ordering and frequency of nucleotides so that much more information is involved. As an application, the method is tested on a set of 0.9-kb mtDNA sequences of twelve different primate species. All output phylogenetic trees with various distance estimations have the same topology, and are generally consistent with the reported results from early studies, which proves the new method's efficiency; we also test the new method on a simulated data set, which shows our new method performs better than traditional global alignment method when subsequent rearrangements happen frequently during evolutionary history.

Digital Commons Citation

Qi, Xingqin; Wu, Qin; Zhang, Yusen; Fuller, Eddie; and Zhang, Cun-Quan, "A Novel Model for DNA Sequence Similarity Analysis Based on Graph Theory" (2011). Faculty & Staff Scholarship. 2788.
https://researchrepository.wvu.edu/faculty_publications/2788

Source Citation

Qi, X., Wu, Q., Zhang, Y., Fuller, E., & Zhang, C.-Q. (2011). A Novel Model for DNA Sequence Similarity Analysis Based on Graph Theory. Evolutionary Bioinformatics, 7, EBO.S7364. https://doi.org/10.4137/ebo.s7364

Comments

Download

COinS

Faculty & Staff Scholarship

A Novel Model for DNA Sequence Similarity Analysis Based on Graph Theory

Document Type

Publication Date

College/Unit

Department/Program/Center

Abstract

Digital Commons Citation

Source Citation

Comments

Browse

Resources

Search

Author Corner

Faculty & Staff Scholarship

A Novel Model for DNA Sequence Similarity Analysis Based on Graph Theory

Authors

Document Type

Publication Date

College/Unit

Department/Program/Center

Abstract

Digital Commons Citation

Source Citation

Comments

Share

Browse

Resources

Search

Author Corner