Title |
Comparing Reverse Complementary Genomic Words Based on Their Distance Distributions and Frequencies
|
---|---|
Published in |
Interdisciplinary Sciences: Computational Life Sciences, December 2017
|
DOI | 10.1007/s12539-017-0273-0 |
Pubmed ID | |
Authors |
Ana Helena Tavares, Jakob Raymaekers, Peter J. Rousseeuw, Raquel M. Silva, Carlos A. C. Bastos, Armando Pinho, Paula Brito, Vera Afreixo |
Abstract |
In this work, we study reverse complementary genomic word pairs in the human DNA, by comparing both the distance distribution and the frequency of a word to those of its reverse complement. Several measures of dissimilarity between distance distributions are considered, and it is found that the peak dissimilarity works best in this setting. We report the existence of reverse complementary word pairs with very dissimilar distance distributions, as well as word pairs with very similar distance distributions even when both distributions are irregular and contain strong peaks. The association between distribution dissimilarity and frequency discrepancy is also explored, and it is speculated that symmetric pairs combining low and high values of each measure may uncover features of interest. Taken together, our results suggest that some asymmetries in the human genome go far beyond Chargaff's rules. This study uses both the complete human genome and its repeat-masked version. |
X Demographics
Geographical breakdown
Country | Count | As % |
---|---|---|
Portugal | 4 | 57% |
Unknown | 3 | 43% |
Demographic breakdown
Type | Count | As % |
---|---|---|
Members of the public | 5 | 71% |
Scientists | 2 | 29% |
Mendeley readers
Geographical breakdown
Country | Count | As % |
---|---|---|
Unknown | 8 | 100% |
Demographic breakdown
Readers by professional status | Count | As % |
---|---|---|
Professor > Associate Professor | 2 | 25% |
Researcher | 2 | 25% |
Student > Ph. D. Student | 1 | 13% |
Other | 1 | 13% |
Unknown | 2 | 25% |
Readers by discipline | Count | As % |
---|---|---|
Mathematics | 2 | 25% |
Biochemistry, Genetics and Molecular Biology | 2 | 25% |
Social Sciences | 1 | 13% |
Engineering | 1 | 13% |
Unknown | 2 | 25% |