↓ Skip to main content

Neighbor-Dependent Ramachandran Probability Distributions of Amino Acids Developed from a Hierarchical Dirichlet Process Model

Overview of attention for article published in PLoS Computational Biology, April 2010
Altmetric Badge

Mentioned by

wikipedia
2 Wikipedia pages

Citations

dimensions_citation
150 Dimensions

Readers on

mendeley
212 Mendeley
citeulike
3 CiteULike
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
Neighbor-Dependent Ramachandran Probability Distributions of Amino Acids Developed from a Hierarchical Dirichlet Process Model
Published in
PLoS Computational Biology, April 2010
DOI 10.1371/journal.pcbi.1000763
Pubmed ID
Authors

Daniel Ting, Guoli Wang, Maxim Shapovalov, Rajib Mitra, Michael I. Jordan, Roland L. Dunbrack

Abstract

Distributions of the backbone dihedral angles of proteins have been studied for over 40 years. While many statistical analyses have been presented, only a handful of probability densities are publicly available for use in structure validation and structure prediction methods. The available distributions differ in a number of important ways, which determine their usefulness for various purposes. These include: 1) input data size and criteria for structure inclusion (resolution, R-factor, etc.); 2) filtering of suspect conformations and outliers using B-factors or other features; 3) secondary structure of input data (e.g., whether helix and sheet are included; whether beta turns are included); 4) the method used for determining probability densities ranging from simple histograms to modern nonparametric density estimation; and 5) whether they include nearest neighbor effects on the distribution of conformations in different regions of the Ramachandran map. In this work, Ramachandran probability distributions are presented for residues in protein loops from a high-resolution data set with filtering based on calculated electron densities. Distributions for all 20 amino acids (with cis and trans proline treated separately) have been determined, as well as 420 left-neighbor and 420 right-neighbor dependent distributions. The neighbor-independent and neighbor-dependent probability densities have been accurately estimated using Bayesian nonparametric statistical analysis based on the Dirichlet process. In particular, we used hierarchical Dirichlet process priors, which allow sharing of information between densities for a particular residue type and different neighbor residue types. The resulting distributions are tested in a loop modeling benchmark with the program Rosetta, and are shown to improve protein loop conformation prediction significantly. The distributions are available at http://dunbrack.fccc.edu/hdp.

Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 212 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
United States 5 2%
Brazil 2 <1%
Germany 2 <1%
United Kingdom 2 <1%
Canada 2 <1%
Korea, Republic of 1 <1%
Japan 1 <1%
Argentina 1 <1%
Unknown 196 92%

Demographic breakdown

Readers by professional status Count As %
Student > Ph. D. Student 70 33%
Researcher 38 18%
Student > Master 26 12%
Student > Bachelor 14 7%
Student > Doctoral Student 11 5%
Other 31 15%
Unknown 22 10%
Readers by discipline Count As %
Agricultural and Biological Sciences 60 28%
Chemistry 39 18%
Biochemistry, Genetics and Molecular Biology 33 16%
Computer Science 16 8%
Physics and Astronomy 12 6%
Other 30 14%
Unknown 22 10%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 3. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 06 May 2019.
All research outputs
#8,626,132
of 25,604,262 outputs
Outputs from PLoS Computational Biology
#5,669
of 9,014 outputs
Outputs of similar age
#38,667
of 105,221 outputs
Outputs of similar age from PLoS Computational Biology
#31
of 50 outputs
Altmetric has tracked 25,604,262 research outputs across all sources so far. This one is in the 43rd percentile – i.e., 43% of other outputs scored the same or lower than it.
So far Altmetric has tracked 9,014 research outputs from this source. They typically receive a lot more attention than average, with a mean Attention Score of 20.4. This one is in the 33rd percentile – i.e., 33% of its peers scored the same or lower than it.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 105,221 tracked outputs that were published within six weeks on either side of this one in any source. This one is in the 21st percentile – i.e., 21% of its contemporaries scored the same or lower than it.
We're also able to compare this research output to 50 others from the same source and published within six weeks on either side of this one. This one is in the 26th percentile – i.e., 26% of its contemporaries scored the same or lower than it.