↓ Skip to main content

Long-Branch Attraction Bias and Inconsistency in Bayesian Phylogenetics

Overview of attention for article published in PLOS ONE, December 2009
Altmetric Badge

About this Attention Score

  • In the top 25% of all research outputs scored by Altmetric
  • High Attention Score compared to outputs of the same age (95th percentile)
  • High Attention Score compared to outputs of the same age and source (88th percentile)

Mentioned by

blogs
2 blogs
wikipedia
3 Wikipedia pages

Citations

dimensions_citation
74 Dimensions

Readers on

mendeley
293 Mendeley
citeulike
7 CiteULike
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
Long-Branch Attraction Bias and Inconsistency in Bayesian Phylogenetics
Published in
PLOS ONE, December 2009
DOI 10.1371/journal.pone.0007891
Pubmed ID
Authors

Bryan Kolaczkowski, Joseph W. Thornton

Abstract

Bayesian inference (BI) of phylogenetic relationships uses the same probabilistic models of evolution as its precursor maximum likelihood (ML), so BI has generally been assumed to share ML's desirable statistical properties, such as largely unbiased inference of topology given an accurate model and increasingly reliable inferences as the amount of data increases. Here we show that BI, unlike ML, is biased in favor of topologies that group long branches together, even when the true model and prior distributions of evolutionary parameters over a group of phylogenies are known. Using experimental simulation studies and numerical and mathematical analyses, we show that this bias becomes more severe as more data are analyzed, causing BI to infer an incorrect tree as the maximum a posteriori phylogeny with asymptotically high support as sequence length approaches infinity. BI's long branch attraction bias is relatively weak when the true model is simple but becomes pronounced when sequence sites evolve heterogeneously, even when this complexity is incorporated in the model. This bias--which is apparent under both controlled simulation conditions and in analyses of empirical sequence data--also makes BI less efficient and less robust to the use of an incorrect evolutionary model than ML. Surprisingly, BI's bias is caused by one of the method's stated advantages--that it incorporates uncertainty about branch lengths by integrating over a distribution of possible values instead of estimating them from the data, as ML does. Our findings suggest that trees inferred using BI should be interpreted with caution and that ML may be a more reliable framework for modern phylogenetic analysis.

Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 293 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
United States 22 8%
Brazil 6 2%
Spain 4 1%
Germany 2 <1%
Colombia 2 <1%
United Kingdom 2 <1%
Poland 2 <1%
Canada 2 <1%
Sweden 1 <1%
Other 9 3%
Unknown 241 82%

Demographic breakdown

Readers by professional status Count As %
Researcher 92 31%
Student > Ph. D. Student 72 25%
Student > Master 27 9%
Professor > Associate Professor 23 8%
Professor 12 4%
Other 48 16%
Unknown 19 6%
Readers by discipline Count As %
Agricultural and Biological Sciences 216 74%
Biochemistry, Genetics and Molecular Biology 17 6%
Computer Science 7 2%
Environmental Science 6 2%
Earth and Planetary Sciences 6 2%
Other 14 5%
Unknown 27 9%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 19. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 14 February 2014.
All research outputs
#1,662,670
of 22,705,019 outputs
Outputs from PLOS ONE
#21,552
of 193,828 outputs
Outputs of similar age
#7,631
of 165,060 outputs
Outputs of similar age from PLOS ONE
#68
of 568 outputs
Altmetric has tracked 22,705,019 research outputs across all sources so far. Compared to these this one has done particularly well and is in the 92nd percentile: it's in the top 10% of all research outputs ever tracked by Altmetric.
So far Altmetric has tracked 193,828 research outputs from this source. They typically receive a lot more attention than average, with a mean Attention Score of 15.0. This one has done well, scoring higher than 88% of its peers.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 165,060 tracked outputs that were published within six weeks on either side of this one in any source. This one has done particularly well, scoring higher than 95% of its contemporaries.
We're also able to compare this research output to 568 others from the same source and published within six weeks on either side of this one. This one has done well, scoring higher than 88% of its contemporaries.