↓ Skip to main content

Automated tetraploid genotype calling by hierarchical clustering

Overview of attention for article published in Theoretical and Applied Genetics, January 2017
Altmetric Badge

About this Attention Score

  • Average Attention Score compared to outputs of the same age

Mentioned by

twitter
4 X users

Citations

dimensions_citation
61 Dimensions

Readers on

mendeley
76 Mendeley
Title
Automated tetraploid genotype calling by hierarchical clustering
Published in
Theoretical and Applied Genetics, January 2017
DOI 10.1007/s00122-016-2845-5
Pubmed ID
Authors

Cari A. Schmitz Carley, Joseph J. Coombs, David S. Douches, Paul C. Bethke, Jiwan P. Palta, Richard G. Novy, Jeffrey B. Endelman

Abstract

New software to make tetraploid genotype calls from SNP array data was developed, which uses hierarchical clustering and multiple F1 populations to calibrate the relationship between signal intensity and allele dosage. SNP arrays are transforming breeding and genetics research for autotetraploids. To fully utilize these arrays, the relationship between signal intensity and allele dosage must be calibrated for each marker. We developed an improved computational method to automate this process, which is provided as the R package ClusterCall. In the training phase of the algorithm, hierarchical clustering within an F1 population is used to group samples with similar intensity values, and allele dosages are assigned to clusters based on expected segregation ratios. In the prediction phase, multiple F1 populations and the prediction set are clustered together, and the genotype for each cluster is the mode of the training set samples. A concordance metric, defined as the proportion of training set samples equal to the mode, can be used to eliminate unreliable markers and compare different algorithms. Across three potato families genotyped with an 8K SNP array, ClusterCall scored 5729 markers with at least 0.95 concordance (94.6% of its total), compared to 5325 with the software fitTetra (82.5% of its total). The three families were used to predict genotypes for 5218 SNPs in the SolCAP diversity panel, compared with 3521 SNPs in a previous study in which genotypes were called manually. One of the additional markers produced a significant association for vine maturity near a well-known causal locus on chromosome 5. In conclusion, when multiple F1 populations are available, ClusterCall is an efficient method for accurate, autotetraploid genotype calling that enables the use of SNP data for research and plant breeding.

X Demographics

X Demographics

The data shown below were collected from the profiles of 4 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 76 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Netherlands 1 1%
United States 1 1%
Brazil 1 1%
Unknown 73 96%

Demographic breakdown

Readers by professional status Count As %
Student > Ph. D. Student 18 24%
Researcher 17 22%
Student > Master 13 17%
Professor > Associate Professor 6 8%
Student > Doctoral Student 4 5%
Other 6 8%
Unknown 12 16%
Readers by discipline Count As %
Agricultural and Biological Sciences 49 64%
Biochemistry, Genetics and Molecular Biology 6 8%
Business, Management and Accounting 1 1%
Computer Science 1 1%
Earth and Planetary Sciences 1 1%
Other 1 1%
Unknown 17 22%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 3. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 28 April 2017.
All research outputs
#13,675,166
of 23,794,258 outputs
Outputs from Theoretical and Applied Genetics
#2,590
of 3,565 outputs
Outputs of similar age
#208,934
of 425,251 outputs
Outputs of similar age from Theoretical and Applied Genetics
#23
of 30 outputs
Altmetric has tracked 23,794,258 research outputs across all sources so far. This one is in the 42nd percentile – i.e., 42% of other outputs scored the same or lower than it.
So far Altmetric has tracked 3,565 research outputs from this source. They receive a mean Attention Score of 4.9. This one is in the 27th percentile – i.e., 27% of its peers scored the same or lower than it.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 425,251 tracked outputs that were published within six weeks on either side of this one in any source. This one has gotten more attention than average, scoring higher than 50% of its contemporaries.
We're also able to compare this research output to 30 others from the same source and published within six weeks on either side of this one. This one is in the 23rd percentile – i.e., 23% of its contemporaries scored the same or lower than it.