↓ Skip to main content

The challenge for genetic epidemiologists: how to analyze large numbers of SNPs in relation to complex diseases

Overview of attention for article published in BMC Genomic Data, April 2006
Altmetric Badge

Mentioned by

twitter
1 X user

Citations

dimensions_citation
141 Dimensions

Readers on

mendeley
127 Mendeley
connotea
1 Connotea
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
The challenge for genetic epidemiologists: how to analyze large numbers of SNPs in relation to complex diseases
Published in
BMC Genomic Data, April 2006
DOI 10.1186/1471-2156-7-23
Pubmed ID
Authors

A Geert Heidema, Jolanda MA Boer, Nico Nagelkerke, Edwin CM Mariman, Daphne L van der A, Edith JM Feskens

Abstract

Genetic epidemiologists have taken the challenge to identify genetic polymorphisms involved in the development of diseases. Many have collected data on large numbers of genetic markers but are not familiar with available methods to assess their association with complex diseases. Statistical methods have been developed for analyzing the relation between large numbers of genetic and environmental predictors to disease or disease-related variables in genetic association studies. In this commentary we discuss logistic regression analysis, neural networks, including the parameter decreasing method (PDM) and genetic programming optimized neural networks (GPNN) and several non-parametric methods, which include the set association approach, combinatorial partitioning method (CPM), restricted partitioning method (RPM), multifactor dimensionality reduction (MDR) method and the random forests approach. The relative strengths and weaknesses of these methods are highlighted. Logistic regression and neural networks can handle only a limited number of predictor variables, depending on the number of observations in the dataset. Therefore, they are less useful than the non-parametric methods to approach association studies with large numbers of predictor variables. GPNN on the other hand may be a useful approach to select and model important predictors, but its performance to select the important effects in the presence of large numbers of predictors needs to be examined. Both the set association approach and random forests approach are able to handle a large number of predictors and are useful in reducing these predictors to a subset of predictors with an important contribution to disease. The combinatorial methods give more insight in combination patterns for sets of genetic and/or environmental predictor variables that may be related to the outcome variable. As the non-parametric methods have different strengths and weaknesses we conclude that to approach genetic association studies using the case-control design, the application of a combination of several methods, including the set association approach, MDR and the random forests approach, will likely be a useful strategy to find the important genes and interaction patterns involved in complex diseases.

X Demographics

X Demographics

The data shown below were collected from the profile of 1 X user who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 127 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
United States 7 6%
Spain 3 2%
Brazil 2 2%
Germany 2 2%
United Kingdom 2 2%
Italy 1 <1%
Belgium 1 <1%
Ireland 1 <1%
Netherlands 1 <1%
Other 1 <1%
Unknown 106 83%

Demographic breakdown

Readers by professional status Count As %
Researcher 34 27%
Student > Ph. D. Student 27 21%
Student > Master 16 13%
Professor > Associate Professor 11 9%
Professor 6 5%
Other 23 18%
Unknown 10 8%
Readers by discipline Count As %
Agricultural and Biological Sciences 30 24%
Medicine and Dentistry 23 18%
Computer Science 23 18%
Mathematics 9 7%
Biochemistry, Genetics and Molecular Biology 7 6%
Other 21 17%
Unknown 14 11%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 1. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 23 June 2013.
All research outputs
#17,285,668
of 25,373,627 outputs
Outputs from BMC Genomic Data
#668
of 1,204 outputs
Outputs of similar age
#75,554
of 84,845 outputs
Outputs of similar age from BMC Genomic Data
#5
of 5 outputs
Altmetric has tracked 25,373,627 research outputs across all sources so far. This one is in the 21st percentile – i.e., 21% of other outputs scored the same or lower than it.
So far Altmetric has tracked 1,204 research outputs from this source. They receive a mean Attention Score of 4.3. This one is in the 34th percentile – i.e., 34% of its peers scored the same or lower than it.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 84,845 tracked outputs that were published within six weeks on either side of this one in any source. This one is in the 5th percentile – i.e., 5% of its contemporaries scored the same or lower than it.
We're also able to compare this research output to 5 others from the same source and published within six weeks on either side of this one.