Title |
Comparison of Clustering Methods for Investigation of Genome-Wide Methylation Array Data
|
---|---|
Published in |
Frontiers in Genetics, January 2011
|
DOI | 10.3389/fgene.2011.00088 |
Pubmed ID | |
Authors |
Harry Clifford, Frank Wessely, Satish Pendurthi, Richard D. Emes |
Abstract |
The use of genome-wide methylation arrays has proved very informative to investigate both clinical and biological questions in human epigenomics. The use of clustering methods either for exploration of these data or to compare to an a priori grouping, e.g., normal versus disease allows assessment of groupings of data without user bias. However no consensus on the methods to use for clustering of methylation array approaches has been reached. To determine the most appropriate clustering method for analysis of illumina array methylation data, a collection of data sets was simulated and used to compare clustering methods. Both hierarchical clustering and non-hierarchical clustering methods (k-means, k-medoids, and fuzzy clustering algorithms) were compared using a range of distance and linkage methods. As no single method consistently outperformed others across different simulations, we propose a method to capture the best clustering outcome based on an additional measure, the silhouette width. This approach produced a consistently higher cluster accuracy compared to using any one method in isolation. |
X Demographics
Geographical breakdown
Country | Count | As % |
---|---|---|
United States | 1 | 33% |
Australia | 1 | 33% |
Switzerland | 1 | 33% |
Demographic breakdown
Type | Count | As % |
---|---|---|
Scientists | 2 | 67% |
Members of the public | 1 | 33% |
Mendeley readers
Geographical breakdown
Country | Count | As % |
---|---|---|
United States | 3 | 3% |
United Kingdom | 2 | 2% |
Uruguay | 1 | 1% |
Spain | 1 | 1% |
Germany | 1 | 1% |
Unknown | 90 | 92% |
Demographic breakdown
Readers by professional status | Count | As % |
---|---|---|
Researcher | 22 | 22% |
Student > Ph. D. Student | 18 | 18% |
Student > Master | 11 | 11% |
Student > Bachelor | 7 | 7% |
Professor > Associate Professor | 6 | 6% |
Other | 13 | 13% |
Unknown | 21 | 21% |
Readers by discipline | Count | As % |
---|---|---|
Biochemistry, Genetics and Molecular Biology | 19 | 19% |
Agricultural and Biological Sciences | 15 | 15% |
Medicine and Dentistry | 12 | 12% |
Engineering | 7 | 7% |
Computer Science | 5 | 5% |
Other | 14 | 14% |
Unknown | 26 | 27% |