Title |
NucTools: analysis of chromatin feature occupancy profiles from high-throughput sequencing data
|
---|---|
Published in |
BMC Genomics, February 2017
|
DOI | 10.1186/s12864-017-3580-2 |
Pubmed ID | |
Authors |
Yevhen Vainshtein, Karsten Rippe, Vladimir B. Teif |
Abstract |
Biomedical applications of high-throughput sequencing methods generate a vast amount of data in which numerous chromatin features are mapped along the genome. The results are frequently analysed by creating binary data sets that link the presence/absence of a given feature to specific genomic loci. However, the nucleosome occupancy or chromatin accessibility landscape is essentially continuous. It is currently a challenge in the field to cope with continuous distributions of deep sequencing chromatin readouts and to integrate the different types of discrete chromatin features to reveal linkages between them. Here we introduce the NucTools suite of Perl scripts as well as MATLAB- and R-based visualization programs for a nucleosome-centred downstream analysis of deep sequencing data. NucTools accounts for the continuous distribution of nucleosome occupancy. It allows calculations of nucleosome occupancy profiles averaged over several replicates, comparisons of nucleosome occupancy landscapes between different experimental conditions, and the estimation of the changes of integral chromatin properties such as the nucleosome repeat length. Furthermore, NucTools facilitates the annotation of nucleosome occupancy with other chromatin features like binding of transcription factors or architectural proteins, and epigenetic marks like histone modifications or DNA methylation. The applications of NucTools are demonstrated for the comparison of several datasets for nucleosome occupancy in mouse embryonic stem cells (ESCs) and mouse embryonic fibroblasts (MEFs). The typical workflows of data processing and integrative analysis with NucTools reveal information on the interplay of nucleosome positioning with other features such as for example binding of a transcription factor CTCF, regions with stable and unstable nucleosomes, and domains of large organized chromatin K9me2 modifications (LOCKs). As potential limitations and problems we discuss how inter-replicate variability of MNase-seq experiments can be addressed. |
Twitter Demographics
Geographical breakdown
Country | Count | As % |
---|---|---|
United Kingdom | 3 | 21% |
United States | 3 | 21% |
Canada | 2 | 14% |
France | 2 | 14% |
Russia | 1 | 7% |
Namibia | 1 | 7% |
Germany | 1 | 7% |
Unknown | 1 | 7% |
Demographic breakdown
Type | Count | As % |
---|---|---|
Scientists | 11 | 79% |
Members of the public | 3 | 21% |
Mendeley readers
Geographical breakdown
Country | Count | As % |
---|---|---|
Unknown | 47 | 100% |
Demographic breakdown
Readers by professional status | Count | As % |
---|---|---|
Student > Ph. D. Student | 13 | 28% |
Researcher | 9 | 19% |
Student > Bachelor | 6 | 13% |
Student > Master | 3 | 6% |
Other | 3 | 6% |
Other | 8 | 17% |
Unknown | 5 | 11% |
Readers by discipline | Count | As % |
---|---|---|
Biochemistry, Genetics and Molecular Biology | 20 | 43% |
Agricultural and Biological Sciences | 16 | 34% |
Engineering | 3 | 6% |
Chemistry | 1 | 2% |
Computer Science | 1 | 2% |
Other | 0 | 0% |
Unknown | 6 | 13% |