Title |
Identification and removal of low-complexity sites in allele-specific analysis of ChIP-seq data
|
---|---|
Published in |
Bioinformatics, November 2013
|
DOI | 10.1093/bioinformatics/btt667 |
Pubmed ID | |
Authors |
Sebastian M. Waszak, Helena Kilpinen, Andreas R. Gschwind, Andrea Orioli, Sunil K. Raghav, Robert M. Witwicki, Eugenia Migliavacca, Alisa Yurovsky, Tuuli Lappalainen, Nouria Hernandez, Alexandre Reymond, Emmanouil T. Dermitzakis, Bart Deplancke |
Abstract |
High-throughput sequencing technologies enable the genome-wide analysis of the impact of genetic variation on molecular phenotypes at unprecedented resolution. However, although powerful, these technologies can also introduce unexpected artifacts. Results: We investigated the impact of library amplification bias on the identification of allele-specific (AS) molecular events from high-throughput sequencing data derived from chromatin immunoprecipitation assays (ChIP-seq). Putative AS DNA binding activity for RNA polymerase II was determined using ChIP-seq data derived from lymphoblastoid cell lines of two parent-daughter trios. We found that, at high-sequencing depth, many significant AS binding sites suffered from an amplification bias, as evidenced by a larger number of clonal reads representing one of the two alleles. To alleviate this bias, we devised an amplification bias detection strategy, which filters out sites with low read complexity and sites featuring a significant excess of clonal reads. This method will be useful for AS analyses involving ChIP-seq and other functional sequencing assays. |
X Demographics
Geographical breakdown
Country | Count | As % |
---|---|---|
Switzerland | 1 | 17% |
Germany | 1 | 17% |
Norway | 1 | 17% |
France | 1 | 17% |
India | 1 | 17% |
Unknown | 1 | 17% |
Demographic breakdown
Type | Count | As % |
---|---|---|
Scientists | 3 | 50% |
Members of the public | 3 | 50% |
Mendeley readers
Geographical breakdown
Country | Count | As % |
---|---|---|
United States | 2 | 6% |
France | 1 | 3% |
Unknown | 31 | 91% |
Demographic breakdown
Readers by professional status | Count | As % |
---|---|---|
Researcher | 14 | 41% |
Student > Ph. D. Student | 7 | 21% |
Professor | 5 | 15% |
Student > Master | 2 | 6% |
Student > Doctoral Student | 1 | 3% |
Other | 5 | 15% |
Readers by discipline | Count | As % |
---|---|---|
Agricultural and Biological Sciences | 19 | 56% |
Biochemistry, Genetics and Molecular Biology | 11 | 32% |
Mathematics | 3 | 9% |
Computer Science | 1 | 3% |