Visitor, an informatic pipeline for analysis of viral siRNA sequencing datasets.
Methods in molecular biology, March 2011
Antoniewski C, Christophe Antoniewski
High-throughput sequencing emerged as a powerful approach to characterize siRNA populations -generated by hosts in response to viral infections. Here we described an informatic pipeline visitor to analyze in-house large sequencing datasets generated from Illumina sequencing of Drosophila small RNA libraries. The visitor perl script is designed to treat fastq sequence datasets from the Illumina sequencing platform, using a computer running under a UNIX compliant operating system (MacOS X, Linux, etc.). visitor first generates a detailed report of the sequence quality of the Illumina run. Then, using the Novoalign software, the script removes reads that match with the D. melanogaster genome from the sequencing data set. The remaining reads are aligned to a viral reference library, which can contain one or several virus genomes. visitor provides a hit table of identified viral siRNAs as well as graphics eps files of viral siRNA profiles. Unmatched small RNAs are also available in a fast format for de novo assembly and new virus discovery.
|Readers by professional status||Count||As %|
|Student > Bachelor||3||23%|
|Professor > Associate Professor||2||15%|
|Student > Master||2||15%|
|Readers by discipline||Count||As %|
|Biochemistry, Genetics and Molecular Biology||4||31%|
|Agricultural and Biological Sciences||4||31%|
|Medicine and Dentistry||1||8%|