ORegAnno: an open-access community-driven resource for regulatory annotation
Nucleic Acids Research, December 2007
O. L. Griffith, S. B. Montgomery, B. Bernier, B. Chu, K. Kasaian, S. Aerts, S. Mahony, M. C. Sleumer, M. Bilenky, M. Haeussler, M. Griffith, S. M. Gallo, B. Giardine, B. Hooghe, P. Van Loo, E. Blanco, A. Ticoll, S. Lithwick, E. Portales-Casamar, I. J. Donaldson, G. Robertson, C. Wadelius, P. De Bleser, D. Vlieghe, M. S. Halfon, W. Wasserman, R. Hardison, C. M. Bergman, S. J.M. Jones, Griffith, Obi L, Montgomery, Stephen B, Bernier, Bridget, Chu, Bryan, Kasaian, Katayoon, Aerts, Stein, Mahony, Shaun, Sleumer, Monica C, Bilenky, Mikhail, Haeussler, Maximilian, Griffith, Malachi, Gallo, Steven M, Giardine, Belinda, Hooghe, Bart, Van Loo, Peter, Blanco, Enrique, Ticoll, Amy, Lithwick, Stuart, Portales-Casamar, Elodie, Donaldson, Ian J, Robertson, Gordon, Wadelius, Claes, De Bleser, Pieter, Vlieghe, Dominique, Halfon, Marc S, Wasserman, Wyeth, Hardison, Ross, Bergman, Casey M, Jones, Steven J M, , , Griffith, Obi L., Montgomery, Stephen B., Sleumer, Monica C., Gallo, Steven M., Donaldson, Ian J., Halfon, Marc S., Bergman, Casey M., Jones, Steven J.M.
ORegAnno is an open-source, open-access database and literature curation system for community-based annotation of experimentally identified DNA regulatory regions, transcription factor binding sites and regulatory variants. The current release comprises 30 145 records curated from 922 publications and describing regulatory sequences for over 3853 genes and 465 transcription factors from 19 species. A new feature called the 'publication queue' allows users to input relevant papers from scientific literature as targets for annotation. The queue contains 4438 gene regulation papers entered by experts and another 54 351 identified by text-mining methods. Users can enter or 'check out' papers from the queue for manual curation using a series of user-friendly annotation pages. A typical record entry consists of species, sequence type, sequence, target gene, binding factor, experimental outcome and one or more lines of experimental evidence. An evidence ontology was developed to describe and categorize these experiments. Records are cross-referenced to Ensembl or Entrez gene identifiers, PubMed and dbSNP and can be visualized in the Ensembl or UCSC genome browsers. All data are freely available through search pages, XML data dumps or web services at: http://www.oreganno.org.
|Readers by professional status||Count||As %|
|Student > Ph. D. Student||39||24%|
|Student > Master||18||11%|
|Professor > Associate Professor||18||11%|
|Student > Bachelor||10||6%|
|Readers by discipline||Count||As %|
|Agricultural and Biological Sciences||93||58%|
|Biochemistry, Genetics and Molecular Biology||20||13%|
|Medicine and Dentistry||10||6%|