Dataset: Groundwater microorganism retroelement protein sequences
Deployment: Valentine_IFRC_site

Protein sequences from diversity-generating retroelements in groundwater microorganisms
Principal Investigator: 
David L. Valentine (University of California-Santa Barbara, UCSB)
Student: 
Blair Paul (University of California-Santa Barbara, UCSB-MSI)
Contact: 
David L. Valentine (University of California-Santa Barbara, UCSB)
BCO-DMO Data Manager: 
Amber D. York (Woods Hole Oceanographic Institution, WHOI BCO-DMO)
Version: 
Description

This dataset includes links to diversity-generating retroelement sequence files in .fasta format. There are separate .fasta files for reverse transcriptase protein sequences and variable protein sequences. These sequences were derived from previously generated sequence accessions at the National Center for Biotechnology Information (NCBI). Original sampling took place near Rifle, Colorado between 2011 and 2012.

To access the .fasta files and the list of source sequences, click the "Get Data" button at the top of this page.
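
Once downloaded, the .fasta files can be read with any standard FASTA parser. The following is a minimal sketch in Python, assuming plain-text FASTA with ">" header lines followed by one or more sequence lines. The function name parse_fasta and the example records are hypothetical and are not taken from this dataset.

```python
from typing import Dict, Iterable, Optional


def parse_fasta(lines: Iterable[str]) -> Dict[str, str]:
    """Parse FASTA-formatted lines into a {header: sequence} mapping.

    Assumes each record starts with a '>' header line; sequence lines
    that follow are concatenated until the next header.
    """
    records: Dict[str, str] = {}
    header: Optional[str] = None
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            header = line[1:]
            records[header] = ""
        elif header is not None:
            records[header] += line
    return records


# Hypothetical records for illustration only (not real dataset content).
example = """>example_RT_1 hypothetical header
MKTLL
AVGHH
>example_RT_2
MSTQ
""".splitlines()

records = parse_fasta(example)
for name, seq in records.items():
    print(name, len(seq))
```

To read a downloaded file instead, pass an open file handle, for example parse_fasta(open(path)), where path is wherever the file was saved. Note that this sketch keeps only the last record if two headers are identical.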

These data are published in the following journal article:
Paul, B.G., Burstein, D., Castelle, C.J., Handa, S., Arambula, D., Czornyj, E., Thomas, B.C., Ghosh, P., Miller, J.F., Banfield, J.F. and Valentine, D.L. (2017) Retroelement-guided protein diversification abounds in vast lineages of Bacteria and Archaea. Nature Microbiology, 2, p.17045. doi: 10.1038/nmicrobiol.2017.45
