Different strategies were applied to assemble the reads of the individual cells, and later to combine single cells Dsc # 2 and # 3, in order to get the most out of the sequencing data. Statistics were checked with assemblathon. Since good assembly statistics do not automatically hold true that the assembly is optimal assemblies were always run through the RAST pipeline to check for misassemblies. SAGs were assembled by:<\/p>\n
A. CLC bio
\nB. spades 2.3
\nC. spades-n
\nD. velvet-sc, kmer=37
\nE. velvet-sc n
\nF. Celera (CA)
\nG. Hybrid error correction method using CA assembled Illumina\u00ae data to correct long PacBio\u00ae reads
\nH. velvet assembly using Euler correction, kmer=55
\nI. spades assembly of Illumina\u00ae-only combined via PCAP with CA assembly of PacBio corrected by PacBio only
\nJ. velvet-sc assembly of Illumina\u00ae-only combined via PCAP with CA assembly of PacBio\u00ae corrected by PacBio\u00ae only
\nK. velvet-sc assembly of Illumina\u00ae-only combined via PCAP with CA assembly of PacBio\u00ae corrected by Illumina\u00ae-only
\nL. spades assembly of Illumina\u00ae-only combined via PCAP with CA assembly of PacBio corrected by Illumina\u00ae only n = Normalization of the Illumina\u00ae reads<\/p>\n
Single cells 2 and 3 were assembled together since they showed almost 100% identity at the nucleotide level after individual assembly. At this stage, a 0.32-Mb assembly was contained in 126 contigs for single cell 1 (Dsc1) and a 1.38-Mb assembly in 327 contigs for the co-assembly of single cells 2 and 3 (DscP2).<\/p>\n
Assembled contigs were submitted to the Integrated Microbial Genomes database annotation pipeline (IMG, version 4.1) and to the Rapid Annotations using Subsystems Technology pipeline (RAST, version 4.0) in 2013. Some computationally assigned annotations were manually changed based on the inspection of evidence for the assigned annotations, orthologs in related genomes and gene neighborhoods. Pathways were predicted using RAST, IMG and KEGG (Kyoto Encyclopedia of Genes and Genomes). Nucleotide and amino-acid sequences of genes were blasted as query sequences against the NCBI databases.<\/p><\/div>","@type":"rdf:HTML"}],"http:\/\/purl.org\/dc\/terms\/identifier":[{"@value":"637878","@type":"xsd:int"}],"http:\/\/purl.org\/dc\/terms\/title":[{"@value":"Peru Margin SAGs"}],"http:\/\/purl.org\/dc\/terms\/date":[{"@value":"2016-02-04T13:46:32-05:00","@type":"xsd:dateTime"}],"http:\/\/purl.org\/dc\/terms\/created":[{"@value":"2016-02-04T13:46:32-05:00","@type":"xsd:dateTime"}],"http:\/\/purl.org\/dc\/terms\/modified":[{"@value":"2023-07-07T16:10:26-04:00","@type":"xsd:dateTime"}],"http:\/\/rdfs.org\/ns\/void#inDataset":[{"@id":"http:\/\/www.bco-dmo.org\/"}],"http:\/\/ocean-data.org\/schema\/namedGraph":[{"@value":"urn:bcodmo:dataset:637878","@type":"xsd:token"}],"http:\/\/ocean-data.org\/schema\/osprey_page":[{"@id":"https:\/\/www.bco-dmo.org\/dataset\/637878"}],"http:\/\/ocean-data.org\/schema\/identifier":[{"@value":"_:Identifier637878"}],"http:\/\/ocean-data.org\/schema\/datasetTitle":[{"@value":"Sub-seafloor single amplified genomes (SAGs) from anaerobic Peru Margin sediment collected on R\/V JOIDES Resolution cruise JRES-201 in 2002","@language":"en-US"}],"http:\/\/ocean-data.org\/schema\/abstract":[{"@value":"","@language":"en-US"}],"http:\/\/purl.org\/dc\/terms\/rights":[{"@id":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"http:\/\/ocean-data.org\/schema\/deprecated":[{"@value":"false","@type":"xsd:boolean"}],"http:\/\/purl.org\/dc\/terms\/bibliographicCitation":[{"@value":"Spormann, A. M. (2016) Sub-seafloor single amplified genomes (SAGs) from anaerobic Peru Margin sediment collected on R\/V JOIDES Resolution cruise JRES-201 in 2002. Biological and Chemical Oceanography Data Management Office (BCO-DMO). (Version 04 Feb 2016) Version Date 2016-02-04 [if applicable, indicate subset used]. http:\/\/lod.bco-dmo.org\/id\/dataset\/637878 [access date]","@type":"xsd:string"}],"http:\/\/ocean-data.org\/schema\/validated":[{"@value":"true","@type":"xsd:boolean"}],"http:\/\/ocean-data.org\/schema\/versionLabel":[{"@value":"04 Feb 2016","@type":"xsd:string"}],"http:\/\/ocean-data.org\/schema\/currentState":[{"@id":"http:\/\/lod.bco-dmo.org\/id\/dataset-current-state\/7"}],"http:\/\/ocean-data.org\/schema\/nodcTopic":[{"@id":"http:\/\/lod.bco-dmo.org\/id\/nodc-dataset-topic\/150"},{"@id":"http:\/\/lod.bco-dmo.org\/id\/nodc-dataset-topic\/156"}],"http:\/\/ocean-data.org\/schema\/datasetType":[{"@id":"http:\/\/lod.bco-dmo.org\/id\/dataset-type\/172"}],"http:\/\/ocean-data.org\/schema\/restricted":[{"@value":"false","@type":"xsd:boolean"}],"http:\/\/ocean-data.org\/schema\/hasAward":[{"@id":"http:\/\/lod.bco-dmo.org\/id\/award\/554980"}],"http:\/\/ocean-data.org\/schema\/storesValuesFor":[{"@id":"http:\/\/lod.bco-dmo.org\/id\/dataset-parameter\/637886"},{"@id":"http:\/\/lod.bco-dmo.org\/id\/dataset-parameter\/637887"},{"@id":"http:\/\/lod.bco-dmo.org\/id\/dataset-parameter\/637888"},{"@id":"http:\/\/lod.bco-dmo.org\/id\/dataset-parameter\/637889"},{"@id":"http:\/\/lod.bco-dmo.org\/id\/dataset-parameter\/637890"},{"@id":"http:\/\/lod.bco-dmo.org\/id\/dataset-parameter\/637891"}],"http:\/\/ocean-data.org\/schema\/hasAgentWithRole":[{"@id":"http:\/\/lod.bco-dmo.org\/id\/person-role\/637883"},{"@id":"http:\/\/lod.bco-dmo.org\/id\/person-role\/637884"}],"http:\/\/purl.org\/dc\/terms\/language":[{"@value":"http:\/\/id.loc.gov\/vocabulary\/iso639-1\/en","@type":"xsd:anyURI"}],"http:\/\/xmlns.com\/foaf\/0.1\/homepage":[{"@id":"https:\/\/www.bco-dmo.org\/dataset\/637878"}],"http:\/\/purl.org\/dc\/terms\/issued":[{"@value":"2016-02-04","@type":"xsd:date"}],"http:\/\/purl.org\/dc\/terms\/publisher":[{"@id":"http:\/\/lod.bco-dmo.org\/id\/affiliation\/191"}],"http:\/\/www.w3.org\/ns\/dcat#contactPoint":[{"@id":"http:\/\/lod.bco-dmo.org\/id\/person\/636469"}]},"_:html637878":{"@id":"_:html637878","@type":["http:\/\/ocean-data.org\/schema\/MetadataViewAffordance"],"http:\/\/schema.org\/subjectOf":[{"@id":"http:\/\/lod.bco-dmo.org\/id\/dataset\/637878"}],"http:\/\/schema.org\/name":[{"@value":"HTML","@type":"xsd:string"}],"http:\/\/ocean-data.org\/schema\/affordedBy":[{"@id":"http:\/\/lod.bco-dmo.org\/id\/affiliation\/191"}],"http:\/\/schema.org\/target":[{"@value":"_:html637878entryPoint"}]},"_:html637878entryPoint":{"@id":"_:html637878entryPoint","@type":["http:\/\/schema.org\/EntryPoint","http:\/\/ocean-data.org\/schema\/HtmlLandingPage"],"http:\/\/schema.org\/url":[{"@value":"https:\/\/www.bco-dmo.org\/dataset\/637878","@type":"xsd:anyURI"}],"http:\/\/schema.org\/contentType":[{"@value":"text\/html","@type":"xsd:token"}]},"_:frictionlessdata637878":{"@id":"_:frictionlessdata637878","@type":["http:\/\/ocean-data.org\/schema\/MetadataDownloadAffordance"],"http:\/\/schema.org\/subjectOf":[{"@id":"http:\/\/lod.bco-dmo.org\/id\/dataset\/637878"}],"http:\/\/schema.org\/name":[{"@value":"Datapackage.json","@type":"xsd:string"}],"http:\/\/schema.org\/alternateName":[{"@value":"Frictionless Data Package","@type":"xsd:string"}],"http:\/\/ocean-data.org\/schema\/affordedBy":[{"@id":"http:\/\/lod.bco-dmo.org\/id\/affiliation\/191"}],"http:\/\/schema.org\/target":[{"@value":"_:frictionlessdata637878entryPoint"}]},"_:frictionlessdata637878entryPoint":{"@id":"_:frictionlessdata637878entryPoint","@type":["http:\/\/schema.org\/EntryPoint"],"http:\/\/schema.org\/url":[{"@value":"https:\/\/www.bco-dmo.org\/dataset\/637878\/datapackage.json","@type":"xsd:anyURI"}],"http:\/\/schema.org\/contentType":[{"@value":"application\/vnd.datapackage+json","@type":"xsd:token"}]},"_:datasetdescription637878":{"@id":"_:datasetdescription637878","@type":["http:\/\/ocean-data.org\/schema\/MetadataDownloadAffordance"],"http:\/\/schema.org\/subjectOf":[{"@id":"http:\/\/lod.bco-dmo.org\/id\/dataset\/637878"}],"http:\/\/schema.org\/name":[{"@value":"PDF","@type":"xsd:string"}],"http:\/\/ocean-data.org\/schema\/affordedBy":[{"@id":"http:\/\/lod.bco-dmo.org\/id\/affiliation\/191"}],"http:\/\/schema.org\/target":[{"@value":"_:datasetdescription637878entryPoint"}]},"_:datasetdescription637878entryPoint":{"@id":"_:datasetdescription637878entryPoint","@type":["http:\/\/schema.org\/EntryPoint","http:\/\/ocean-data.org\/schema\/PdfDatasetDescription"],"http:\/\/schema.org\/url":[{"@value":"https:\/\/www.bco-dmo.org\/dataset\/637878\/Dataset_description.pdf","@type":"xsd:anyURI"}],"http:\/\/schema.org\/contentType":[{"@value":"application\/pdf","@type":"xsd:token"}]},"_:json637878":{"@id":"_:json637878","@type":["http:\/\/ocean-data.org\/schema\/MetadataDownloadAffordance"],"http:\/\/schema.org\/subjectOf":[{"@id":"http:\/\/lod.bco-dmo.org\/id\/dataset\/637878"}],"http:\/\/schema.org\/name":[{"@value":"JSON-LD","@type":"xsd:string"}],"http:\/\/ocean-data.org\/schema\/affordedBy":[{"@id":"http:\/\/lod.bco-dmo.org\/id\/affiliation\/191"}],"http:\/\/schema.org\/target":[{"@value":"_:json637878entryPoint"}]},"_:json637878entryPoint":{"@id":"_:json637878entryPoint","@type":["http:\/\/schema.org\/EntryPoint"],"http:\/\/schema.org\/url":[{"@value":"https:\/\/www.bco-dmo.org\/dataset\/637878.json","@type":"xsd:anyURI"}],"http:\/\/schema.org\/contentType":[{"@value":"application\/ld+json","@type":"xsd:token"}]},"_:ttl637878":{"@id":"_:ttl637878","@type":["http:\/\/ocean-data.org\/schema\/MetadataDownloadAffordance"],"http:\/\/schema.org\/subjectOf":[{"@id":"http:\/\/lod.bco-dmo.org\/id\/dataset\/637878"}],"http:\/\/schema.org\/name":[{"@value":"Turtle","@type":"xsd:string"}],"http:\/\/ocean-data.org\/schema\/affordedBy":[{"@id":"http:\/\/lod.bco-dmo.org\/id\/affiliation\/191"}],"http:\/\/schema.org\/target":[{"@value":"_:ttl637878entryPoint"}]},"_:ttl637878entryPoint":{"@id":"_:ttl637878entryPoint","@type":["http:\/\/schema.org\/EntryPoint"],"http:\/\/schema.org\/url":[{"@value":"https:\/\/www.bco-dmo.org\/dataset\/637878.ttl","@type":"xsd:anyURI"}],"http:\/\/schema.org\/contentType":[{"@value":"text\/turtle","@type":"xsd:token"}]},"_:rdf637878":{"@id":"_:rdf637878","@type":["http:\/\/ocean-data.org\/schema\/MetadataDownloadAffordance"],"http:\/\/schema.org\/subjectOf":[{"@id":"http:\/\/lod.bco-dmo.org\/id\/dataset\/637878"}],"http:\/\/schema.org\/name":[{"@value":"RDF\/XML","@type":"xsd:string"}],"http:\/\/ocean-data.org\/schema\/affordedBy":[{"@id":"http:\/\/lod.bco-dmo.org\/id\/affiliation\/191"}],"http:\/\/schema.org\/target":[{"@value":"_:rdf637878entryPoint"}]},"_:rdf637878entryPoint":{"@id":"_:rdf637878entryPoint","@type":["http:\/\/schema.org\/EntryPoint"],"http:\/\/schema.org\/url":[{"@value":"https:\/\/www.bco-dmo.org\/dataset\/637878.rdf","@type":"xsd:anyURI"}],"http:\/\/schema.org\/contentType":[{"@value":"application\/rdf+xml","@type":"xsd:token"}]},"_:iso637878":{"@id":"_:iso637878","@type":["http:\/\/ocean-data.org\/schema\/MetadataDownloadAffordance"],"http:\/\/schema.org\/subjectOf":[{"@id":"http:\/\/lod.bco-dmo.org\/id\/dataset\/637878"}],"http:\/\/schema.org\/name":[{"@value":"ISO 19115-2 (NOAA Profile)","@type":"xsd:string"}],"http:\/\/ocean-data.org\/schema\/affordedBy":[{"@id":"http:\/\/lod.bco-dmo.org\/id\/affiliation\/191"}],"http:\/\/schema.org\/target":[{"@value":"_:iso637878entryPoint"}]},"_:iso637878entryPoint":{"@id":"_:iso637878entryPoint","@type":["http:\/\/schema.org\/EntryPoint","http:\/\/ocean-data.org\/schema\/ISOMetadata"],"http:\/\/schema.org\/url":[{"@value":"https:\/\/www.bco-dmo.org\/dataset\/637878\/iso","@type":"xsd:anyURI"}],"http:\/\/schema.org\/contentType":[{"@value":"application\/xml","@type":"xsd:token"}],"http:\/\/purl.org\/dc\/terms\/conformsTo":[{"@value":"http:\/\/www.isotc211.org\/2005\/gmd-noaa","@type":"xsd:anyURI"}]},"_:dublincore637878":{"@id":"_:dublincore637878","@type":["http:\/\/ocean-data.org\/schema\/MetadataDownloadAffordance"],"http:\/\/schema.org\/subjectOf":[{"@id":"http:\/\/lod.bco-dmo.org\/id\/dataset\/637878"}],"http:\/\/schema.org\/name":[{"@value":"Dublin Core","@type":"xsd:string"}],"http:\/\/ocean-data.org\/schema\/affordedBy":[{"@id":"http:\/\/lod.bco-dmo.org\/id\/affiliation\/191"}],"http:\/\/schema.org\/target":[{"@value":"_:dublincore637878entryPoint"}]},"_:dublincore637878entryPoint":{"@id":"_:dublincore637878entryPoint","@type":["http:\/\/schema.org\/EntryPoint"],"http:\/\/schema.org\/url":[{"@value":"https:\/\/www.bco-dmo.org\/dataset\/637878\/dublin-core","@type":"xsd:anyURI"}],"http:\/\/schema.org\/contentType":[{"@value":"application\/xml","@type":"xsd:token"}],"http:\/\/purl.org\/dc\/terms\/conformsTo":[{"@value":"http:\/\/purl.org\/dc\/elements\/1.1\/","@type":"xsd:anyURI"}]},"_:Identifier637878":{"@id":"_:Identifier637878","@type":["http:\/\/ocean-data.org\/schema\/Identifier","http:\/\/ocean-data.org\/schema\/BCODMOIdentifier","http:\/\/ocean-data.org\/schema\/OSPREY_v2_Node_dataset"],"http:\/\/ocean-data.org\/schema\/identifierScheme":[{"@id":"http:\/\/ocean-data.org\/schema\/IdentifierScheme_BCODMO_Version_2"}],"http:\/\/ocean-data.org\/schema\/identifierValue":[{"@value":"637878","@type":"xsd:token"}],"http:\/\/ocean-data.org\/schema\/resolvableURL":[{"@value":"http:\/\/lod.bco-dmo.org\/id\/dataset\/637878","@type":"xsd:anyURI"}]}}]}