Total spectral count of proteins from R/V Falkor cruise 160115 for the ProteOMZ expedition in the Central Pacific in 2016.

Website: https://www.bco-dmo.org/dataset/737620
Data Type: Cruise Results
Version: 3
Version Date: 2018-12-10

Project
» The ProteOMZ Expedition: Investigating Life Without Oxygen in the Pacific Ocean (ProteOMZ (Proteomics in an Oxygen Minimum Zone))
ContributorsAffiliationRole
Saito, Mak A.Woods Hole Oceanographic Institution (WHOI)Principal Investigator
Ake, HannahWoods Hole Oceanographic Institution (WHOI BCO-DMO)BCO-DMO Data Manager
York, Amber D.Woods Hole Oceanographic Institution (WHOI BCO-DMO)BCO-DMO Data Manager

Abstract
Relative protein abundance data of the upper 1000m water column from the ProteOMZ R/V Falkor expedition. There are 56,577 protein identifications over 103 samples, for ~6 million data points.


Coverage

Spatial Extent: N:17.4465 E:-139.1089 S:-0.4708 W:-157.3022
Temporal Extent: 2016-01-19 - 2016-01-28

Dataset Description

These data are part of the Ocean Protein Portal "ProteOMZ" dataset (https://proteinportal.whoi.edu/; Saito et al., 2019).


Data Processing Description

The raw mass spectra files were searched against SEQUEST within Proteome Discoverer v2.2 software. Processed files were then loaded into Proteome Software and protein and peptide reports as well as and fasta files were exported. The files were modified slightly to map to the Protein Portal data model for submission to BCO-DMO.


BCO-DMO Processing Description

-Date, time, filter min, filter max, lat, lon, and cruise columns were added based on information from the Falkor 160115 Event log and CTD log.
-Column names reformatted to comply with BCO-DMO standards.

Dataset version 3: replaces the earlier data version from date (2018-05-25)
* Values "#N/A" changed to "Unknown" which has a different meaning than blank values. Unknown = "Protein functional and taxonomic annotations are marked as "Unknown" for protein sequences which did not have any significant hits to known reference sequences or motifs in the metagenome annotation database. "
* Event log and McLane pump log were updated to fix lat/lon, date/time issues. Since these were sources of information in this dataset, this dataset is also being updated. See respective "processing notes" sections for these two logs for detailed information about changes to those data sources. event log: https://www.bco-dmo.org/dataset/708384 pump log: https://www.bco-dmo.org/dataset/708495
* ISO_DateTime_UTC timestamp added from Date and time columns in the McLane log dataset. Date and time columns were HST time zone so 10 hours were added to make the time in UTC.
* columns for max and min pump filter size added (min 0.2 max 3.0).
* some station and target depths for station and target depth combinations did not exist in the Mclane pump log so the missing values were added to the mclane pump log. The following columns in this dataset come from the pump log: cruise, cast, date, time, lat, lon, depth (ISO_DateTime_UTC is derived from local date and time). Where there were two matched casts in the pump log for a station and target depth, the first cast for the station was used. The only differences between the two possible casts were the date/time columns.


[ table of contents | back to top ]

Related Datasets

IsSupplementedBy
Saito, M. A. (2018) Total spectral counts of peptides from the R/V Falkor cruise 160115 in the Central Pacific for the ProteOMZ expedition in 2016. Biological and Chemical Oceanography Data Management Office (BCO-DMO). (Version 1) Version Date 2018-05-25 http://lod.bco-dmo.org/id/dataset/737596 [view at BCO-DMO]
Relationship Description: The peptides related to these proteins
IsRelatedTo
Saito, M. A. (2019) FASTA file of identified protein sequences from the R/V Falkor cruise 160115 for the ProteOMZ expedition in the Central Pacific during 2016. Biological and Chemical Oceanography Data Management Office (BCO-DMO). (Version 2) Version Date 2019-01-02 http://lod.bco-dmo.org/id/dataset/737611 [view at BCO-DMO]
Relationship Description: These datasets are part of the Ocean Protein Portal "ProteOMZ" dataset (https://proteinportal.whoi.edu/; Saito et al., 2019).
Saito, M. A., Saunders, J. (2022) Relative protein abundance from scaled and corrected exclusive peptide spectral counts from the ProteOMZ R/V Falkor expedition cruise FK160115 in the Pelagic central Pacific Ocean in 2016. Biological and Chemical Oceanography Data Management Office (BCO-DMO). (Version 1) Version Date 2022-01-13 doi:10.26008/1912/bco-dmo.868030.1 [view at BCO-DMO]
Relationship Description: These datasets are part of the Ocean Protein Portal "ProteOMZ" dataset (https://proteinportal.whoi.edu/; Saito et al., 2019).

[ table of contents | back to top ]

Parameters

ParameterDescriptionUnits
row_idRow number unitless
Identified_ProteinsJCVI metagenome ID number unitless
Molecular_WeightUseful for calculated information kDa
best_hit_annotationProduct name unitless
best_hit_speciesTaxon unitless
best_hit_taxon_idTaxon ID unitless
KOKegg ID unitless
KO_descKegg name unitless
KO_pathwayKegg pathway unitless
ECEnzyme Commission ID number unitless
uniprotUniprot database ID number unitless
PFamsProtein family ID number unitless
PFams_descProtein family description unitless
cruiseCruise number unitless
stationStation number unitless
date; yyyy/mm/ddDate at station unitless
time; hh:mmTime at station unitless
latLatitude decimal degrees
lonLongitude decimal degrees
castCast number unitless
depthCast depth meters
spectral_countSpectral count of each protein count
min_filter_sizeMinimum filter size microns
max_filter_sizeMaximum filter size microns
ISO_DateTime_UTCDateTime UTC ISO formatted unitless


[ table of contents | back to top ]

Instruments

Dataset-specific Instrument Name
SeaBird SBE19 CTD
Generic Instrument Name
CTD Sea-Bird
Dataset-specific Description
Used for water sampling
Generic Instrument Description
Conductivity, Temperature, Depth (CTD) sensor package from SeaBird Electronics, no specific unit identified. This instrument designation is used when specific make and model are not known. See also other SeaBird instruments listed under CTD. More information from Sea-Bird Electronics.

Dataset-specific Instrument Name
Trace Metal Rosette
Generic Instrument Name
Trace Metal Bottle
Dataset-specific Description
Used for nutrient sampling
Generic Instrument Description
Trace metal (TM) clean rosette bottle used for collecting trace metal clean seawater samples.

Dataset-specific Instrument Name
Technicon AutoAnalyzer II
Generic Instrument Name
Technicon AutoAnalyzer II
Dataset-specific Description
Used to measure phosphate and ammonium
Generic Instrument Description
A rapid flow analyzer that may be used to measure nutrient concentrations in seawater. It is a continuous segmented flow instrument consisting of a sampler, peristaltic pump, analytical cartridge, heating bath, and colorimeter. See more information about this instrument from the manufacturer.

Dataset-specific Instrument Name
Alpkem Autosampler
Generic Instrument Name
Alpkem RFA300
Dataset-specific Description
Used in nutrient analysis
Generic Instrument Description
A rapid flow analyser (RFA) that may be used to measure nutrient concentrations in seawater. It is an air-segmented, continuous flow instrument comprising a sampler, a peristaltic pump which simultaneously pumps samples, reagents and air bubbles through the system, analytical cartridge, heating bath, colorimeter, data station, and printer. The RFA-300 was a precursor to the smaller Alpkem RFA/2 (also RFA II or RFA-2).


[ table of contents | back to top ]

Deployments

FK160115

Website
Platform
R/V Falkor
Report
Start Date
2016-01-16
End Date
2016-02-11
Description
Project: Using Proteomics to Understand Oxygen Minimum Zones (ProteOMZ) More information is available from the ship operator at https://schmidtocean.org/cruise/investigating-life-without-oxygen-in-the... Additional cruise information is available from the Rolling Deck to Repository (R2R): https://www.rvdata.us/search/cruise/FK160115


[ table of contents | back to top ]

Project Information

The ProteOMZ Expedition: Investigating Life Without Oxygen in the Pacific Ocean (ProteOMZ (Proteomics in an Oxygen Minimum Zone))


Coverage: Central Pacific Ocean (Hawaii to Tahiti)


From Schmidt Ocean Institute's ProteOMZ Project page:

Rising temperatures, ocean acidification, and overfishing have now gained widespread notoriety as human-caused phenomena that are changing our seas. In recent years, scientists have increasingly recognized that there is yet another ingredient in that deleterious mix: a process called deoxygenation that results in less oxygen available in our seas.

Large-scale ocean circulation naturally results in low-oxygen areas of the ocean called oxygen deficient zones (ODZs). The cycling of carbon and nutrients – the foundation of marine life, called biogeochemistry – is fundamentally different in ODZs than in oxygen-rich areas. Because researchers think deoxygenation will greatly expand the total area of ODZs over the next 100 years, studying how these areas function now is important in predicting and understanding the oceans of the future. This first expedition of 2016 led by Dr. Mak Saito from the Woods Hole Oceanographic Institution (WHOI) along with scientists from University of Maryland Center for Environmental Science, University of California Santa Cruz, and University of Washington aimed to do just that, investigate ODZs.

During the 28 day voyage named “ProteOMZ,” researchers aboard R/V Falkor traveled from Honolulu, Hawaii to Tahiti to describe the biogeochemical processes that occur within this particular swath of the ocean’s ODZs. By doing so, they contributed to our greater understanding of ODZs, gathered a database of baseline measurements to which future measurements can be compared, and established a new methodology that could be used in future research on these expanding ODZs.



[ table of contents | back to top ]

Funding

Funding SourceAward
Gordon and Betty Moore Foundation: Marine Microbiology Initiative (MMI)
Schmidt Ocean Institute (SOI)

[ table of contents | back to top ]