Peptides and their spectral counts from KM1128 the METZYME expedition on R/V Kilo Moana in the tropical North Pacific in 2011

Website: https://www.bco-dmo.org/dataset/731314
Data Type: Cruise Results
Version: 4
Version Date: 2021-03-30

Project
» Connecting Trace Elements and Metalloenzymes Across Marine Biogeochemical Gradients (GPc03) (MetZyme)
» Adaptation of key N2-fixing cyanobacteria to changing CO2 (HiCO2_AdaptCyano)

Program
» U.S. GEOTRACES (U.S. GEOTRACES)
ContributorsAffiliationRole
Saito, Mak A.Woods Hole Oceanographic Institution (WHOI)Principal Investigator
Ake, HannahWoods Hole Oceanographic Institution (WHOI BCO-DMO)BCO-DMO Data Manager
York, Amber D.Woods Hole Oceanographic Institution (WHOI BCO-DMO)BCO-DMO Data Manager

Abstract
Global metaproteomic dataset for KM1128 the METZYME expedition. Size fractionated samples collected by McLane pump filters (filter size 0.2 – 3.0 micron). The global proteome dataset presented here included 16951 protein identifications and 31994 unique peptide identifications compiled from 37 samples and 74 raw files (CID and HCD modes for each file) and ### total spectra.


Coverage

Spatial Extent: N:20.6372 E:175.9503 S:-15.7878 W:153.2745
Temporal Extent: 2011-10-01 - 2011-10-25

Dataset Description

These data are part of the Ocean Protein Portal "MetZyme 0.2" dataset version 5 (https://proteinportal.whoi.edu/; Saito et al., 2019).


Methods & Sampling

Ocean metaproteomes were sampled and extracted as described in Saito et al., 2014. The protocols are reproduced here:

Samples were collected during the KM1128 METZYME research expedition (Metals and Enzymes in the Pacific) on the R/V Kilo Moana October 1-25, 2011 from Oahu, Hawaii to Apia, Samoa, with Carl Lamborg and Mak Saito as Chief Scientists. Microbial biomass for protein analyses was collected on vertical profiles using in situ high volume particle filtration pumps with a focus on the North Pacific and Equatorial regions. Protein samples were collected by a suite of 4 L/min and 8 L/min McLane Pumps (WTS-LV; McLane Research Laboratories Inc., Falmouth MA, USA) outfitted with custom Mini-MULVS multiple filter head systems. The 0.2-3.0 micron size fraction was collected on 142 mm filters (Supor, Pall Corp.) for analyses used in this study. The volume of water was pumped until a minimal flow rate was achieved or the allotted cast time period expired, typically ~300-500 L. Filters were sectioned immediately after pump retrieval, and protein samples (¼ filter) were stored in RNAlater reagent (Ambion, Life Technologies), which has been shown to be an effective preservative for cyanobacterial biomass (Saito et al., 2011), frozen at -80C, transported back to the laboratory on dry ice, and stored at -80C until analysis.

Total microbial protein (0.2-3.0um fraction) was extracted using detergent-based methods, described below. Total protein showed enhanced concentrations in the photic zone, particularly in the Equatorial and South Pacific portions of the transect (Fig. S1F). For protein extraction, samples were thawed and the filter and RNAlater preservative (Ambion Life Technologies) were separated. The removed preservative was spin-concentrated by a 5K MWCO membrane (Sartorius Stedim Biotech 6 mL, 5 K MWCO Vivaspin units; Goettingen, Germany), and rinsed with 0.1M Tris buffer to recover, desalt, and concentrate any suspended material. The sample filter was unfolded and placed in a larger tube to which 1% SDS extraction buffer (1% SDS, 0.1M Tris/HCL pH 7.5, 10mM EDTA) and the rinsed/desalted RNAlater fraction was added back. Each sample was incubated at room temperature for 15 minutes, heated at 95C for 10 minutes, and shaken at room temperature for 350 rpm for 1 h. The protein extract was decanted and placed in a new tube and centrifuged for 30 min x 3220 g at room temperature. The supernatants were removed and filtered through a 5 um low protein binding syringe filter (Fisher Scientific), and the filter rinsed with extraction buffer. The extracts were concentrated by 5 kD membrane centrifugation to a small volume, washed with extraction buffer, and concentrated again. Each sample was precipitated with cold 50% methanol (MeOH) 50% acetone 0.5 mM HCL for 3 days at –20C, centrifuged at 14100 x g (14500 rpm) for 30 m at 4C, decanted and dried by vacuum concentration (Thermo Savant Speedvac) for 10 min or until dry. Pellets were resuspended in 1% SDS extraction buffer and left at room temperature (RT) for 1 h to redissolve. Total protein was quantified (Bio-Rad DC protein assay, Hercules, CA) with BSA as a standard.

Extracted proteins were purified from SDS detergent, reduced, alkylated, and  trypsin digested, while embedded within a polyacrylamide tube gel, using a modified protocol from a previously published method (38). The tube gel approach allowed all proteins including membrane proteins to be solubilized by detergent and purified while immobilized in the gel matrix. A gel premix was made by combining 1 M Tris HCL (pH 7.5) and 40% Bis-acrylimide L 29:1 (Acros Organics) at a ratio of 1:3. The premix (103 ul) was combined with an extracted protein sample (usually 25 ug-200 ug), TE, 7 ul 1% APS and 3 ul of TEMED (Acros Organics) to a final volume of 200 ul. After 1 h of polymerization at RT, 200 ul of gel fix solution (50% ETOH, 10% acetic acid in LC/MS grade water) was added to the top of the gel and incubated at RT for 20 minutes. Liquid was then removed and the tube gel was transferred into a new 1.5 mL microtube.


BCO-DMO Processing Description

Data Version 4 (2021-03-30) replaces data version 2 (2018-05-21). The intermediate version 3 was never made public.
Uses updated station information (dataset 730891 version 3) joined with the submitted file METZYME_V1_peptides_050917.xlsx. Also several transformation and find and replace operations performed as described below.
Sheet "Final" imported into the BCO-DMO data system from file Metzyme_Proteins_V1_targetdecoyPSMvalidator_030918.xlsx
N/A interpreted as no data value
skipped row that started "END"
Columns renamed to fit the OPP template (see https://github.com/oceanproteinportal/data-file-templates).
Added minimum_filter_size_microns column with value '0.2'
Added maximum_filter_size_microns with 'value 3.0'
Added station information from dataset 730891 version 3 using a join operation station and depth as keys. Added dates times (local and UTC), and lat, lon.
file saved as csv format.
{MS/MS sample name}
*Needed sample name with the number 0 padded to match other sample ids in station info and protein data. Created new column for sample_id.

v2 notes:

- Replaced blank cells with nd
- Replaced #N/A with NA
- Replaced commas in cells with ;
- Added cruise number column


[ table of contents | back to top ]

Related Publications

Saito, M. A., Bertrand, E. M., Duffy, M. E., Gaylord, D. A., Held, N. A., Hervey, W. J., Hettich, R. L., Jagtap, P. D., Janech, M. G., Kinkade, D. B., Leary, D. H., McIlvin, M. R., Moore, E. K., Morris, R. M., Neely, B. A., Nunn, B. L., Saunders, J. K., Shepherd, A. I., Symmonds, N. I., & Walsh, D. A. (2019). Progress and Challenges in Ocean Metaproteomics and Proposed Best Practices for Data Sharing. Journal of Proteome Research, 18(4), 1461–1476. https://doi.org/10.1021/acs.jproteome.8b00761
Methods
Saito, M. A., Bulygin, V. V., Moran, D. M., Taylor, C., & Scholin, C. (2011). Examination of Microbial Proteome Preservation Techniques Applicable to Autonomous Environmental Sample Collection. Frontiers in Microbiology, 2. doi:10.3389/fmicb.2011.00215
Related Research
Saito, M. A., McIlvin, M. R., Moran, D. M., Goepfert, T. J., DiTullio, G. R., Post, A. F., & Lamborg, C. H. (2014). Multiple nutrient stresses at intersecting Pacific Ocean biomes detected by protein biomarkers. Science, 345(6201), 1173–1177. https://doi.org/10.1126/science.1256450
Methods

[ table of contents | back to top ]

Related Datasets

IsRelatedTo
Saito, M. (2019) Station locations for the global metaproteomic dataset from R/V Kilo Moana cruise KM1128, the METZYME expedition in the tropical North Pacific in 2011. Biological and Chemical Oceanography Data Management Office (BCO-DMO). (Version 3) Version Date 2019-04-12 doi:10.1575/1912/bco-dmo.730891.3 [view at BCO-DMO]
Relationship Description: Station locations and sampling information for these data.
Saito, M. A. (2017) FASTA file of protein identifications from R/V Kilo Moana KM1128 in the North Pacific during 2011 (MetZyme project). Biological and Chemical Oceanography Data Management Office (BCO-DMO). (Version 1) Version Date 2017-07-14 http://lod.bco-dmo.org/id/dataset/708816 [view at BCO-DMO]
Relationship Description: FASTA file of protein identifications. This dataset is part of the same "MetZyme 0.2" dataset at the Ocean Protein Portal (https://proteinportal.whoi.edu/).
Saito, M. A. (2022) Total spectral count of proteins from R/V Kilo Moana cruise KM1128 for the METZYME expedition in the tropical North Pacific in 2011. Biological and Chemical Oceanography Data Management Office (BCO-DMO). (Version 5) Version Date 2022-07-06 http://lod.bco-dmo.org/id/dataset/730722 [view at BCO-DMO]
Relationship Description: Related protein spectral counts. This dataset is part of the same "MetZyme 0.2" dataset at the Ocean Protein Portal (https://proteinportal.whoi.edu/).

[ table of contents | back to top ]

Parameters

ParameterDescriptionUnits
cruiseCruise deployment number unitless
stationStation number unitless
depthDepth of sampling event meters
MS_MS_sample_nameRAW file name - crossreferences to proteomexchange or OPP repository unitless
protein_ID_numMetagenome ID - needed to map to protein IDs. unitless
protein_accession_numDuplicate of Protein ID but can contain multiple entries. unitless
protein_molecular_weightMolecular weight kDA
peptide_sequenceUnique peptide sequence - most unique identifier in dataset unitless
best_peptide_identification_probabilityMetric of peptide quality unitless
best_SEQUEST_Xcorr_scoreMetric of peptide quality unitless
best_SEQUEST_DCn_scoreMetric of peptide quality unitless
number_of_identified_pos2H_spectraNumber of identified +2H spectra unitless
number_of_identified_pos3H_spectraNumber of identified +3H spectra unitless
number_of_identified_pos4H_spectraNumber of identified +4H spectra unitless
sum_spectral_countsSum of +2+3 +4 data = total unnormalized spectral counts - Quantitative Value unitless
median_retention_timeMedian retention time minutes
total_precursor_intensityQuantitative Value (2 of 3) unitless
total_TICQuantitative Value (3 of 3) unitless
peptide_start_indexData useful in creating peptide coverage map for WHO page unitless
peptide_stop_indexData useful in creating peptide coverage map for WHO page unitless
latLatitude decimal degrees
lonLongitude decimal degrees
time_localLocal time of sampling at station and depth (HST, UTC-10) unitless
min_filter_sizeMinimum filter size microns
max_filter_sizeMaximum filter size microns


[ table of contents | back to top ]

Instruments

Dataset-specific Instrument Name
McLane Pumps (WTS-LV; McLane Research Laboratories Inc., Falmouth MA, USA)
Generic Instrument Name
McLane Pump
Dataset-specific Description
Used to collect microbial biomass for protein analyses
Generic Instrument Description
McLane pumps sample large volumes of seawater at depth. They are attached to a wire and lowered to different depths in the ocean. As the water is pumped through the filter, particles suspended in the ocean are collected on the filters. The pumps are then retrieved and the contents of the filters are analyzed in a lab.

Dataset-specific Instrument Name
5 kD membrane centrifuge
Generic Instrument Name
Centrifuge
Dataset-specific Description
Used to concentrate the extracts
Generic Instrument Description
A machine with a rapidly rotating container that applies centrifugal force to its contents, typically to separate fluids of different densities (e.g., cream from milk) or liquids from solids.


[ table of contents | back to top ]

Deployments

KM1128

Website
Platform
R/V Kilo Moana
Start Date
2011-10-01
End Date
2011-10-25
Description
This is a MetZyme project cruise. The original cruise data are available from the NSF R2R data catalog.


[ table of contents | back to top ]

Project Information

Connecting Trace Elements and Metalloenzymes Across Marine Biogeochemical Gradients (GPc03) (MetZyme)

Coverage: Tropical North Pacific along 150 degrees West from 18 degrees North to the equator


MetZyme project researchers will determine the role of enzymatic activity in the cycling of trace metals. Specifically the research will address the following questions: (1) degradation of sinking particulate organic material in the Tropical North Pacific can be influenced by the ability of microbes to synthesize zinc proteases, which in turn is controlled by the abundance or availability of zinc, and (2) methylation of mercury is controlled, in part, by the activity of cobalt-containing enzymes, and therefore the supply of labile cobalt to the corrinoid-containing enzymes or co-factors responsible for methylation. To attain their goal, they will collect dissolved and particulate samples for trace metals and metalloenzymes from three stations along a biogeochemical gradient in the Tropical North Pacific (along 150 degrees West from 18 degrees North to the equator). Sinking particles from metal clean sediment traps will also be obtained. The samples will also be used to carry out shipboard incubation experiments using amendments of metals, metal-chelators, B12, and proteases to examine the sensitivity and metal limitation of heterotrophic, enzymatic degradation of organic matter within the oceanic "Twilight Zone" (100-500 m). This study will result in a novel metaproteomic/metalloenzyme datasets that should provide insights into the biogeochemical cycling of metals, as well as co-limitation of primary productivity and controls on the export of carbon from the photic zone. In addition to the final data being contributed to BCO-DMO, an online metaproteomic data server will be created so the community has access to the raw data files generated by this research.


Adaptation of key N2-fixing cyanobacteria to changing CO2 (HiCO2_AdaptCyano)

Coverage: Culture study at the University of Southern California, Los Angeles


Description from NSF award abstract:
This study will employ a novel combination of experimental evolution techniques and state-of-the-art molecular methods to yield unique insights into adaptive changes in the keystone marine cyanobacteria Trichodesmium and Crocosphaera in response to selection by high CO2. Several studies have suggested that N2-fixation rates of the biogeochemically-critical cyanobacteria Trichodesmium and Crocosphaera may increase dramatically in the future high CO2 ocean, but these have all used the same limited set of cultured isolates and considered cells only briefly acclimated to elevated CO2. The investigator's new results, however, demonstrate that a broad diversity of high- and low-CO2 adapted ecotypes exists within each diazotroph genus. Furthermore, in a preliminary four year experimental evolution study with Trichodesmium, the PIs observed large adaptive responses following 500-700 generations of selection by high CO2- but in a completely unexpected way. All of the six replicate high CO2-adapted cell lines exhibited strong constitutive up-regulation of N2 fixation rates. These very elevated N2 fixation rates continued, even though the cultures have were switched back to low-CO2 conditions for many months. Expression of the nif operon and N assimilatory genes was also up-regulated in these cell lines, as is expression of many intergenic regions of the genome.

The investigators hypothesize that constitutive up-regulation of cellular N2 fixation systems may be a common adaptive response of both Trichodesmium and Crocosphaera under extended selection by elevated CO2. This project will test this hypothesis in a four-year experimental evolution study to determine the adaptive responses of both high- and low-CO2 specialized ecotypes of these two diazotrophs to increased CO2.

The investigators will grow representative high- and low-CO2 adapted ecotypes from each genus in well-replicated cell lines at 380 ppm and 750 ppm CO2 for up to 1000 generations. Periodically, they will perform "switch" experiments to measure N2 and CO2 fixation rates and growth rates of high CO2-selected cell lines grown briefly (one week) at low CO2, and vice versa. These switch experiments will allow screening for cell lines which exhibit adaptive changes in phenotypically-expressed rate parameters, such as those observed in the preliminary Trichodesmium study. Evolutionary mechanisms in the CO2-selected cell lines will be examined by comparison of changes in their genomes, transcriptomes, and proteomes over time relative to reference genomes, using frozen samples archived monthly during the preceding selection period. Examination of these molecular and biochemical changes will be coordinated with an in-depth array of physiological and biogeochemical analyses. This combined approach will allow an evaluation of potential adaptive mechanisms in diazotrophic cyanobacteria ranging from indel, duplication, single nucleotide polymorphism, and transposition mutations to altered putative non-coding RNA expression, protein expression, and post-translational protein modifications, and then allow the investigators to link these mechanisms directly with their potential impacts on ecosystem-level biogeochemical processes like N2 and CO2 fixation. Finally, the research team will determine how long term selection by high CO2 affects the iron and phosphorus requirements of Trichodesmium and Crocosphaera, since constitutive up-regulation of N2 fixation would also have major implications for limitation of diazotrophs by these two critical nutrients in the future high CO2 ocean.



[ table of contents | back to top ]

Program Information

U.S. GEOTRACES (U.S. GEOTRACES)


Coverage: Global


GEOTRACES is a SCOR sponsored program; and funding for program infrastructure development is provided by the U.S. National Science Foundation.

GEOTRACES gained momentum following a special symposium, S02: Biogeochemical cycling of trace elements and isotopes in the ocean and applications to constrain contemporary marine processes (GEOSECS II), at a 2003 Goldschmidt meeting convened in Japan. The GEOSECS II acronym referred to the Geochemical Ocean Section Studies To determine full water column distributions of selected trace elements and isotopes, including their concentration, chemical speciation, and physical form, along a sufficient number of sections in each ocean basin to establish the principal relationships between these distributions and with more traditional hydrographic parameters;

* To evaluate the sources, sinks, and internal cycling of these species and thereby characterize more completely the physical, chemical and biological processes regulating their distributions, and the sensitivity of these processes to global change; and

* To understand the processes that control the concentrations of geochemical species used for proxies of the past environment, both in the water column and in the substrates that reflect the water column.

GEOTRACES will be global in scope, consisting of ocean sections complemented by regional process studies. Sections and process studies will combine fieldwork, laboratory experiments and modelling. Beyond realizing the scientific objectives identified above, a natural outcome of this work will be to build a community of marine scientists who understand the processes regulating trace element cycles sufficiently well to exploit this knowledge reliably in future interdisciplinary studies.

Expand "Projects" below for information about and data resulting from individual US GEOTRACES research projects.



[ table of contents | back to top ]

Funding

Funding SourceAward
NSF Division of Ocean Sciences (NSF OCE)
Gordon and Betty Moore Foundation: Marine Microbiology Initiative (MMI)

[ table of contents | back to top ]