| Contributors | Affiliation | Role |
|---|---|---|
| Martiny, Adam | University of California-Irvine (UC Irvine) | Principal Investigator |
| Saito, Mak A. | Woods Hole Oceanographic Institution (WHOI) | Principal Investigator |
| Garcia, Nathan S. | University of California-Irvine (UC Irvine) | Co-Principal Investigator, Scientist |
| McIlvin, Matthew R. | Woods Hole Oceanographic Institution (WHOI) | Scientist |
| Moran, Dawn | Woods Hole Oceanographic Institution (WHOI) | Scientist |
| York, Amber D. | Woods Hole Oceanographic Institution (WHOI BCO-DMO) | BCO-DMO Data Manager |
Experimental design
Synechococcus cultures (WH8102) were grown in polycarbonate bottles with a continuous chemostat method used previously (Garcia et al. 2016) in artificial seawater. We used two concentration ratios of macronutrients (NO3-:PO43- = 1.7 and 80) and 3 levels of temperature (20, 24 and 28°C) with a slow dilution rate (0.18 d-1) to ensure treatment-wise culture stability. Target concentrations of NO3- and PO43- in chemostats were 45 µM and 0.56 µM, respectively in the N:P=80 treatment, and 15.9 µM and 9.2 µM, respectively in the N:P=1.7 treatment. White light was supplied at 125 µmol quanta m-2 s-1 on a 12h:12h light:dark cycle. Chemostat equilibria were monitored by measuring culture cell density and forward scatter (FSCH) with a Novocyte flow cytometer 1000 (Acea Biosciences, Inc, San Diego, CA). Cells for proteome analysis were collected after an acclimation period on days 38, 43, 47, 50 and 57 with a 47 mm polycarbonate filter (0.2 µm pore size) 7-8 hours into the light period, pelleted by centrifugation (21,130 g for 3 minutes), flash frozen in liquid nitrogen, and stored at -80°C.
Protein extraction and peptide preparation
Proteins were extracted by heating pelleted cells at 95°C for 10 min and gently shaking at room temperature for 30 min in a buffer solution (400 µL – 1760 µL; 50 mM HEPES pH 8.5 (Boston BioProducts #BB-2082), 1% SDS in HPLC grade water) before centrifuging at 14100 g for 20 min at room temperature and removing the supernatant. Sodium dodecyl sulfate (1%) is a strong detergent for diverse matrices including cell membranes (Hughes et al. 2014). Benzonase nuclease (50 units; Novagen #70746-3) was added to 400 µL extracted protein sample and incubated at 37°C for 30 min. Samples were reduced by adding 20 µL of 200 mM DTT (Fisher #BP172-5) in 50 mM HEPES pH 8.5 at 45°C for 30 min and alkylated with 40 µL of 400 mM iodoacetamide (Acros #122270050) in HEPES pH 8.5 for 30 min at 24°C. The reaction was quenched by adding 40 µL of 200 mM DTT in 50 mM HEPES pH 8.5. SpeedBead Magnetic Carboxylate Modified Particles (GE Healthcare #65152105050250 and #45152105050250) were prepared according to Hughes et al. (2014) and added (20 µg/µL) to 400 µL of extracted protein sample. Samples were incubated with formic acid (pH of 2-3) and washed with ethanol and acetonitrile using a magnetic rack. Protein was measured with the BCA method (Thermo Scientific Micro BCA Protein Assay Kit #23235) and digested overnight at 37°C with 1 part trypsin (Promega #V5280; dissolved in HEPES pH 8.0, 0.5 µg/µL) and 25 parts protein. Peptides were washed with acetonitrile and ethanol using a magnetic rack and diluted to a target concentration of 0.1% trifluoroacetic acid or 1% formic acid and a final concentration of 1 µg/µL.
Mass spectrometry of peptides
Similar to other analyses (Searle et al. 2018), peptides were analyzed using a Michrom Advance HPLC system coupled to a Q-Exactive mass spectrometer (Thermo Scientific instrument version 2.8) with a Michrom Advance CaptiveSpray source, using the constant injection concentration of 1 µg/µL (one microgram per microliter) to allow uniformity across the dataset. Samples were concentrated onto a C18 column (Reprosil-Gold, Dr. Maisch GmbH) and eluted in a non-linear, 200-min gradient of formic acid and acetonitrile buffers. Full MS1 scans were performed (35,000 resolution, 3e6 AGC target, 60 ms maximum IT, 385 to 1015 m/z) with overlapping DIA scans (17,500 resolution, 1e6 AGC target, 60 ms maximum IT, 24.0 m/z isolation windows, normalized collision energy of 27, loop count 25).
Proteomic data analysis
Data-independent acquisition mass spectrometry sample data were analyzed using Scaffold DIA (2.2.1), converted to mzML format (ProteoWizard 3.0.11748, Chambers et al (2012)), and individually searched against Syn8102_uniprot-proteome_UP000001422.fasta with a peptide and fragment mass tolerance of 10.0 ppm. Percolator (3.01) filtered peptides for a maximum false discovery rate of 0.01. Charged peptides (2-3) with length (6-30) were considered. EncyclopeDIA (0.9.6) selected the 5 highest quality fragment ions for quantitation (Searle et al. 2018).
Data-independent acquisition mass spectrometry sample data were analyzed using Scaffold DIA (2.2.1), converted to mzML format (ProteoWizard 3.0.11748, Chambers et al (2012)), and individually searched against Syn8102_uniprot-proteome_UP000001422.fasta with a peptide and fragment mass tolerance of 10.0 ppm. Percolator (3.01) filtered peptides for a maximum false discovery rate of 0.01. Charged peptides (2-3) with length (6-30) were considered. EncyclopeDIA (0.9.6) selected the 5 highest quality fragment ions for quantitation (Searle et al. 2018).
.raw proteomics files are accessible at the PRIDE Archive (https://www.ebi.ac.uk/pride/archive/projects/PXD043180).
Dataset version 1:
Wide format table: "923159_v1_norm-peak-area_exclusive-intensity-wide-format.xlsx" history:
* This contains the wide format of the dataset (separate normalized peak area columns per Protein_ID_(long_form),Temp_chemostat,Nutrient_chemostat,Treatment_replicate,Chemostat_day). It is the original Excel file provided to BCO-DMO on Sept 11th, 2025.
* This format contains 5 header lines containing sample and treatment metadata values. The data rows start as row 6.
* The last row contains no values except ("#VALUE!" as gene name 1 and protein name.)
Long format table: "923159_v1_norm-peak-area_exclusive-intensity.csv" history:
* This table is in "long" format with one column for Normalized peak area, and treatment/sample metadata contained in additional columns: "Sample_Number,Temp_chemostat,Nutrient_chemostat,Treatment_replicate,Chemostat_day."
* Sheet 1 of submitted file "Normalized peak areas (exclusive intensity for BCO-DMO).xlsx" (submitted Apr 2nd, 2024) was modified to add a row of sample ID numbers (1-30) before import into the bco-dmo data system, discussed with the file submitter. The last row was not imported, it contained no values except ("#VALUE!" as gene name 1 and protein name.)
* The wide to long transformation was performed with an unpivot operation.
* Values "NaN" were imported as missing data values.
** Missing data values are displayed differently based on the file format you download. They are blank in csv files, "NaN" in MatLab files, etc.
* Column names adjusted to conform to BCO-DMO naming conventions designed to support broad re-use by a variety of research tools and scripting languages. [Only numbers, letters, and underscores. Can not start with a number]
| File |
|---|
923159_v1_norm-peak-area_exclusive-intensity.csv (Comma Separated Values (.csv), 7.23 MB) MD5:0e72a38f21caee0338e52145fe16af8c Primary data file for dataset ID 923159, version 1 which includes a row per sample measured for each protein. This form of the data is referred to as a long-form data table. See supplemental files for an alternate form of this table (wide-form, rows per protein with additional columns per sample). |
| File |
|---|
923159_v1_norm-peak-area_exclusive-intensity-wide-format.xlsx (Microsoft Excel, 437.10 KB) MD5:1b7d758f4126fa56411e03a515046680 Normalized peak areas, exclusive intensity (wide format table). This is the same data as in "923159_v1_norm-peak-area_exclusive-intensity.csv," however, this is the "wide" form of the dataset with normalized peak areas (exclusive intensities) in separate columns per (Temp_(deg_C)_chemostat,Nutrient_(N:P)_chemostat,Treatment_replicate,Chemostat_day). |
| Parameter | Description | Units |
| Protein_ID_long_form | This is the protein ID (long form) (this is the same as the Raw protein ID) | unitless |
| Protein_ID_short_form | This is the protein ID (short form) | unitless |
| Gene_name1 | Gene name 1 | unitless |
| Gene_name2 | Gene name 2 | unitless |
| Protein_name | Protein name (e.g. "C-phycoerythrin class 2 subunit alpha") | unitless |
| Molecular_Weight | Molecular Weight of protein | kilodalton (kDa) |
| Peptide_Count | This is the number of peptides that are identified by the mass-spec method and subsequently used to identify the protein. | unitless |
| Sample_Number | Sample number (1_through_30) | unitless |
| Temp_chemostat | Temperature of chemostat | Degrees Celsius |
| Nutrient_chemostat | Nitrate:phosphate (NO3_to_PO4) input ratio into chemostats | unitless |
| Treatment_replicate | Treatment replicate (1_through_5) | unitless |
| Chemostat_day | Sampling day from chemostat start day | days |
| Norm_peak_area | Normalized peak areas (exclusive intensity). [relative values, unitless] | unitless |
| Dataset-specific Instrument Name | Michrom Advance HPLC system |
| Generic Instrument Name | High-Performance Liquid Chromatograph |
| Dataset-specific Description | HPLC system and mass spectrometer: A Michrom Advance HPLC system was coupled to a Q-Exactive mass spectrometer (Thermo Scientific instrument version 2.8) with a Michrom Advance CaptiveSpray source. |
| Generic Instrument Description | A High-performance liquid chromatograph (HPLC) is a type of liquid chromatography used to separate compounds that are dissolved in solution. HPLC instruments consist of a reservoir of the mobile phase, a pump, an injector, a separation column, and a detector. Compounds are separated by high pressure pumping of the sample mixture onto a column packed with microspheres coated with the stationary phase. The different components in the mixture pass through the column at different rates due to differences in their partitioning behavior between the mobile liquid phase and the stationary phase. |
| Dataset-specific Instrument Name | Q-Exactive mass spectrometer |
| Generic Instrument Name | Mass Spectrometer |
| Dataset-specific Description | HPLC system and mass spectrometer: A Michrom Advance HPLC system was coupled to a Q-Exactive mass spectrometer (Thermo Scientific instrument version 2.8) with a Michrom Advance CaptiveSpray source. |
| Generic Instrument Description | General term for instruments used to measure the mass-to-charge ratio of ions; generally used to find the composition of a sample by generating a mass spectrum representing the masses of sample components. |
| Dataset-specific Instrument Name | 8 L polycarbonate bottles |
| Generic Instrument Name | no_bcodmo_term |
| Dataset-specific Description | Synechococcus cultures (WH8102) were grown in polycarbonate bottles. |
| Generic Instrument Description | No relevant match in BCO-DMO instrument vocabulary. |
| Dataset-specific Instrument Name | |
| Generic Instrument Name | no_bcodmo_term |
| Dataset-specific Description | 47 mm polycarbonate filter (0.2 µm pore size, Millipore): Cells for proteome analysis were collected with a 47 mm polycarbonate filter (0.2 µm pore size, Millipore) 7-8 hours into the light period, pelleted by centrifugation in 2 mL centrifuge tubes (21,130 g for 3 minutes), flash frozen in liquid nitrogen, and stored at -80°C. |
| Generic Instrument Description | No relevant match in BCO-DMO instrument vocabulary. |
NSF Award Abstract:
Due to their sheer abundance and high activity, microorganisms have the potential to greatly influence how ecosystems are affected by changes in their environment. However, descriptions of microbial physiology and diversity are local and highly complex and thus rarely considered in Earth System Models. Thus, the researchers focus on a convergence research framework that can qualitatively and quantitatively integrate eco-evolutionary changes in microorganisms with global biogeochemistry. Here, the investigators will develop an approach that integrates the knowledge and tools of biologists, mathematicians, engineers, and geoscientists to understand the link between the ocean nutrient and carbon cycles. The integration of data and knowledge from diverse fields will provide a robust, biologically rich, and computationally efficient prediction for the variation in plankton resource requirements and the biogeochemical implications, addressing a fundamental challenge in ocean science. In addition, the project can serve as a road map for many other research groups facing a similar lack of convergence between biology and geoscience.
Traditionally, the cellular elemental ratios of Carbon, Nitrogen, and Phosphorus (C:N:P) of marine communities have been considered static at Redfield proportions but recent studies have demonstrated strong latitudinal variation. Such regional variation may have large - but poorly constrained - implications for marine biodiversity, biogeochemical functioning, and atmospheric carbon dioxide levels. As such, variations in ocean community C:N:P may represent an important biological feedback. Here, the investigators propose a convergence research framework integrating cellular and ecological processes controlling microbial resource allocations with an Earth System model. The approach combines culture experiments and omics measurements to provide a molecular understanding of cellular resource allocations. Using a mathematical framework of increasing complexity describing communicating, moving demes, the team will quantify the extent to which local mixing, environmental heterogeneity and evolution lead to systematic deviations in plankton resource allocations and C:N:P. Optimization tools from engineering science will be used to facilitate the quantitative integration of models and observations across a range of scales and complexity levels. Finally, global ocean modeling will enable understanding of how plankton resource use impacts Earth System processes. By integrating data and knowledge across fields, scales and complexity, the investigators will develop a robust link between variation in plankton C:N:P and global biogeochemical cycles.
NSF abstract:
The elemental ratios of carbon, nitrogen, and phosphorus (C:N:P) have been considered fixed proportions in marine environments but recent work has demonstrated changes across latitudes. Such variation can have large implications for marine biogeochemistry and atmospheric CO_2 levels. As such, future variations in ocean community C:N:P could be a key feedback to global change. The elemental composition of particulate organic matter (POM) represents the aggregate value of diverse microorganisms as well as non-living particles. However, we currently do not understand how individual cells and particles contribute to observed variation in the C:N:P of POM. This project is determining the biomass C:N:P of individual microbial cells grown under a range of conditions and sampled from diverse ocean biomes. Because different individuals are likely to have different fates (e.g., loss to sinking, lysis, predation), understanding how the trait distribution of microbial biomass C:N:P relates to cell size and trophic mode and how environmental conditions affect each trait’s distribution offers new perspectives and insight into marine C, N, and P cycles. This project supports two PhD students and multiple undergraduate students. The PhD students are integrated into existing networks on microbiome science at each institution with opportunities to collaborate across diverse disciplines. Undergraduate students at both institutions are being recruited through existing training programs that target underrepresented groups. In addition, PI Hall is part of a collaborative of Ecologists and Poets at CSU, that look at ways to translate ecological relationships to non-traditional media to make it more accessible and impactful to the general public. This group is exploring the nature of individuality within the marine microbiome by creating trait distributions of written text and removing different modes of individuals (e.g., words) to better understand and communicate how individuals from the smallest organisms on the planet (marine microorganisms) can have large effects on the surrounding ecosystem (i.e., the planet). Results from this project are being incorporated into future projects of the working group at Colorado State University including public presentations, art installations, and published materials.
The aim of this project is to quantify the relationship between environmental conditions and marine microbial C:N:P by assessing the individuality in microbial elemental stoichiometry. To achieve this the project uses energy dispersive spectroscopy (EDS) to assess the stoichiometric trait distribution of populations and communities under different resource and temperature treatments and oceanographic field work across a broad latitudinal gradient. The researchers hypothesize that the relationship between the stoichiometry of particulate organic matter (POM) and environmental conditions are masked by distinct responses of individual constituents of the marine microbiome. They hypothesize that these distinct responses result in a multi-modal distribution of particulate carbon (C), nitrogen (N), and phosphorus (P) of the marine microbiome. The investigators are characterizing the distribution of three stoichiometric traits (biomass C:N, C:P, and N:P) for 50 marine isolates (both autotrophs and heterotrophs) under different resource and temperature environments. They are characterizing the same trait distribution for marine communities sampled from different regions of the ocean for the same resource and temperature environments as the population experiments. They are participating on an ocean going cruise to characterize stoichiometric trait distribution of the marine microbiome across natural gradients of resources and temperature. Understanding how constituent members of microbial communities alter their biomass in response to environmental change is providing a missing link between the variation in the ocean’s environment and particulate C:N:P ratios for diverse marine environments.
This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
| Funding Source | Award |
|---|---|
| NSF Division of Ocean Sciences (NSF OCE) | |
| NSF Division of Ocean Sciences (NSF OCE) | |
| NSF Division of Integrative Organismal Systems (NSF IOS) |