| Contributors | Affiliation | Role |
|---|---|---|
| Turk-Kubo, Kendra | University of California-Santa Cruz (UCSC) | Principal Investigator |
| Mills, Matthew M. | Stanford University | Scientist |
| York, Amber D. | Woods Hole Oceanographic Institution (WHOI BCO-DMO) | BCO-DMO Data Manager |
This is a formal meta analysis of all available NDC qPCR datasets published between 2000 and 2019. Seawater DNA and RNA samples were collected using two main field approaches, via Niskin bottles attached to rosettes with CTDs, are using the ship's underway seawater systems). Samples were collected by numerous different research groups. DNA and RNA samples were generally filtered through 0.2 µm filers, using a variety of materials, and in some cases, size fractionated samples were collected (e.g. 3 µm and 0.2 µm filters in series). DNA and RNA samples were typically extracted using on column purification-based kits, and in the case where nifH transcription was quantified cDNA (derived from RNA) was typically generated using Superscript reverse transcriptase kits. Purified DNA/RNA/cDNA were typically stored at -80oC prior to analysis. Quantitative PCR assays (55 total), were developed by multiple different groups, and his database only includes data generated using qPCR using Taqman(R) chemistry.
This dataset was compiled/formatted for CMAP by Mary R Gradoville and Kendra Turk-Kubo from the Zehr Research Group at the University of California Santa Cruz.
Data sources: Individual dataset sources are listed in the "source_data" column for each observation (from 59 published studies). Column "source_doi" is the DOI for the reference provided in "source_data." Additional details about the specific data used are provided in the "source_location_of_data" column (e.g., specific tables, figures, whether data provided by the authors, or databases).
Cruise description: This dataset is a compilation of multiple cruises, collectively carried out by numerous research groups, not all funded by NSF. The original citations are provided in the metadata for the dataset.
All data would be retrieved off the qPCR instrument as a simple matrix of sample number and relative fluorescence units, including standards (with known quantities of the target gene). Additional software products are not needed.
For this dataset, data was extrapolated, when possible, using Web Plot digitizer (https://automeris.io/WebPlotDigitizer/).
* Sheet "data" of submitted file "NCDReview_qPCR_database_06092025.xlsx" was exported from excel to capture the formatted values as displayed in excel. The exported csv file was imported into the BCO-DMO data system for this dataset. Table will appear as Data File: 982905_v1_ncd-qpcr-database.csv (along with other download format options).
Additional metadata included in excel sheet "variable metadata" was attached to this dataset as supplemental file 982905_v1_additional_variable_metadata.csv
Additional metadata include in excel sheet "dataset metadata" were added to fields on this BCO-DMO dataset page.
Added DOIs for each source_data reference publication.
* A unique list of each unique reference included in the column "source_data" was made and each provided reference text was matched to a DOI using Crossref's SimpleQuery Tool. And additional column was added to the dataset table "source_doi." The reference text was cleaned for any special characters that didn't display properly in UTF-8 (e.g. "Bostr√ɬ∂m et al. (2007)" -> "Boström et al. (2007)")
The following character replacements were made:
√¢¬Ä¬ê → -
√¢¬Ä¬ì → –
√Ǭ† → make a space (instead of deleting this char)
√ɬ° → á
√ɬ± → ñ
√ɬ© → é
√ɬ≠ → í
√ɬ∂ → ö
√ɬº → ü
√ɬ≥ → ñ
√ɬü → ß
Additional databases included in "source_location_of_data" doi:10.1594/PANGAEA.818214,doi:10.1594/PANGAEA.833791 were added to the section "Related Datasets" on this page.
Missing Data Identifiers:
* In the BCO-DMO data system missing data identifiers are displayed according to the format of data you access. For example, in csv files it will be blank (null) values. In Matlab .mat files it will be NaN values. When viewing data online at BCO-DMO, the missing value will be shown as blank (null) values.
| Parameter | Description | Units |
| time | date, time (local). Local time zone varies by source location. | unitless |
| lat | latitude | decimal degrees |
| lon | longitude | decimal degrees |
| depth | depth | meters (m) |
| source_data | The reference for the source data publication | unitless |
| source_data_doi | Digital object identifier (DOI) for the source_data (reference) | unitless |
| source_location_of_data | Additional information about the source data from original publication | unitless |
| notes | comments on the source data from original publication | unitless |
| NCD_a24809A06_nifH_gene | Non-cyanobacterial diazotroph type 'a24809A06', nifH gene | gene copies per liter (L-1) |
| NCD_aETSP1_nifH_gene | Non-cyanobacterial diazotroph type 'aETSP1', nifH gene | gene copies per liter (L-1) |
| NCD_Alpha_MH144511_nifH_gene | Non-cyanobacterial diazotroph type 'Alpha_MH144511', nifH gene | gene copies per liter (L-1) |
| NCD_BAL398_nifH_gene | Non-cyanobacterial diazotroph type 'BAL398', nifH gene | gene copies per liter (L-1) |
| NCD_BT5667A01_nifH_gene | Non-cyanobacterial diazotroph type 'BT5667A01', nifH gene | gene copies per liter (L-1) |
| NCD_CB914H4_nifH_gene | Non-cyanobacterial diazotroph type 'CB914H4', nifH gene | gene copies per liter (L-1) |
| NCD_CE1_45m_12_a_nifH_gene | Non-cyanobacterial diazotroph type 'CE1_45m_12_a', nifH gene | gene copies per liter (L-1) |
| NCD_OTU0014_Brady_nifH_gene | Non-cyanobacterial diazotroph type 'OTU0014_Brady', nifH gene | gene copies per liter (L-1) |
| NCD_alpha1_nifH_gene | Non-cyanobacterial diazotroph type 'alpha1', nifH gene | gene copies per liter (L-1) |
| NCD_alpha2_nifH_gene | Non-cyanobacterial diazotroph type 'alpha2', nifH gene | gene copies per liter (L-1) |
| NCD_EVHVF_nifH_gene | Non-cyanobacterial diazotroph type 'EVHVF', nifH gene | gene copies per liter (L-1) |
| NCD_M6411A02_nifH_gene | Non-cyanobacterial diazotroph type 'M6411A02', nifH gene | gene copies per liter (L-1) |
| NCD_PlanctH08_nifH_gene | Non-cyanobacterial diazotroph type 'PlanctH08', nifH gene | gene copies per liter (L-1) |
| NCD_PlanctH09_nifH_gene | Non-cyanobacterial diazotroph type 'PlanctH09', nifH gene | gene copies per liter (L-1) |
| NCD_Cluster_III_nifH_gene | Non-cyanobacterial diazotroph type 'Cluster_III', nifH gene | gene copies per liter (L-1) |
| NCD_Cluster_III_C_nifH_gene | Non-cyanobacterial diazotroph type 'Cluster_III_C', nifH gene | gene copies per liter (L-1) |
| NCD_Cluster_III_L_nifH_gene | Non-cyanobacterial diazotroph type 'Cluster_III_L', nifH gene | gene copies per liter (L-1) |
| NCD_CB907H22_nifH_gene | Non-cyanobacterial diazotroph type 'CB907H22', nifH gene | gene copies per liter (L-1) |
| NCD_cIII_ETSP_nifH_gene | Non-cyanobacterial diazotroph type 'cIII_ETSP', nifH gene | gene copies per liter (L-1) |
| NCD_Cluster_3_nifH_gene | Non-cyanobacterial diazotroph type 'Cluster_3', nifH gene | gene copies per liter (L-1) |
| NCD_M6413A02_nifH_gene | Non-cyanobacterial diazotroph type 'M6413A02', nifH gene | gene copies per liter (L-1) |
| NCD_P4_nifH_gene | Non-cyanobacterial diazotroph type 'P4', nifH gene | gene copies per liter (L-1) |
| NCD_D0CY3_nifH_gene | Non-cyanobacterial diazotroph type 'D0CY3', nifH gene | gene copies per liter (L-1) |
| NCD_ECI27_nifH_gene | Non-cyanobacterial diazotroph type 'ECI27', nifH gene | gene copies per liter (L-1) |
| NCD_AT2LU_nifH_gene | Non-cyanobacterial diazotroph type 'AT2LU', nifH gene | gene copies per liter (L-1) |
| NCD_Arc_nifH_gene | Non-cyanobacterial diazotroph type 'Arc', nifH gene | gene copies per liter (L-1) |
| NCD_P6_nifH_gene | Non-cyanobacterial diazotroph type 'P6', nifH gene | gene copies per liter (L-1) |
| NCD_g24774A11_nifH_gene | Non-cyanobacterial diazotroph type 'g24774A11', nifH gene | gene copies per liter (L-1) |
| NCD_gammaA_nifH_gene | Non-cyanobacterial diazotroph type 'gamma A', nifH gene | gene copies per liter (L-1) |
| NCD_AO15_nifH_gene | Non-cyanobacterial diazotroph type 'AO15', nifH gene | gene copies per liter (L-1) |
| NCD_BAL354_nifH_gene | Non-cyanobacterial diazotroph type 'BAL354', nifH gene | gene copies per liter (L-1) |
| NCD_Azo_nifH_gene | Non-cyanobacterial diazotroph type 'Azo', nifH gene | gene copies per liter (L-1) |
| NCD_BAL281_nifH_gene | Non-cyanobacterial diazotroph type 'BAL281', nifH gene | gene copies per liter (L-1) |
| NCD_BT19215A01_nifH_gene | Non-cyanobacterial diazotroph type 'BT19215A01', nifH gene | gene copies per liter (L-1) |
| NCD_CE2_5m_1_g_nifH_gene | Non-cyanobacterial diazotroph type 'CE2_5m_1_g', nifH gene | gene copies per liter (L-1) |
| NCD_EP19212A1_nifH_gene | Non-cyanobacterial diazotroph type 'EP19212A1', nifH gene | gene copies per liter (L-1) |
| NCD_ETSP1_nifH_gene | Non-cyanobacterial diazotroph type 'ETSP1', nifH gene | gene copies per liter (L-1) |
| NCD_ETSP2_nifH_gene | Non-cyanobacterial diazotroph type 'ETSP2', nifH gene | gene copies per liter (L-1) |
| NCD_ETSP3_nifH_gene | Non-cyanobacterial diazotroph type 'ETSP3', nifH gene | gene copies per liter (L-1) |
| NCD_Gamma_1_nifH_gene | Non-cyanobacterial diazotroph type 'Gamma_1', nifH gene | gene copies per liter (L-1) |
| NCD_Gamma_2_nifH_gene | Non-cyanobacterial diazotroph type 'Gamma_2', nifH gene | gene copies per liter (L-1) |
| NCD_Gamma_3_nifH_gene | Non-cyanobacterial diazotroph type 'Gamma_3', nifH gene | gene copies per liter (L-1) |
| NCD_Gamma_4_nifH_gene | Non-cyanobacterial diazotroph type 'Gamma_4', nifH gene | gene copies per liter (L-1) |
| NCD_Gamma_P_nifH_gene | Non-cyanobacterial diazotroph type 'Gamma_P', nifH gene | gene copies per liter (L-1) |
| NCD_M6433A04_nifH_gene | Non-cyanobacterial diazotroph type 'M6433A04', nifH gene | gene copies per liter (L-1) |
| NCD_EQF91_nifH_gene | Non-cyanobacterial diazotroph type 'EQF91', nifH gene | gene copies per liter (L-1) |
| NCD_ALHOU_nifH_gene | Non-cyanobacterial diazotroph type 'ALHOU', nifH gene | gene copies per liter (L-1) |
| NCD_Vib_nifH_gene | Non-cyanobacterial diazotroph type 'Vib', nifH gene | gene copies per liter (L-1) |
| NCD_OcSpi_nifH_gene | Non-cyanobacterial diazotroph type 'OcSpi', nifH gene | gene copies per liter (L-1) |
| NCD_P8_nifH_gene | Non-cyanobacterial diazotroph type 'P8', nifH gene | gene copies per liter (L-1) |
| NCD_OTU0001_Pseudo_nifH_gene | Non-cyanobacterial diazotroph type 'OTU0001_Pseudo', nifH gene | gene copies per liter (L-1) |
| NCD_OTU0009_Kleb_nifH_gene | Non-cyanobacterial diazotroph type 'OTU0009_Kleb', nifH gene | gene copies per liter (L-1) |
| NCD_P3_nifH_gene | Non-cyanobacterial diazotroph type 'P3', nifH gene | gene copies per liter (L-1) |
| NCD_P7_nifH_gene | Non-cyanobacterial diazotroph type 'P7', nifH gene | gene copies per liter (L-1) |
| NCD_P1_nifH_gene | Non-cyanobacterial diazotroph type 'P1', nifH gene | gene copies per liter (L-1) |
| NCD_P2_nifH_gene | Non-cyanobacterial diazotroph type 'P2', nifH gene | gene copies per liter (L-1) |
| NCD_Alpha_MH144511_nifH_transcript | Non-cyanobacterial diazotroph type 'Alpha_MH144511', nifH transcript | transcript copies per liter (L-1) |
| NCD_BT5667A01_nifH_transcript | Non-cyanobacterial diazotroph type 'BT5667A01', nifH transcript | transcript copies per liter (L-1) |
| NCD_OTU0014_Brady_nifH_transcript | Non-cyanobacterial diazotroph type 'OTU0014_Brady', nifH transcript | transcript copies per liter (L-1) |
| NCD_EVHVF_nifH_transcript | Non-cyanobacterial diazotroph type 'EVHVF', nifH transcript | transcript copies per liter (L-1) |
| NCD_Cluster_III_nifH_transcript | Non-cyanobacterial diazotroph type 'Cluster_III', nifH transcript | transcript copies per liter (L-1) |
| NCD_Cluster_III_L_nifH_transcript | Non-cyanobacterial diazotroph type 'Cluster_III_L', nifH transcript | transcript copies per liter (L-1) |
| NCD_D0CY3_nifH_transcript | Non-cyanobacterial diazotroph type 'D0CY3', nifH transcript | transcript copies per liter (L-1) |
| NCD_ECI27_nifH_transcript | Non-cyanobacterial diazotroph type 'ECI27', nifH transcript | transcript copies per liter (L-1) |
| NCD_AT2LU_nifH_transcript | Non-cyanobacterial diazotroph type 'AT2LU', nifH transcript | transcript copies per liter (L-1) |
| NCD_Arc_nifH_transcript | Non-cyanobacterial diazotroph type 'Arc', nifH transcript | transcript copies per liter (L-1) |
| NCD_g24774A11_nifH_transcript | Non-cyanobacterial diazotroph type 'g24774A11', nifH transcript | transcript copies per liter (L-1) |
| NCD_gammaA_nifH_transcript | Non-cyanobacterial diazotroph type 'gamma A', nifH transcript | transcript copies per liter (L-1) |
| NCD_AO15_nifH_transcript | Non-cyanobacterial diazotroph type 'AO15', nifH transcript | transcript copies per liter (L-1) |
| NCD_Azo_nifH_transcript | Non-cyanobacterial diazotroph type 'Azo', nifH transcript | transcript copies per liter (L-1) |
| NCD_ETSP2_nifH_transcript | Non-cyanobacterial diazotroph type 'ETSP2', nifH transcript | transcript copies per liter (L-1) |
| NCD_Gamma_1_nifH_transcript | Non-cyanobacterial diazotroph type 'Gamma_1', nifH transcript | transcript copies per liter (L-1) |
| NCD_Gamma_2_nifH_transcript | Non-cyanobacterial diazotroph type 'Gamma_2', nifH transcript | transcript copies per liter (L-1) |
| NCD_Gamma_3_nifH_transcript | Non-cyanobacterial diazotroph type 'Gamma_3', nifH transcript | transcript copies per liter (L-1) |
| NCD_Gamma_4_nifH_transcript | Non-cyanobacterial diazotroph type 'Gamma_4', nifH transcript | transcript copies per liter (L-1) |
| NCD_EQF91_nifH_transcript | Non-cyanobacterial diazotroph type 'EQF91', nifH transcript | transcript copies per liter (L-1) |
| NCD_ALHOU_nifH_transcript | Non-cyanobacterial diazotroph type 'ALHOU', nifH transcript | transcript copies per liter (L-1) |
| NCD_Vib_nifH_transcript | Non-cyanobacterial diazotroph type 'Vib', nifH transcript | transcript copies per liter (L-1) |
| NCD_OTU0001_Pseudo_nifH_transcript | Non-cyanobacterial diazotroph type 'OTU0001_Pseudo', nifH transcript | transcript copies per liter (L-1) |
| NCD_OTU0009_Kleb_nifH_transcript | Non-cyanobacterial diazotroph type 'OTU0009_Kleb', nifH transcript | transcript copies per liter (L-1) |
| sst | sea surface temperature; CMAP table: tblSST_AVHRR_OI_NRT. Metadata colocated using CMAP | degrees Celsius (degC) |
| sss | sea surface salinity; CMAP table: tblSSS_NRT | Practical Salinity Units (PSU) |
| chl | chlorophyl a; CMAP table: tblCHL_REP | milligrams per cubic meter (mg/m^3) |
| PAR | photosynthetically active radiation; CMAP table: tblModis_PAR | Einstein per meter squared per day (E m-2 day-1) |
| POC | particulate organic carbon; CMAP table: tblModis_POC | milligrams per cubic meter (mg/m^3) |
| wind_speed | wind speed; CMAP table: tblWind_NRT | meters per second (m/s) |
| wind_stress | wind stress; CMAP table: tblWind_NRT | pascals (Pa) |
| wind_stress_curl | wind stress curl; CMAP table: tblWind_NRT | pascals (Pa) |
| sla | sea level anomaly; CMAP table: tblAltimetry_REP | meters (m) |
| AOD | Aerosol Optical Depth; CMAP table: tblModis_AOD_REP | unitless |
| NO3 | nitrate; CMAP table: tblPisces_NRT | millimoles per cubic meter (mmol/m^3) |
| PO4 | phosphate; CMAP table: tblPisces_NRT | millimoles per cubic meter (mmol/m^3) |
| Fe | iron; CMAP table: tblPisces_NRT | millimoles per cubic meter (mmol/m^3) |
| O2 | oxygen; CMAP table: tblPisces_NRT | millimoles per cubic meter (mmol/m^3) |
| Si | silicate (SiO2); CMAP table: tblPisces_NRT | micromoles per liter (umol/L) |
| PP | primary productivity; CMAP table: tblPisces_NRT | grams per cubic meter per day (g/m^3/day) |
| PHYC | phytoplankton concentration; CMAP table: tblPisces_NRT | millimoles per cubic meter (mmol/m^3) |
| DIN | dissolved inorganic nitrogen (DIN); CMAP table: tblDarwin_Nutrient | millimoles (mmol DIN) |
| FeT | total iron (Fe); CMAP table: tblDarwin_Nutrient | millimoles (mmol Fe) |
| SiO2 | silicate (SiO2); CMAP table: tblDarwin_Nutrient | millimoles (mmol Si) |
| sea_water_temp_WOA_clim | sea surface temperature; CMAP table: tblWOA_Climatology | degrees Celsius (degC) |
| density_WOA_clim | sea surface density; CMAP table: tblWOA_Climatology | kilograms per cubic meter (kg/m^3) |
| salinity_WOA_clim | sea surface salinity; CMAP table: tblWOA_Climatology | Practical Salinity Units (PSU) |
| nitrate_WOA_clim | nitrate (NO3); CMAP table: tblWOA_Climatology | micromoles per liter (mcmol/L) |
| phosphate_WOA_clim | phosphate (PO4); CMAP table: tblWOA_Climatology | micromoles per liter (mcmol/L) |
| silicate_WOA_clim | silicate (SiO2); CMAP table: tblWOA_Climatology | micromoles per liter (mcmol/L) |
| oxygen_WOA_clim | oxygen; CMAP table: tblWOA_Climatology | milliliters per liter (mL/L) |
| AOU_WOA_clim | apparent oxygen utilization; CMAP table: tblWOA_Climatology | milliliters per liter (mL/L) |
| o2sat_WOA_clim | oxygen saturation; CMAP table: tblWOA_Climatology | percent (%) |
| conductivity_WOA_clim | conductivity; CMAP table: tblWOA_Climatology | Siemens per meter (S m-1) |
NSF Award Abstract:
Nitrogen (N) is an important element in the ocean that limits the growth of the microscopic marine plants, phytoplankton. Estimates suggest N inputs and losses may not be balanced in the modern ocean, and thus an underestimation of N inputs may explain this imbalance. The conversion of gaseous N2 to biologically available N (N2 fixation) is the largest source of new N to the ocean. It is possible that the "missing" N can be explained by identifying new sources of N2 fixation. N2 fixation relies on a group of microorganisms, termed "diazotrophs", that utilize N2 for growth, unlike other marine microorganisms. Diazotrophs fall into two groups, cyanobacterial diazotrophs, which are able to derive energy through photosynthesis, and non-cyanobacterial diazotrophs (NCDs), which require a non-light-based energy source. Next to nothing is known about the ecology and biology of NCDs, except that they are ubiquitous in the ocean and contain the nitrogen fixing gene, but no direct measurements of their N2 fixation activity exist. Recent molecular advances for studying organisms at the single cell level now makes the measurement of N2 fixation by NCDs possible. This study is focused on determining whether marine NCDs are actually fixing N2 in the environment and understanding how their N2 fixation is modulated. Determining if NCD activity is an important missing N source in the global oceans has the potential to fill a critical gap in our understanding of the marine N cycle. This project supports early career STEM researchers including a graduate student and a postdoctoral scientist, as well as undergraduate students through several programs including UCSC's California Alliance for Minority Participation (CAMP).
Nitrogen fixation, the microbial process of converting N2 into biologically available ammonia, is an important source of N in the oceans. Historically, research has focused on the most conspicuous diazotrophs, such as Trichodesmium, but the discovery of unicellular cyanobacterial and non-cyanobacterial diazotrophs (NCDs) in the open ocean revealed a broader diversity than previously thought. Much of what is known about NCDs is restricted to presence, abundance estimates and transcriptional activity from gene surveys. NCDs are globally distributed throughout coastal and oligotrophic environments, however, it is not known whether NCDs supply N to support primary productivity. Measurements of marine NCDs are needed to determine if NCDs are actively fixing N2. This study is focused on measuring single cell NCD N2 fixation rates from a variety of taxa living in well-lit, oxygen-rich coastal and oligotrophic surface waters in the North Pacific and Arctic Oceans. The investigators are using a cultivation-independent technique called geneFISH to microscopically visualize and localize NCDs and measuring the incorporation of 15N2 into single cells using nanoscale secondary ion mass spectrometry. Beyond measuring in situ NCD N2 fixation rates, experiments are being conducted to determine environmental controls on single cell NCD N2 fixation (light, temperature, dissolved organic matter, dissolved inorganic N, and iron). Obtaining single cell NCD N2 fixation rates from a range of taxa, under different experimental conditions and in coastal and oligotrophic environments will provide information to link their presence to N2 fixation activity, determine the quantitative significance of NCDs in the marine environment, and set the stage for their inclusion in biogeochemical models.
This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
| Funding Source | Award |
|---|---|
| NSF Division of Ocean Sciences (NSF OCE) | |
| NSF Division of Ocean Sciences (NSF OCE) |