EcoTaxa image output from UVP5 of particles and plankton collected from CTD casts during four US GO-SHIP cruises from 2018 to 2022

Website: https://www.bco-dmo.org/dataset/959827
Data Type: Cruise Results
Version: 1
Version Date: 2025-04-24

Project
» CAREER: Imaging the global patterns and drivers of the ocean's biological carbon pump (Imaging the Biological Carbon Pump)
ContributorsAffiliationRole
McDonnell, Andrew M.P.University of Alaska Fairbanks (UAF)Principal Investigator
Lekanoff, RachelUniversity of Alaska Fairbanks (UAF)Scientist
O'Daly, StephanieUniversity of Alaska Fairbanks (UAF)Scientist, Student
Pretty, JessicaPrince William Sound Science Center (PWSSC)Scientist
York, Amber D.Woods Hole Oceanographic Institution (WHOI BCO-DMO)BCO-DMO Data Manager

Abstract
This dataset consists of EcoTaxa outputs from UVP5 samples of particles and plankton 0 to 6000 m on CTD casts. This dataset includes zooplankton biomass, taxonomic composition, and abundance as well as detrital particle biomass, classification, and abundance classified using the Morphocluster technique (Schroder et al., 2020) and Ecotaxa machine learning assistance (Picheral, 2017). The goal of this data was to assess high-resolution vertical distribution of particles and plankton in the global oceans in relation to other parameters collected through the US GO-SHIP program. The cruises consisted of S04P in the Pacific Sector of the Southern Ocean in 2018, I06S in the African Sector of the Southern Ocean, A22 in the Atlantic Ocean in 2021, and P2 in the Pacific Ocean in 2022.


Coverage

Location: Pacific Sector of the Southern Ocean, African Sector of the Southern Ocean, NE Atlantic Ocean, N Pacific Ocean
Spatial Extent: N:40.0107 E:295.0645 S:-75.289 W:159.912109
Temporal Extent: 2018-03-13 - 2022-07-15

Dataset Description

See the closely related datasets listed under the section heading "Related Datasets." In particular, BCO-DMO dataset (960033): "EcoPart particle output from UVP5 of particles and plankton collected from CTD casts during four US GO-SHIP cruises from 2018 - 2022" https://www.bco-dmo.org/dataset/960033

The related CTD data are available at CCHDO:

S04P: https://cchdo.ucsd.edu/cruise/320620180309 (doi: 10.7942/C2F08X)
I06s: https://cchdo.ucsd.edu/cruise/325020190403 (doi: 10.7942/C29660)
A22: https://cchdo.ucsd.edu/cruise/325020210420 (doi: N/A)
P2 Leg 1: https://cchdo.ucsd.edu/cruise/33RR20220430 (doi: N/A) - Identified at CCHDO as P02W
P2 Leg 2: https://cchdo.ucsd.edu/cruise/33RR20220613 (doi: N/A) - Identified at CCHDO as P02E


Methods & Sampling

The Underwater Vision Profiler 5 (UVP5) was utilized to collect in situ images of particles and plankton across the four repeat hydrography transects. The UVP5 was integrated within the conductivity temperature depth (CTD) rosette, and several images per second were acquired during the downcasts spanning from the surface down to near the bottom or 6000 m. The UVP captured in situ images of particles and plankton in a mixed processing mode in which particles were sized, counted, and tabulated in real time during the profile. Subsequent processing bins these particle counts into discrete, pre-defined size bins ranging from 102 µm to >26 mm in equivalent spherical diameter. Particle biovolume was also computed for each size bin. All data contained here are inclusive of both living and non-living particles and plankton. 

Images larger than about 500 µm in equivalent spherical diameter were saved and imported into Morphocluster, an unsupervised clustering algorithm until 91% classification was achieved. Then the resulting classified and unclassified data was imported to the online Ecotaxa database and image sorting tool (https://ecotaxa.obs-vlfr.fr/). There, unclassified images were first sorted into predicted taxonomic categories using a machine learning algorithm, and then predicted zooplankton images from a subset of vertical profiles were manually validated into the most appropriate taxonomic categories. The limited resolution of the images often precludes definitive taxonomic assignment. As such, data users should explore the actual images and their associated classifications by accessing the project via the Ecotaxa website in order to determine whether or not the taxonomic data is correct and appropriate for their purposes. Data was exported from Ecotaxa on 2024-09-05.


Data Processing Description

The UVP5 software acquires and processes images in real time. The gain, shutter and LED pulses are controlled and the background image is removed. Images are acquired and processed to get size and grey level for each image. Size information on all detected particles is stored and images of individual particles and plankton larger than 500 µm in equivalent spherical diameter are segmented and saved for later identification. Image post processing and metadata creation is accomplished with the Zooprocess software (Version 7.39). Tabulated particle data are used to sum the number and volume of particles within predefined size and depth bins, allowing for the computation of the Datasets. Data and images have been uploaded to the Ecotaxa website (http://ecotaxa.obs-vlfr.fr/) which serves as a tool for particle and zooplankton identification with machine learning and human verification, as well as a repository for all globally collected UVP data. Data files for particle and zooplankton abundances are exported from the Ecotaxa particle module in detailed format. The size distribution data is reported in non-differential forms (simply the concentration of particles in each size bin) as well as on a numeric and particle volume basis assuming all particles are spherical. The size bin limits are defined in equivalent spherical diameter (ESD), where ESD = (4 Sm π-1)-0.5 where Sm is the projected area of each particle in mm² and the particle concentrations are reported for each size bin defined by the log-transformed center of each size bin in µm. The biovolume is computed as the sum of the individual spherical volumes of each particle issued from the calibrated ESD.


BCO-DMO Processing Description

* Data tables within "*export_detailed_.txt" files were not able to be imported into the BCO-DMO data system directly due to incompatibilities in lat/lon and datetime formats in this and related BCO-DMO dataset "960033" "EcoPart particle output from UVP5..."
(see problems and issues section)

Data attached to this dataset include the original format submitted to BCO-DMO:

* Ecotaxa images provided as .zip files attached as "Data Files" to this dataset.
* Ecotaxa output data files (one per cruise) was zipped into Data File "EcoTaxa_Export_Files.zip".
* Example images attached as supplemental files

Supplemental Cruise metadata:

* Individual cruise metadata text files were concatenated into the supplementary file "cruise_metadata.csv". Additional columns were added to the combined table "Cruise_ID_R2R" to clarify which cruise id in the dataset corresponds to the cruise identifier at the Rolling Deck to Repository (R2R). See problems/issues section, which documents the format of latitude and longitude in this table.


Problem Description

Note that some values in the "process_date" field may contain an additional ".0" suffix (example: ecotaxa_export_5Sept2024.txt value "20180823.0"

Note that the latitude and longitude provided are not decimal degrees but rather degrees followed by a decimal point, followed by two digits for minutes, followed by digits indicating decimal minutes. Example: 17.37818 indicates 17° 37.818 (which is the equivalent of degrees, minutes, seconds: 17° 37' 49.08"). This format was entered as indicated by the version of the Zooprocess manual that was referenced at the time the data were processed.

[ table of contents | back to top ]

Data Files

File
Ecotaxa export text file (Sept 5th, 2024 export)
filename: ecotaxa_export_5Sept2024.txt
(Plain Text, 6.00 GB)
MD5:b4ebc3e9e29b77d222315b3ae4b76c42
Detailed Ecotaxa export document with classifications downloaded on September 5, 2024. See the Methods & Sampling and Data Processing sections for more details.

The table within this file is tab-separated.
Cruise A22 EcoTaxa export txt file
filename: a22_ecotaxa_export.txt
(Plain Text, 1.35 GB)
MD5:8c9752ccc045bbac8eedf2b50ac74d2c
Detailed Ecotaxa export document with classifications for Cruise A22. See the Methods & Sampling and Data Processing sections for more details.

The table within this file is tab-separated.
Cruise i06s EcoTaxa Export Text File
filename: i06s_ecotaxa_export_5Sept2024.txt
(Plain Text, 901.02 MB)
MD5:54f9175659dd95968b13dacc9289e773
Detailed Ecotaxa export document with classifications for Cruise i06s. See the Methods & Sampling and Data Processing sections for more details.

The table within this file is tab-separated.
Cruise s04p EcoTaxa Export Text File
filename: s04p_ecotaxa_export_5Sept2024.txt
(Plain Text, 2.47 GB)
MD5:ed16957ac460e6774edd730692622241
Detailed Ecotaxa export document with classifications for Cruise s04p. See the Methods & Sampling and Data Processing sections for more details.

The table within this file is tab-separated.
Cruise P2 Ecotaxa Export Text File
filename: p2_ecotaxa_export.txt
(Plain Text, 748.49 MB)
MD5:73e127dbefb4dc21798af3d4c956e409
Detailed Ecotaxa export document with classifications for Cruise P2. See the Methods & Sampling and Data Processing sections for more details.

The table within this file is tab-separated.
uvp5_sn207_2019_i06s_tcn322.zip
(ZIP Archive (ZIP), 11.62 GB)
MD5:074b114fde27bb8c88ed7b0b80dd27f7
Zipped folder structure with all UVP images from I06s cruise. The work folder has all of the images in separate folders for each cast.
uvp5_sn207_2018_s04p.zip
(ZIP Archive (ZIP), 24.19 GB)
MD5:bddee77549f09b66e05d796613dfd0b1
Zipped folder structure with all UVP images from S04p cruise. The work folder has all of the images in separate folders for each cast.
A22_work.zip
(ZIP Archive (ZIP), 4.52 GB)
MD5:3ad28e366c36667adf8ee073e4b6bf0c
Zipped folder structure with all UVP images from A22 cruise. The work folder has all of the images in separate folders for each cast. The metadata file has metadata for each cast associated with the images.
P2_work.zip
(ZIP Archive (ZIP), 8.09 GB)
MD5:d8c34421189d73e8213fb51b8460eb1c
Zipped folder structure with all UVP images from P2 cruise. The work folder has all of the images in separate folders for each cast. The metadata file has metadata for each cast associated with the images

[ table of contents | back to top ]

Supplemental Files

File
Copepod_i06s_026_1749
filename: i06s_026_1749.jpg
(JPEG Image (.jpg), 9.28 KB)
MD5:6eb15d8c515f19954945c1947afe5843
Example vignette of a copepod. See methodology for more details about how these are used in Morphocluster and Ecotaxa and how the vignettes are extracted from raw UVP images
Cruise and Ecotaxa Project information
filename: cruise_and_ecotaxa_projectIDs.csv
(Comma Separated Values (.csv), 790 bytes)
MD5:8d25acba0fa5409cb76bf319dc9fa6eb
Cruise, EcoPart and Ecotaxa Project information table (related to BCO-DMO datasets 960033 and 959827).

Columns:

EcoPart_export_file, EcoPart export filename contained within EcoPart_Exports_Detailed_PAR_odv.zip
Cruise_ID, Cruise identifier
Cruise_Note, additional note about cruise (e.g. leg 1 or 2)
Cruise_ID_R2R, Cruise identifier as used at rolling deck to repository (R2R)
Cruise_start, cruise start date
Cruise_end, cruise end date
Ecotaxa_project, ecotaxa project name
Ecotaxa_proj_link, ecotaxa project link
Cruise metadata summary table
filename: cruise_metadata.csv
(Comma Separated Values (.csv), 33.80 KB)
MD5:0f48c9a95578912b12ea76ddd207713e
Cruise metadata summary table.

Columns:

Cruise_ID, Cruise identifier
Cruise_ID_R2R, Cruise identifier (as appears at Rolling Deck to Repository (R2R)
Cast_Latitude, cast latitude, format non-standard (see "Problems and Issues section")
Cast_Longitude, cast longitude, decimal degrees, format non-standard (see "Problems and Issues section")
Station, station
ISO_DateTime_UTC, Datetime with timezone (UTC) in ISO 8601 format
Min_Depth, Minimum depth, meters (m)
Maximum_Depth, Maximum depth, meters (m)
Sample_ID_numbers, Sample ID (e.g. ctd063)
Hydrozoa_i06s_030_4956
filename: i06s_030_4956.jpg
(JPEG Image (.jpg), 3.87 KB)
MD5:779df457945a6b4a51f51fd997678412
Example vignette of a hydrozoa. See methodology for more details about how these are used in Morphocluster and Ecotaxa and how the vignettes are extracted from raw UVP images
Medusettidae_i06s_033_5018
filename: i06s_033_5018.jpg
(JPEG Image (.jpg), 4.37 KB)
MD5:7dc3eb37da829e4df17d85607266f93d
Example vignette of a Medusettidae Rhizaria. See methodology for more details about how these are used in Morphocluster and Ecotaxa and how the vignettes are extracted from raw UVP images
uvp5_header_sn201_2022_p2.txt
(Plain Text, 40.23 KB)
MD5:eca3c21d2ff01ef7d89ecd2eb8b061af
P2 metadata file for each cast. This is a header file (also referred to as the "metadata file" that was created in Zooprocess from the raw data that's initially created when the UVP is run. The header file is necessary when you upload data to Ecopart and Ecotaxa. The description of this file is documented in the UVP users manual. It was described (on pages 51 - 53) in the UVP manual that existed at http://sites.google.com/view/piqv/piqv-manuals/instruments-manuals when this dataset was processed.
uvp5_header_sn207_2018_s04p.txt
(Plain Text, 15.92 KB)
MD5:d2eb399f07fdb1deeeb81733124249c0
sn207 2018 metadata file for each cast. This is a header file (also referred to as the "metadata file" that was created in Zooprocess from the raw data that's initially created when the UVP is run. The header file is necessary when you upload data to Ecopart and Ecotaxa. The description of this file is documented in the UVP users manual. It was described (on pages 51 - 53) in the UVP manual that existed at http://sites.google.com/view/piqv/piqv-manuals/instruments-manuals when this dataset was processed.
uvp5_header_sn207_2019_i06s_tcn322.txt
(Plain Text, 5.77 KB)
MD5:6a3867ec6d9ee3a52e34d338d5bc4464
sn207 2019 metadata file for each cast. This is a header file (also referred to as the "metadata file" that was created in Zooprocess from the raw data that's initially created when the UVP is run. The header file is necessary when you upload data to Ecopart and Ecotaxa. The description of this file is documented in the UVP users manual. It was described (on pages 51 - 53) in the UVP manual that existed at http://sites.google.com/view/piqv/piqv-manuals/instruments-manuals when this dataset was processed.
uvp5_header_sn207_2021_a22.txt
(Plain Text, 14.84 KB)
MD5:97d63e2e230efd8b19530ecdcf7e350c
A22 metadata file for each cast. This is a header file (also referred to as the "metadata file" that was created in Zooprocess from the raw data that's initially created when the UVP is run. The header file is necessary when you upload data to Ecopart and Ecotaxa. The description of this file is documented in the UVP users manual. It was described (on pages 51 - 53) in the UVP manual that existed at http://sites.google.com/view/piqv/piqv-manuals/instruments-manuals when this dataset was processed.

[ table of contents | back to top ]

Related Publications

Picheral, M., Guidi, L., Stemmann, L., Karl, D. M., Iddaoud, G., & Gorsky, G. (2010). The Underwater Vision Profiler 5: An advanced instrument for high spatial resolution studies of particle size spectra and zooplankton. Limnology and Oceanography: Methods, 8(9), 462–473. doi:10.4319/lom.2010.8.462
Methods
Quantitative Imaging Platform of Villefranche sur Mer (n.d.) Instruments manuals and tools: UVP5 Manual. Available from http://sites.google.com/view/piqv/piqv-manuals/instruments-manuals
Methods
Schröder, S.-M., Kiko, R., & Koch, R. (2020). MorphoCluster: Efficient Annotation of Plankton Images by Clustering. Sensors, 20(11), 3060. https://doi.org/10.3390/s20113060
Methods

[ table of contents | back to top ]

Related Datasets

IsRelatedTo
Macdonald, A. & Briggs, E. (2018). S04P 2018 [Data set]. CCHDO: CLIVAR and Carbon Hydrographic Data Office. https://cchdo.ucsd.edu/cruise/320620180309. doi:10.7942/C2F08X
Macdonald, A. & Tan, S. (2022). Hydrographic Cruise: 33RR20220430 (P02W) [Data set]. CCHDO: CLIVAR and Carbon Hydrographic Data Office. https://cchdo.ucsd.edu/cruise/33RR20220430
Menezes, V. & Anderson, J. (2021). A22 [Data set]. CCHDO: CLIVAR and Carbon Hydrographic Data Office. https://cchdo.ucsd.edu/cruise/325020210420
O'Daly, S., McDonnell, A. M., Pretty, J., Lekanoff, R. (2025) EcoPart particle output from UVP5 of particles and plankton collected from CTD casts during four US GO-SHIP cruises from 2018 - 2022. Biological and Chemical Oceanography Data Management Office (BCO-DMO). (Version 1) Version Date 2025-05-01 doi:10.26008/1912/bco-dmo.960033.1 [view at BCO-DMO]
Relationship Description: EcoPart output from UVP5 of particles and plankton from the Ecotaxa projects.
Orsi, A. & Rosso, I. (2019). I06 2019 [Data set]. CCHDO: CLIVAR and Carbon Hydrographic Data Office. https://cchdo.ucsd.edu/cruise/325020190403. doi: 10.7942/C29660
Thurnherr, A. & Bigorre, S. (2022). Hydrographic Cruise: 33RR20220613 (P02E) [Data set]. CCHDO: CLIVAR and Carbon Hydrographic Data Office. https://cchdo.ucsd.edu/cruise/33RR20220613
Methods
Picheral M, Colin S, Irisson J-O (2017). EcoTaxa, a tool for the taxonomic classification of images. http://ecotaxa.obs-vlfr.fr

[ table of contents | back to top ]

Parameters

Parameters for this dataset have not yet been identified


[ table of contents | back to top ]

Instruments

Dataset-specific Instrument Name
Generic Instrument Name
Underwater Vision Profiler 5
Dataset-specific Description
SN207 and SN201 A description of the UVP instrument can be found in the following publication: Picheral, M., L. Guidi, L. Stemmann, D. M. Karl, G. Iddaoud, and G. Gorsky. 2010. The Underwater Vision Profiler 5: An advanced instrument for high spatial resolution studies of particle size spectra and zooplankton. Limnol. Oceanogr. Meth. 8: 462-473. (access the PDF at URL: http://cmore.soest.hawaii.edu/cmoredata/LMO/Guidi/Picheral_2010.pdf)
Generic Instrument Description
A description of the UVP5 instrument can be found in the following publication: Picheral, M., L. Guidi, L. Stemmann, D. M. Karl, G. Iddaoud, and G. Gorsky. 2010. The Underwater Vision Profiler 5: An advanced instrument for high spatial resolution studies of particle size spectra and zooplankton. Limnol. Oceanogr. Meth. 8: 462-473. (doi: 10.4319/lom.2010.8.462)


[ table of contents | back to top ]

Deployments

NBP1802

Website
Platform
RVIB Nathaniel B. Palmer
Start Date
2018-03-09
End Date
2018-05-14

TN366

Website
Platform
R/V Thomas G. Thompson
Start Date
2019-04-03
End Date
2019-05-14
Description
Project: Repeat Hydro/CO2

TN390

Website
Platform
R/V Thomas G. Thompson
Start Date
2021-04-20
End Date
2021-05-16

RR2204

Website
Platform
R/V Roger Revelle
Start Date
2022-04-30
End Date
2022-06-10
Description
Project: US GO-SHIP P02

RR2205

Website
Platform
R/V Roger Revelle
Start Date
2022-06-13
End Date
2022-07-16
Description
Project: US GO-SHIP P02

SR1812

Website
Platform
R/V Sally Ride
Start Date
2018-08-09
End Date
2018-09-13
Description
Additional cruise information is available from the Rolling Deck to Repository (R2R): https://www.rvdata.us/search/cruise/SR1812


[ table of contents | back to top ]

Project Information

CAREER: Imaging the global patterns and drivers of the ocean's biological carbon pump (Imaging the Biological Carbon Pump)

Coverage: Global


NSF Award Abstract:
Microscopic plants and animals in the surface ocean remove the atmospheric carbon dioxide dissolved in seawater by using it to make biological materials. After organisms die, some portion of this carbon sinks into the deep sea where it dissolves back into the water or lands on the seafloor. The Biological Carbon Pump is the name for this carbon transfer out of the sunlit surface ocean, and it is a very important process controlling the earth's carbon cycle and climate. Surprisingly, the size and rate of this transfer remains unclear because data for carbon in particles are not available for many times and places in the ocean. The researcher for this project plans to collect a remarkable new data set and examine it to answer major questions about the Biological Carbon Pump. The data will come from joining existing cruises on research ships sailing through all of the world's oceans, creating a systematic global survey of particles in the ocean. These data will be the heart of a new public database for use by other international researchers in their continued study of the Biological Carbon Pump. For this young researcher, the project will also lay the foundation for a career of integrated research, education, and outreach in oceanography. An Alaskan native will go to sea and analyze data, using these results to complete a Ph.D. degree. In addition, the project will design and produce a new aquarium exhibit at the Alaska SeaLife Center (ASLC) to educate the public about the complex workings and global importance of the Biological Carbon Pump. These activities should increase diversity in oceanography and inspire a new generation of scientists.

Reliable and useful data on the Biological Carbon Pump is sparse, mostly due to the experimental, logistical, and technical challenges of studying a complex system over the great expanse of the global ocean. This project aims to tackle a fundamental and ongoing problem in studying the Biological Carbon Pump by providing a global and consistent dataset on particles and plankton in the ocean. In collaboration with the US Repeat Hydrography program, this project will use in situ imaging technology to determine the total abundance, size distribution, and functional groups of particles and mesozooplankton along seven global ocean transects. The design will test fundamental hypotheses related to the presence and nature of regional particle hotspots and examine the global patterns of zooplankton activity and vertical migration as they affect the biological pump. Links between satellite data, calculated flux patterns, attenuation of particles through the mesopelagic, and constraints on biogeochemical models will also be investigated.



[ table of contents | back to top ]

Funding

Funding SourceAward
NSF Division of Ocean Sciences (NSF OCE)

[ table of contents | back to top ]