Input data and code to test hypotheses between genetic and environmental effects on phenotypes

Website: https://www.bco-dmo.org/dataset/963375
Data Type: model results
Version: 1
Version Date: 2025-06-05

Project
» RCN: Evolution in Changing Seas (RCN ECS)
ContributorsAffiliationRole
Lotterhos, KatieNortheastern UniversityPrincipal Investigator
Trussell, Geoffrey C.Northeastern UniversityCo-Principal Investigator
Albecker, MollyNortheastern UniversityScientist
Soenen, KarenWoods Hole Oceanographic Institution (WHOI BCO-DMO)BCO-DMO Data Manager

Abstract
The purpose of this study was to apply a quantitative measure to phenotypic data that were collected by other studies to test hypotheses about natural prevalence and patterns of Covariance between genotypic and environmental effects on phenotypes (CovGE).


Methods & Sampling

There were three major phases to this study which are pertinent for data storage and collection. 

1. The first phase was a quantitative review in which the literature was systematically surveyed, assessed for inclusion, and relevant data extracted. This process resulted in the dataset "Extraction_Initialize.csv", which contains metadata from all studies initially considered for data collection. In some cases, studies were later deemed inappropriate and excluded from subsequent phases, but their metadata remains in this file. Once data collection was completed for the 102 studies that met the inclusion criteria, all phenotypic data—used in the second phase—were compiled into a single .csv file, "Meta_data_checked.csv". 

2. The second phase involved data analysis, where the phenotypic data and metadata were imported into R, formatted, and processed using a for-loop to estimate CovGE and ΔG×E for each phenotype independently. This loop fully implements CovGE estimation following the methods outlined in Albecker et al. (2022) and also generates confidence intervals for CovGE and p-values for ΔG×E. This analysis was executed in lines 1–220 of the "CovGE_Meta_Analysis.R" script, with essential functions housed in the companion script "CovarianceDataFunctions.R". Results of this analysis were saved as, "Meta_output_Nov27_2024.csv" 

3. The third phase involved testing the questions/hypotheses outlined in the manuscript (e.g., Is CnGV or CoGV more prevalent in nature?). These analyses were conducted in the "CovGE_Meta_Analysis.R" script, following the conclusion of the primary analysis on Line 220. This phase also included all plotting and formatting code. Any modifications made to plots in PowerPoint were documented with notes. All code is fully annotated for clarity and reproducibility.

 

Files in this dataset:

1. Code file - "CovGE_Meta_Analysis.R" - R code including all analyses, data formatting, and plotting. See phase 2 and 3 
2. Code file - "CovarianceDataFunctions.R" - R companion code storing all essential functions for analyses. See phase 2 and 3.  Compiled dataset with the extracted phenotypic, genotypic, and environmental data for each study included in quantitative review. See Phase 1 in the description of Contents for additional details. 


[ table of contents | back to top ]

Data Files

File
963375_v1_metadatachecked.csv
(Comma Separated Values (.csv), 6.28 MB)
MD5:9d365212d6e09dfdb839e571c5233d91
Primary data file for dataset ID 963375, version 1

[ table of contents | back to top ]

Supplemental Files

File
CovGE_Functions
filename: CovarianceDataFunctions.R
(R Script, 26.19 KB)
MD5:f2961d321c9a928f278adfc6804a30c7
R companion code storing all essential functions for analyses.
CovGE_MainCode
filename: CovGE_Meta_Analysis.R
(R Script, 43.41 KB)
MD5:4e136c6b31afdbbfd52fb94b636d828c
Analysis code

[ table of contents | back to top ]

Related Datasets

IsRelatedTo
Albecker, M., Trussell, G. C., Lotterhos, K. (2025) Metadata for studies from meta-analysis investigating covariance between genetic and environmental (CovGE) effects in phenotypic results. Biological and Chemical Oceanography Data Management Office (BCO-DMO). (Version 2) Version Date 2025-06-05 doi:10.26008/1912/bco-dmo.877414.2 [view at BCO-DMO]
Relationship Description: Dataset 877414 is the result of literature analysis and basis for the input file of dataset 963375
Albecker, M., Trussell, G. C., Lotterhos, K. (2025) Results from a meta-analysis investigating covariance between genetic and environmental (CovGE) effects in phenotypic results in published literature. Biological and Chemical Oceanography Data Management Office (BCO-DMO). (Version 2) Version Date 2025-06-05 doi:10.26008/1912/bco-dmo.877425.2 [view at BCO-DMO]
Relationship Description: Dataset 877425 is the output file of phase 2 of the project.

[ table of contents | back to top ]

Parameters

ParameterDescriptionUnits
Study_ID_phenotype

Identifier combining study id and phenotype

unitless
First_Author

First name of author of study

unitless
gen_factor

Assigned name for genotype with "G_" and numeric

unitless
nat_env_factor

The native environment of each genotype (started with N_ and switched to E_ later - in code, all are switched to E_).

unitless
exp_env_factor

Assigned categorical label for each environmental treatment - one must correspond with a native environment. These all begin with "E_" and end with numeric.

unitless
phen_n

How many samples were collected in each genotype/environment

unitless
phen_data

The data for the phenotype from the genotype in that environment

unitless
phen_SD

If means and standard deviation were collected, enter SD in this column

unitless
phen_mean_SE

If means and standard error were collected, enter SE in this column

unitless
phen_mean_lowCI_095

If means and confidence intervals were collected, enter lower CI value in this column

unitless
phen_mean_highCI_095

If means and confidence intervals were collected, enter upper CI value in this column

unitless
Source

Lists original, unique xlsx file

unitless

[ table of contents | back to top ]

Project Information

RCN: Evolution in Changing Seas (RCN ECS)


Coverage: United States


NSF abstract:
How marine species will react to changing environment and climate is not well understood. While the interaction between oceanographic and ecological processes has yielded considerable insight into the ecology of marine species, the evolutionary responses of marine species are not well integrated into this framework. This project research coordinated network on "Evolution in Changing Seas" (ECSRCN), will bring marine scientists together with evolutionary biologists having expertise in population genetics, eco-evolutionary dynamics, and phylogenetics to better understand and predict the evolutionary responses of marine species to climate stressors. ECS-RCN will increase the impact of evolutionary studies in marine systems through increased collaboration among scientists from diverse fields. Furthermore, the empirical robustness of these studies will also be improved through the development of standards for experimental design and statistical analysis, especially for genomics data analysis. ECS-RCN will build a diverse network through a dedicated workshop for early-career participants, by advertising with diversity groups, and by dedicating funds to increase diversity. This project will support one postdoctoral researcher who will play a key role in coordinating scientific activities of the network as well as receive interdisciplinary training through network activities, strongly positioning them to become a leader in the field. ECS-RCN will also build the foundation for a lasting network through establishment of a listserv, open access to publications, development of a website, and development of teaching modules for undergraduate and graduate curriculum.

Specifically, ECS-RCN will consider how coupling between oceanographic and evolutionary processes shape adaptive and plastic responses to climate change, from the fundamental level of genomes scaled up to entire populations. Under this theme, the objectives of ECS-RCN are to synthesize the current state of knowledge, to prioritize lines of inquiry that will advance knowledge in marine and evolutionary biology, to determine the appropriate experimental designs and statistical approaches for robustly testing these lines of inquiry (including genomics approaches), and to build a foundation for a diverse and lasting network. These goals will be realized over the course of 3 years, starting with a Synthesis Workshop in Year 1 where working groups will be established, followed by working group meetings and formation of a Genomics Subcommittee in Year 2, and ending with an Integration and Training Workshop aimed at early career scientists in Year 3. To promote synthesis and self-organization at workshops, the workshops will employ the Open Space format. ECS-RCN will promote evolutionary thinking in biological oceanography and integrate unique aspects of marine life-histories into evolutionary principles. ECS-RCN will also advance knowledge in both marine and evolutionary biology through synthesis and the development of frameworks for merging genomics and ecology. The activities will provide novel insights into pressing questions in both marine and evolutionary ecology, such as: what drives geographic patterns of local (mal)adaptation and plasticity?; what are the mechanisms that generate adaptive vs. nonadaptive plasticity?; what is the role of genotype dependent dispersal in adaptation?; what are the genetic constraints on adaptation of function-valued traits to climate change?; and how do epigenetic modifications act as a mediator between adaptation and plasticity? Ultimately, the RCN aims to develop a quantitative understanding of the relative importance of ecological versus evolutionary responses to climate change.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.



[ table of contents | back to top ]

Funding

Funding SourceAward
NSF Division of Ocean Sciences (NSF OCE)

[ table of contents | back to top ]