Additional Resources: Data Repositories
Here is a partial list of data repositories where you can find datasets on nearly every subject in environmental science. This list is credited to the LTER Network’s Synthesis Skills for Early Career Researchers (SSECR) course and Ecological Data Synthesis: A Primer on Essential Methods course. The data repository’s associated R package is listed as well. The R packages can be installed from either CRAN or GitHub.
Name | Description | R Package |
---|---|---|
AmeriFlux | Provides data on carbon, water, and energy fluxes in ecosystems across the Americas, aiding in climate change and carbon cycle research. | amerifluxr |
DataONE | A network of around 60 data repositories. Aggregates environmental and ecological data from global sources, focusing on biodiversity, climate, and ecosystem research. | dataone |
Daymet | Daymet provides long-term, continuous, gridded estimates of daily weather and climatology variables for North America. | daymetr |
EDI | Contains a wide range of ecological and environmental datasets, including long-term observational data, experimental results, and field studies from diverse ecosystems. | EDIutils |
EES-DIVE | The Environmental System Science Data Infrastructure for a Virtual Ecosystem (ESS-DIVE) includes a variety of observational, experimental, modeling and other data products from a wide range of ecological and urban systems. | – |
GBIF | The Global Biodiversity Information Facility (GBIF) aggregates global species occurrence data and biodiversity records, supporting research in species distribution and conservation. | rgbif |
Google Earth Engine | Google Earth Engine is a cloud-based geospatial analysis platform that provides access to vast amounts of satellite imagery and environmental data for monitoring and understanding changes in the Earth’s surface. | rgee |
KNB | An open-source data repository hosting international ecological and environmental research. A member of the DataOne network. | dataone |
Microsoft Planetary Computer | The Microsoft Planetary Computer is a cloud-based platform that combines global environmental datasets with advanced analytical tools to support sustainability and ecological research. | rstac |
NASA | Provides data on earth science, space exploration, and climate, including satellite imagery and observational data for both terrestrial and extraterrestrial studies. Nice GUI-based data download via AppEEARS. | nasadata |
NCBI | Hosts genomic and biological data, including DNA, RNA, and protein sequences, supporting genomics and molecular biology research. | rentrez |
NEON | Provides ecological data from U.S. field sites, covering biodiversity, ecosystems, and environmental changes, supporting large-scale ecological research. | neonUtilities |
NOAA | Offers meteorological, oceanographic, and climate data, essential for understanding atmospheric conditions, marine environments, and long-term climate trends. | GitHub: EpiNOAA-R |
Open Traits Network | While not a repository per se, the Open Traits Network has compiled an extensive lists of repositories for trait data. Check out their repository inventory for trait data | – |
PhenoCam Network | A network of digital cameras that tracks vegetation phenology through images across North America and around the world. | phenocamapi |
US Census Bureau | Census data containing information about education, employment, health, and housing across America. | tidycensus |
USGS | Hosts data on geology, hydrology, biology, and geography, including topographical maps and natural resource assessments. | dataRetrieval |
US National Park Service | Provides geospatial and tabular data products, which includes national park boundaries, vegetation maps, geology maps, air quality, and water quality. | GitHub: NPSutils |