Process-guided deep learning water temperature predictions: 6 Model evaluation (test data and RMSE)

This dataset includes evaluation data ("test" data) and performance metrics for water temperature predictions from multiple modeling frameworks. Process-Based (PB) models were configured and calibrated with training data to reduce root-mean squared error. Uncalibrated models used default configurations (PB0; see Winslow et al. 2016 for details) and no parameters were adjusted according to model fit with observations. Deep Learning (DL) models were Long Short-Term Memory artificial recurrent neural network models which used training data to adjust model structure and weights for temperature predictions (Jia et al. 2019). Process-Guided Deep Learning (PGDL) models were DL models with an added physical constraint for energy conservation as a loss term. These models were pre-trained with uncalibrated Process-Based model outputs (PB0) before training on actual temperature observations. Performance was measured as root-mean squared errors relative to temperature observations during the test period. Test data include compiled water temperature data from a variety of sources, including the Water Quality Portal (Read et al. 2017), the North Temperate Lakes Long-TERM Ecological Research Program (https://lter.limnology.wisc.edu/), the Minnesota department of Natural Resources, and the Global Lake Ecological Observatory Network (gleon.org). This dataset is part of a larger data release of lake temperature model inputs and outputs for 68 lakes in the U.S. states of Minnesota and Wisconsin (http://dx.doi.org/10.5066/P9AQPIVD).

Data and Resources

Field Value
accessLevel public
bureauCode {010:00}
catalog_@context https://project-open-data.cio.gov/v1.1/schema/catalog.jsonld
catalog_conformsTo https://project-open-data.cio.gov/v1.1/schema
catalog_describedBy https://project-open-data.cio.gov/v1.1/schema/catalog.json
identifier 9398d538-2097-4924-b252-54ae9eb5422a
metadata_type geospatial
modified 2020-08-20
old-spatial {"type": "Polygon", "coordinates": [[[-94.2609062308, 42.5692312673], [-87.9475441739, 42.5692312673], [-87.9475441739, 48.6427837912], [-94.2609062308, 48.6427837912], [-94.2609062308, 42.5692312673]]]}
publisher Climate Adaptation Science Centers
resource-type Dataset
source_datajson_identifier true
source_hash b08a087d7ac5faf605dcb0d51841a161ba04e7e1
source_schema_version 1.1
spatial {"type": "Polygon", "coordinates": [[[-94.2609062308, 42.5692312673], [-87.9475441739, 42.5692312673], [-87.9475441739, 48.6427837912], [-94.2609062308, 48.6427837912], [-94.2609062308, 42.5692312673]]]}
theme {geospatial}
Groups
  • AmeriGEOSS
  • National Provider
  • North America
Tags
  • amerigeo
  • amerigeoss
  • ckan
  • climate-change
  • deep-learning
  • geo
  • geoss
  • hybrid-modeling
  • machine-learning
  • modeling
  • national
  • north-america
  • reservoirs
  • temperate-lakes
  • temperature
  • thermal-profiles
  • united-states
  • us
  • usgs-5d925023e4b0c4f70d0d0594
  • water
isopen False
license_id notspecified
license_title License not specified
maintainer U.S. Geological Survey (Point of Contact)
maintainer_email jread@usgs.gov
metadata_created 2025-11-21T23:26:37.226794
metadata_modified 2025-11-21T23:26:37.226798
notes This dataset includes evaluation data ("test" data) and performance metrics for water temperature predictions from multiple modeling frameworks. Process-Based (PB) models were configured and calibrated with training data to reduce root-mean squared error. Uncalibrated models used default configurations (PB0; see Winslow et al. 2016 for details) and no parameters were adjusted according to model fit with observations. Deep Learning (DL) models were Long Short-Term Memory artificial recurrent neural network models which used training data to adjust model structure and weights for temperature predictions (Jia et al. 2019). Process-Guided Deep Learning (PGDL) models were DL models with an added physical constraint for energy conservation as a loss term. These models were pre-trained with uncalibrated Process-Based model outputs (PB0) before training on actual temperature observations. Performance was measured as root-mean squared errors relative to temperature observations during the test period. Test data include compiled water temperature data from a variety of sources, including the Water Quality Portal (Read et al. 2017), the North Temperate Lakes Long-TERM Ecological Research Program (https://lter.limnology.wisc.edu/), the Minnesota department of Natural Resources, and the Global Lake Ecological Observatory Network (gleon.org). This dataset is part of a larger data release of lake temperature model inputs and outputs for 68 lakes in the U.S. states of Minnesota and Wisconsin (http://dx.doi.org/10.5066/P9AQPIVD).
num_resources 8
num_tags 20
title Process-guided deep learning water temperature predictions: 6 Model evaluation (test data and RMSE)