Summary
This image shows the location of Neversink Watershed within the Delaware River Basin.
ExaSheds is a DOE funded project that aims to advance watershed system science using machine learning-assisted simulation.
The USU team was awarded a subcontract to develop integrated watershed hydro-thermal models for the Upper Neversink Watershed within the Delaware River Basin using the Advanced Terrestrial Simulator. The developed watershed models will provide training data for ML-assisted inverse modeling which will be performed by the PNNL team.
Funding Agency:
Pacific Northwest National Laboratory (PNNL)
USU Team:
Pin Shuai, Utah State University (PI)
Duration:
2022-2023
Total funding:
$60,000
External Link
Updates
- 10/01/2024: New publication out led by Dr. Alex Sun on the development of a deep neural operator learning approach for the parameter estimation in the Neversink Watershed (Sun et al., 2024).
- 09/01/2024: New publication out led by Dr. Jiang on the development of a knowledge-informed deep learning technique for parameter estimation in the Neversink Watershed (Jiang et al., 2024).
- 07/01/2023: New paper out led by Dr. Jiang on using knowledge-informed deep learning for hydrological model calibration in the Coal Creek Watershed in Colorado (Jiang et al., 2023).
References
2024
-
Bridging Hydrological Ensemble Simulation and Learning Using Deep Neural Operators
Alexander Y. Sun, Peishi Jiang, Pin Shuai , and 1 more author
Water Resources Research, 2024
Ensemble-based simulation and learning (ESnL) has long been used in hydrology for parameter inference, but computational demands of process-based ESnL can be quite high. To address this issue, we propose a deep neural operator learning approach. Neural operators are generic machine learning algorithms that can learn functional mappings between infinite-dimensional spaces, providing a highly flexible tool for scientific machine learning. Our approach is built upon DeepONet, a specific deep neural operator, and is designed to address several common problems in hydrology, namely, model parameter estimation, prediction at ungaged locations, and uncertainty quantification. Here we demonstrate the effectiveness of our DeepONet-based workflow using an existing large model ensemble created for an eastern U.S. watershed that is instrumented with 10 streamflow gages. Results suggest DeepONet achieves high efficiency in learning an ML surrogate model from the model ensemble, with the modified Kling-Gupta Efficiency exceeding 0.9 on holdout test sets. Parameter inference, carried out using the trained DeepONet surrogate model and genetic algorithm, also yields robust results. Additionally, we formulate and train a separate DeepONet model for physics-informed, seq-to-seq streamflow forecasting, which further reduces biases in the pre-trained DeepONet surrogate model. While this study focuses primarily on a single watershed, our approach is general and may be extended to enable learning from model ensembles across multiple basins or models. Thus, this research represents a significant contribution to the application of hybrid machine learning in hydrology.
-
Optimizing Parameter Learning and Calibration in an Integrated Hydrological Model: Impact of Observation Length and Information
Peishi Jiang, Pin Shuai, Alexander Y. Sun , and 1 more author
Journal of Hydrology, Nov 2024
Integrated hydrological modeling is gaining popularity due to its mechanistic representation of the surface and subsurface processes. However, estimating the parameters of such process-based models can be computationally expensive if careful consideration is not given to the length of streamflow observations used during model calibration. Here we evaluate the influence of the calibration period, the role of streamflow information content, and the gage location in parameter learning and calibration of a fully integrated hydrological model, the Advanced Terrestrial Simulator (ATS). We conducted the study at the Upper Neversink River Watershed within the Delaware River Basin, where streamflow observations are available at 11 gages with varying record lengths. We leveraged a recently proposed knowledge-informed deep learning technique for parameter estimation. To assess the impact of observation period and gage location, model parameters were learned on scenarios using different chunks of streamflow observations, including (1) using only one year or consecutive multiple years of streamflow observations at the watershed outlet and (2) using one shared year of observations at each of the sub-catchment gages, with the period from 1991-10-01 through 1999-09-30 as the overall calibration period. Using the estimated parameters, ATS was rerun for each scenario and evaluated on a subsequent period from 1999-10-01 through 2002-09-30. Results show that the basin outlet discharge prediction is mostly improved when using at least four years of observations for parameter estimation. Further, the performance of the calibrated ATS run correlates with the information content of the observed streamflows, suggesting that the information-theoretic metrics could be indicators for selecting the observation period for parameter estimation. Finally, we find that observations from an informative gage can be used in learning parameters to predict the streamflow at a nearby gage, which would potentially lower the computational expense by reducing the watershed domain used in calibration. Our success underscores the potential of using information theory to achieve robust model parameter estimation on a reduced computational budget.
2023
-