Image of lakes in part of North Dakota identified by ReaLSAT (red) and HydroLAKES (blue)

Data scientists use new techniques to identify lakes and reservoirs around the world

First-of-its-kind dataset will provide new insights into land and water use and climate change

MINNEAPOLIS / ST. PAUL (07/19/2022) — An interdisciplinary team of researchers, led by University of Minnesota Twin Cities data scientists and supported by the U.S. National Science Foundation and NASA, has published a first-of-its-kind comprehensive global dataset of the lakes and reservoirs on Earth that shows how they have changed over the last 30+ years.

The data will provide environmental researchers with new information about land and fresh water use as well as how lakes and reservoirs are being impacted by humans and climate change. The research is also a major advancement in machine learning techniques.

A paper highlighting the Reservoir and Lake Surface Area Timeseries (ReaLSAT) dataset was recently published in Scientific Data, a peer-reviewed, open-access journal published by Nature.

Highlights of the study include:

  • The ReaLSAT dataset contains the location and surface area variations of 681,137 lakes and reservoirs larger than 0.1 square kilometers (south of 50 degrees north latitude). The previous most comprehensive database, called HydroLAKES, had identified only 245,420 lakes and reservoirs for the part of the world and minimum size being considered in this study.
  • RealSAT provides data on the surface area of each body of water for each month from 1984 to 2015. This makes it possible to quantify changes in lake and reservoir area over time, which is key to understanding how changing climate and land use are altering bodies of fresh water. The HydroLAKES data contains only a static shape for each water body.
  • The RealSAT dataset is the culmination of eight years of research. It represents a major milestone in the application of new knowledge-guided machine learning for use in the environmental sciences. Unlike other existing efforts, this dataset can now be extended nearly automatically via machine learning and can be quickly replicated for a wide variety of earth observation data that are becoming available at increasingly better resolution. 

Around the world, we are seeing lakes and reservoirs changing rapidly with seasonal precipitation patterns, long-term changes in climate, and human management decisions,” said Vipin Kumar, the senior author of the study and Regents Professor and William Norris Endowed Chair in the University of Minnesota Twin Cities Department of Computer Science and Engineering. This new dataset greatly improves the ability of scientists to understand the impact of changing climate and human actions on our fresh water across the globe.”

Global comparison of lakes identified by ReaLSAT and HydroLAKES
This figure highlights the lakes identified by ReaLSAT and HydroLakes (blue), as well as the lakes present in ReaLSAT but not HydroLakes (red) for different regions in world: a) small reservoirs in India, b) water-intensive agriculture in Vietnam, c) natural lakes in the U.S., and wetlands in Venezuela and d) shallow lakes in Australia. Credit: ReaLSAT, University of Minnesota

Building a global dataset of lakes and reservoirs and how they are changing required a new type of machine learning algorithms that meld knowledge of the physical dynamics of water bodies with satellite imagery.

ReaLSAT is a shining example where environmental challenges motivated a new class of knowledge-guided machine learning algorithms that are now being used in numerous scientific applications,” Kumar said.

Scientists who study the environment agree that ReaLSAT will improve their work.

The availability and quality of surface fresh water is central to sustainable use of our planet,” said Paul C. Hanson, a Distinguished Research Professor at the University of Wisconsin-Madison Center for Limnology and a co-author of the study. Because ReaLSAT shows changes in lakes and their boundaries, rather than just water pixels across the landscape, we can now connect ecosystem process about water quality with hundreds of thousands of lakes around the world.”

The research was supported by the U.S. National Science Foundation Awards 1029711, 1838159, 1934633, and NASA grant NNX12AP37G. Access to computing facilities was provided by the University of Minnesota Supercomputing Institute.

To read the entire study, entitled ReaLSAT, a global dataset of reservoir and lake surface area variations” visit the Scientific Data website.

Share