Introduction

The motion of interfaces is a crucial part of the modeling and simulation in applications like two phase flow, crystallization and phase separation, or tumor growth. Within the Copernicus program, radar and optical satellite imagery are available with short revisit periods, allowing for high quality observation of moving fronts of wildfires, oil spills and floods, or calving fronts of glaciers and ice sheets. Front tracking in satellite data therefore is a key ingredient for various applications in Earth Observation and an active research topic in various Helmholtz centers. Apart from questions concerning image analysis, the geometric information contained in the moving fronts is essential for the mathematical models, which are the basis for the simulation and forecasting of complex physical phenomena in the Earth system.

The main objective of the TerraByte-DNN2Sim project is to develop novel imaging algorithms and a data processing pipeline for the automated extraction of moving fronts and further geometric information from satellite data to make this information available for applications in various research fields and all Helmholtz centers using satellite data, e.g. DLR, FZJ, UFZ, AWI. We use this extracted data in simulations to advance the modelling of front evolution that is important in many research areas. We apply these methods to models of glacier calving fronts of the antarctic ice sheet. Computation will be carried out on the new TerraByte cluster that provides efficient access to geoscientific data like satellite images as well as the computational resources necessary for simulation.

Satellite Image Analysis

We will develop a pipeline to extract fronts from satellite image data. Traditional edge detection methods often fail at fuzzy or complex borders. Glacier fronts for example are hard to detect due to sea ice, icebergs, mélange and surface melt. Therefore, a Deep Neural Network (DNN) will be trained to extract fronts from image data. Since the front could span many large images, sophisticated data handling and multi resolution algorithms are required to get one consistent and continuous line. Another challenge is the sparseness and incompleteness of data sources, e.g., due to due to orbit times or cloud cover in the case of satellite images. We aim to produce a time series of the front that is complete, both in space and in time.

Simulation

Given the time series of front positions extracted from image data, our goal is to compute the spacially and temporally resolved speed of the front. The front can be mathematically represented by the zero levelset of an implicit function. The evolution of the front is then described by a partial differential equation, which includes the speed of the front as a parameter. We will implement a software library to solve the inverse problem of computing the speed of the front from known front positions, taking care of the necessary regularization. From there it is possible to compute the position of the front at every time and forecast it into the future. The library should be efficiently parallelized and scale to thousands of CPUs. Capturing the moving front requires a dynamically adapted grid which poses a challenge to the parallelization. The library should support diverse use cases and allow easy integration into or coupling with other simulation codes, e.g., using the coupling framework preCICE.

Application: Calving Fronts in Ice Sheet Models

We will apply the methods described above to the important use case of modelling glacier calving fronts. Correctly capturing the evolving front is crucial for the forecasting of ice sheet or larger climate models into the future. For this application, we will extract the glacier front of the antarctic ice sheet from Sentinel 1 satellite image data. We will couple the developed simulation code with the Ice Sheet System Model (ISSM) software to enhance its moving front module. We hope to significantly improve its modelling capabilities.

TerraByte

Computations in the project will be performed on TerraByte, a new High Performance Computing cluster for earth observation and other geoscience workloads established by DLR and hosted at the Leibnitz Supercomputing Centre (LRZ) of the Bavarian Academy of Sciences and Humanities. The first stage of TerraByte consists of 61 compute nodes with 80 CPU cores each, 15 GPU nodes with 4 Nvidia GPUs, and 36 petabytes of data storage, including fast access to satellite images and other geoscientific data. A further significant extension of compute resources is planned for 2023. TerraByte therefore provides both the efficient data access for image analysis as well as the compute power required for simulation.

Helmholtz Imaging

TerraByte-DNN2Sim is part of Helmholtz Imaging. Helmholtz Imaging's mission is to unlock the potential of imaging in the Helmholtz Association. Image data provide a substantial part of data being generated in scientific research. Helmholtz Imaging is the overarching platform to better leverage and make accessible to everyone the innovative modalities, methodological richness, and data treasures of the Helmholtz Association. Helmholtz Imaging is one of five platforms (HIDA, HIFIS, Helmholtz.AI and HMC) initiated by the Helmholtz Information & Data Science Incubator.