Building on the success of the 2025 FDL Earth Systems Lab (ESL)’s research sprint, SHRUG-FM, a framework for reliability-aware prediction that enables geospatial foundation models to identify and abstain from likely failures, won ‘Best Paper Award’ at the EarthVision 2026 workshop that took place during the Computer Vision and Pattern Recognition (CVPR) 2026 Conference.
Geospatial foundation models are very useful tools for Earth observation research since they allow a single, pre-trained AI model to be quickly adapted for multiple tasks, using drastically less labelled data than more conventional AI models. However, these models can be ‘confidently wrong’, failing to perform reliably in environments that were underrepresented during their training phase.
This becomes particularly problematic when these geospatial foundation models are used in critical, time-sensitive situations such as disaster response after an extreme weather or climate event. But how is it possible to increase model transparency in cases where the model output has a high degree of uncertainty and requires human validation? A team mentored by Ruben Cartuyvels, Internal Research Fellow at ESA Φ-lab, and Patrick Ebel, former Internal Research Fellow at ESA Φ-lab and researcher at Google Research, decided to dive deep into new ways of making foundation models aware of their own uncertainty.
During the 2025 FDL Earth Systems Lab (ESL)’s ‘Foundation Models for Extreme Environments’ sprint, the team developed SHRUG-FM (Systematic Handling of Real-world Uncertainty for Geospatial Foundation Models), a framework that helps foundation models flag when they may fail, addressing two main types of uncertainty: data-driven or model-driven.
SHRUG-FM was developed as mechanism that can either provide a prediction, raise a warning or ‘shrug’ – indicating it doesn’t know the correct answer – when a foundation model is highly uncertain. This adaptable framework aims to make geospatial foundation models more transparent and reliable, showing a consistently reduced prediction risk for critical, climate-sensitive tasks like burn scar segmentation, flood mapping and landslide detection.
On 4 June 2026, SHRUG-FM was presented during the EarthVision 2026 workshop. As part of the Computer Vision and Pattern Recognition (CVPR) 2026 Conference, this workshop aimed to strengthen collaboration between the Earth observation, computer vision and machine learning communities, fostering innovation in automated geospatial analysis. SHRUG-FM won ‘Best Paper Award’, and its paper and code are now available online.

Claudio Iacopino, Head of the ESA Φ-lab Explore Office, commented on the importance of developing frameworks like SHRUG-FM: “Winning ‘Best Paper Award’ proves that the Earth observation research community is shifting its focus to a crucial point in the use of geospatial foundation models: trust. For a long time, the goal was to make these models smarter, and even though that is still an objective, it’s very important to have frameworks that give these models the ability to say ‘I might be wrong here’. Building this kind of reliability is exactly what both research and industries need to see before they fully adopt these models for widespread, everyday use.”
A more detailed explanation about the development of SHRUG-FM is available here.
To know more: ESA Φ-lab, FDL Earth Systems Lab (ESL), EarthVision 2026, Computer Vision and Pattern Recognition (CVPR) 2026 Conference
Photo courtesy of ESA, contains elements from FDL Earth Systems Lab.
