Lightnews — Scholar-powered news

Valerio Marsocci

@valeriomarsocci.bsky.social

#8 Copernicus-FM

This paper introduces: a) a new pre-training dataset; b) a new benchmark dataset; c) a GFM, all based on a diverse set of Copernicus data.

⬆️: really appreciate the grid embeddings part
⬇️: some doubts about claims about generalizability

arxiv.org/pdf/2503.11849

March 27, 2025 at 9:30 AM

Valerio Marsocci

@valeriomarsocci.bsky.social

🔥 🎯 Parameter-Efficient Adaptation of Geospatial Foundation Models through Embedding Deflection (#7)

New preprint around :)

Incorporating inductive biases specific to MSI can enhance the fine-tuning of large Earth observation models, pre-trained on RGB

arxiv.org/pdf/2503.09493

March 17, 2025 at 10:18 AM

Valerio Marsocci

@valeriomarsocci.bsky.social

#6 Lossy Neural Compression for Geospatial Analytics

The authors introduce NC and discuss the characteristics of EO and climate data, w.r.t natural images

⬆️: great entry point
⬇️: no baseline exps

arxiv.org/pdf/2503.01505

March 10, 2025 at 1:00 PM

Valerio Marsocci

@valeriomarsocci.bsky.social

#5 Is SSL on Satellite Imagery Better than ImageNet? A Systematic Study with Sentinel-2

This study pretrains two SSL methods on ImageNet and GeoNet. The improvement with GeoNet is minimal.

⬆️ useful to reduce computation?
⬇️ more considerations about the resolutions?

arxiv.org/pdf/2502.10669

February 24, 2025 at 9:08 AM

Valerio Marsocci

@valeriomarsocci.bsky.social

#4 Galileo

Galileo is a family of pretrained RS models designed to flexibly process multimodal RS data. It has two loss: one in the pixel space, one in the latent space.

⬆️: multi-modal/temporal/sensor
⬇️: why just using Sentinel data?

arxiv.org/pdf/2502.09356

February 14, 2025 at 8:43 AM

Valerio Marsocci

@valeriomarsocci.bsky.social

#2 Can Location Embeddings Enhance Super-Resolution of Satellite Imagery?

It looks like they can :)

⬆️: validating it on a real-world task
⬇️: is it super-resolution or mapping S2 to NAIP?

arxiv.org/pdf/2501.15847

January 30, 2025 at 9:59 AM

Valerio Marsocci

@valeriomarsocci.bsky.social

#1 Diffusion Models for RS

This paper provides a comprehensive review of the applications of diffusion models in remote sensing

⬆️ excellent entry point
⬇️ not sure about the statement about the "inherent denoising ability" of diffusion models

arxiv.org/abs/2404.08926

January 21, 2025 at 1:44 PM

Valerio Marsocci

@valeriomarsocci.bsky.social

Back on social, after a break (can you guess where?)

Last year I decided to do a #50paperschallenge

I ended up with 43. Still:
🥵 I read more than 50 papers. I just didn't post all
😇 the strategy worked independently of the posted ones

For this reason, this year I will do a #40paperschallenge!

January 14, 2025 at 2:35 PM

Valerio Marsocci

@valeriomarsocci.bsky.social

#41 Beyond Grid Data

GNNs open new possibilities for EO, handling irregular, multi-source datasets (e.g. point clouds) for smarter weather forecasts, disaster relief, etc..

⬆️: excels at non-Euclidean spatial data
⬇️: limited scalability across diverse data (?)

arxiv.org/abs/2411.03223

December 12, 2024 at 1:33 PM

Valerio Marsocci

@valeriomarsocci.bsky.social

We observed interesting insights, such as:

1. generally speaking GFMs don't really excel when compared to supervised baselines

2. for some specific scenarios (e.g. HR data), it makes sense to use them

3. multi-temporal data are still under-estimated

other insights in the paper!

🧵

December 6, 2024 at 2:23 PM

Valerio Marsocci

@valeriomarsocci.bsky.social

With this benchmark (PANGAEA), we tried to address the following research challenges:

* provide a robust evaluation protocol to benchmark GFMs
* investigate GFMs capabilities, with a focus on a) domain generalization, b) comparison to supervised baselines, c) performance with limited labels

🧵

December 6, 2024 at 2:23 PM

Valerio Marsocci

@valeriomarsocci.bsky.social

We collected 11 datasets to create an inclusive, diverse benchmarks, based on these criteria:
* application domain
* geographical distribution
* type of task
* modality
* temporality

Spoiler: no patch-level classification tasks are included!

🧵

December 6, 2024 at 2:23 PM

Valerio Marsocci

@valeriomarsocci.bsky.social

🚀🚀🌏

Are geospatial foundation models really impactful?

Check it in our new pre-print!

Welcome to **PANGAEA: a global and inclusive benchmark for GFMs**

arxiv.org/abs/2412.04204

Check also the public GitHub repo (other news/updates soon):
github.com/VMarsocci/pa...

a short thread 🧵

December 6, 2024 at 2:23 PM

Valerio Marsocci

@valeriomarsocci.bsky.social

#39 TCH in African savannas

Can global SatML models solve local challenges?
This study finds local models outperform global & fine-tuned models for TCH mapping in Africa

⬆️: interesting set of research questions
⬇️: what about "generalist" geospatial foundation models?

arxiv.org/pdf/2411.14354

November 29, 2024 at 3:15 PM

Valerio Marsocci

@valeriomarsocci.bsky.social

also, in the past I posted about this interesting benchmark paper:

#32 GeoFMs for crop type mapping

it investigates the ability of geoFMs to transfer to new geographic regions in agriculture

⬆️the pivotal topic for real-world applications
⬇️the limited number of geoFMs

arxiv.org/pdf/2409.09451

November 28, 2024 at 2:47 PM

Valerio Marsocci

@valeriomarsocci.bsky.social

#38 SPECIALIZED FOUNDATION MODELS STRUGGLE
TO BEAT SUPERVISED BASELINES

Specialized FMs in genomics, satellite imaging, and time series, struggle w.r.t. supervised learning pipelines

⬆️: very relevant work
⬇️: just classification, limiting the real-world capabilities*

arxiv.org/abs/2411.02796

November 28, 2024 at 2:44 PM

Valerio Marsocci

@valeriomarsocci.bsky.social

#25 SatlasNet

Satlas (#ICCV ‘23) proposes both a dataset (SatlasPretrain) and a model (SatlasNet).

SatlasNet is a supervised Swin-based model with multi-head for different tasks

⬆️multi-task model
⬇️supervised setting

arxiv.org/pdf/2211.15660

November 20, 2024 at 12:30 PM

Valerio Marsocci

@valeriomarsocci.bsky.social

#29 SpectralGPT

SpectralGPT (#TPAMI) leverages 3D token generation for spatial-spectral coupling to process images of different sizes, resolutions, etc.

⬆️great flexibility in the input
⬇️missing some modalities (e.g. SAR)

ieeexplore.ieee.org/document/104...

November 20, 2024 at 12:30 PM

Valerio Marsocci

@valeriomarsocci.bsky.social

#26 Prithvi

Prithvi is the geospatial foundation model developed by NASA and IBM. Trained on HLS, it employs an MAE with 3D positional encoding to consider multi-temporality

⬆️multi-temporality and the tasks
⬇️limited in the geographical extent

arxiv.org/pdf/2310.18660

November 20, 2024 at 12:30 PM

Valerio Marsocci

@valeriomarsocci.bsky.social

#37 ALISE

ALISE (ALigned SITS Encoder) is a model for processing irregular and unaligned Satellite Image Time Series (SITS)

⬆️: great sparse data and label handling
⬇️: would be great to extend the domains (geographical and sensor-related)

arxiv.org/abs/2407.08448

November 20, 2024 at 12:30 PM

Valerio Marsocci

@valeriomarsocci.bsky.social

#36 TaxaBind

TaxaBind introduces a unified embedding space for 6 data types, solving ecological tasks like species mapping & zero-shot classification

⬆️: robust zero-shot classification
⬇️: what if we want to add/test new tasks?

arxiv.org/abs/2411.00683

November 19, 2024 at 1:12 PM

Valerio Marsocci

@valeriomarsocci.bsky.social

#1 paper I posted before this year was SatCLIP

#SatCLIP learns the implicit representations of the image features that characterize a specific location

what I liked most: geo adaptation and per-continent performance

paper: arxiv.org/pdf/2311.171...

November 19, 2024 at 12:26 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news