|
|
--- |
|
|
license: apache-2.0 |
|
|
language: |
|
|
- en |
|
|
pipeline_tag: image-to-image |
|
|
tags: |
|
|
- remote-sensing |
|
|
- earth-observation |
|
|
- vae |
|
|
- tokenizer |
|
|
--- |
|
|
|
|
|
# EO-VAE: Towards A Multi-sensor Tokenizer for Earth Observation |
|
|
|
|
|
EO-VAE is a multi-sensor variational autoencoder designed to serve as a foundational tokenizer for the Earth Observation (EO) domain. Unlike traditional approaches that require separate models for different sensors, EO-VAE utilizes a single model to encode and reconstruct flexible channel combinations through dynamic hypernetworks. |
|
|
|
|
|
## Model Summary |
|
|
- **Model Name:** EO-VAE |
|
|
- **Paper:** [EO-VAE: Towards A Multi-sensor Tokenizer for Earth Observation Data](https://arxiv.org/abs/2602.12177) |
|
|
- **License:** Apache-2.0 |
|
|
- **Task:** Image-to-Image / Tokenization (Remote Sensing) |
|
|
|
|
|
## Usage |
|
|
|
|
|
```python |
|
|
import torch |
|
|
from eo_vae.models.new_autoencoder import EOFluxVAE |
|
|
|
|
|
model = EOFluxVAE.from_pretrained( |
|
|
repo_id="nilsleh/eo-vae", |
|
|
ckpt_filename="eo-vae.ckpt", |
|
|
config_filename="model_config.yaml", |
|
|
device="cpu", |
|
|
) |
|
|
|
|
|
# Run reconstruction / latent extraction |
|
|
x = torch.randn(1, 3, 256, 256) |
|
|
# Example wavelengths for Sentinel-2 RGB |
|
|
wvs = torch.tensor([0.665, 0.56, 0.49], dtype=torch.float32) |
|
|
|
|
|
with torch.no_grad(): |
|
|
recon = model.reconstruct(x, wvs) # [B, 3, 256, 256] |
|
|
z = model.encode_spatial_normalized(x, wvs) # [B, 32, 32, 32] for 256x256 input |
|
|
``` |
|
|
|
|
|
|
|
|
These are the wavelengths used across modalities: |
|
|
|
|
|
|
|
|
```python |
|
|
WAVELENGTHS = { |
|
|
'S2RGB': [0.665, 0.56, 0.49], |
|
|
'S1RTC': [5.4, 5.6], |
|
|
'S2L2A': [ |
|
|
0.443, 0.490, 0.560, 0.665, 0.705, 0.740, |
|
|
0.783, 0.842, 0.865, 1.610, 2.190, 0.945, |
|
|
], |
|
|
'S2L1C': [ |
|
|
0.443, 0.490, 0.560, 0.665, 0.705, 0.740, |
|
|
0.783, 0.842, 0.865, 0.945, 1.375, 1.610, 2.190, |
|
|
], |
|
|
} |
|
|
``` |
|
|
|
|
|
If you use this model in your work, please cite: |
|
|
|
|
|
[**EO-VAE: Towards A Multi-sensor Tokenizer for Earth Observation Data**](https://arxiv.org/abs/2602.12177) |
|
|
|
|
|
```bibtex |
|
|
@article{eo-vae, |
|
|
title={EO-VAE: Towards A Multi-sensor Tokenizer for Earth Observation Data}, |
|
|
author={Lehmann, Nils and Wang, Yi and Xiong, Zhitong and Zhu, Xiaoxiang}, |
|
|
journal={arXiv preprint arXiv:2602.12177}, |
|
|
year={2026} |
|
|
} |
|
|
``` |