
Add Copernicus-FM #2646

Merged
merged 32 commits into microsoft:main
Mar 26, 2025

Conversation

wangyi111
Contributor

@wangyi111 wangyi111 commented Mar 14, 2025

Add Copernicus-FM, an extension of the DOFA foundation model that can process any spectral or non-spectral sensor modality using extended dynamic hypernetworks and flexible metadata encoding.

Key features:

  • A unified model for both spectral and non-spectral modalities -- dynamic hypernetworks with Fourier / language encoding
  • Efficient processing of any spatial resolution -- adaptive patch embedding kernel size
  • Flexible metadata integration -- Fourier encoding with learnable meta tokens for geolocation, scale and time
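The "Fourier encoding" in the last bullet refers to mapping scalar metadata (geolocation, scale, time) to sinusoidal features. As a rough illustration only (a generic sinusoidal-feature sketch, not the actual Copernicus-FM or torchgeo implementation; `fourier_encode` and its parameters are hypothetical):

```python
import math

def fourier_encode(value: float, dim: int, max_freq: float = 10000.0) -> list[float]:
    """Encode a scalar metadata value (e.g. a longitude or timestamp) as a
    dim-dimensional vector of interleaved sin/cos features at log-spaced
    frequencies, in the spirit of Fourier metadata encoding."""
    assert dim % 2 == 0, 'dim must be even for sin/cos pairs'
    half = dim // 2
    feats: list[float] = []
    for i in range(half):
        freq = 1.0 / (max_freq ** (i / half))  # log-spaced frequencies
        feats.append(math.sin(value * freq))
        feats.append(math.cos(value * freq))
    return feats

# e.g. encode a latitude into an 8-dim feature vector
emb = fourier_encode(48.26, dim=8)
```

In the real model these features are combined with learnable meta tokens; this sketch only shows the sinusoidal part.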

References

@github-actions github-actions bot added documentation Improvements or additions to documentation models Models and pretrained weights labels Mar 14, 2025
@adamjstewart adamjstewart added this to the 0.7.0 milestone Mar 15, 2025
@github-actions github-actions bot added the testing Continuous integration testing label Mar 18, 2025
Args:
x: Input mini-batch.
meta_info: Longitudes, latitudes, times, and areas of each patch.
Use NaN for unknown metadata.
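For illustration, the NaN convention described in this docstring could be handled like the following sketch (hypothetical helper, not the actual torchgeo code; the real model substitutes a learnable token for unknown entries, here approximated by a constant placeholder):

```python
import math

UNKNOWN = 0.0  # placeholder; the real model uses a learnable meta token instead

def encode_metadata(meta: list[float]) -> list[float]:
    """Map each metadata entry (e.g. lon, lat, time, area) to itself, or to a
    placeholder when it is unknown (NaN), mirroring the docstring convention."""
    return [UNKNOWN if math.isnan(v) else v for v in meta]

# time unknown for this patch
meta = encode_metadata([8.55, 47.37, float('nan'), 264.0])
```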
Collaborator

This is an unintuitive UI. I would rather have separate values for each, which are either Tensor or None. It's also a shame that we can't mix this in a single mini-batch: if a single value is NaN, that metadata is ignored.

Contributor Author

We can make it possible to mix within the batch in principle, but it needs looping over the batch dim to assign known/unknown entries, which would probably change a lot of code.
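The per-batch mixing discussed here would look roughly like this sketch (hypothetical helper, not code from the PR): loop over the batch dimension and bucket samples by whether their metadata is fully known, so each group can be routed differently.

```python
import math

def split_known_unknown(batch_meta: list[list[float]]) -> tuple[list[int], list[int]]:
    """Loop over the batch dim and bucket sample indices by whether all of
    their metadata entries are known (no NaN), as mixing known and unknown
    metadata within one mini-batch would require."""
    known: list[int] = []
    unknown: list[int] = []
    for i, meta in enumerate(batch_meta):
        (unknown if any(math.isnan(v) for v in meta) else known).append(i)
    return known, unknown
```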

@adamjstewart adamjstewart marked this pull request as ready for review March 19, 2025 12:50
adamjstewart
adamjstewart previously approved these changes Mar 19, 2025
Collaborator

@adamjstewart adamjstewart left a comment


This is ready from my side, but I'll give others a couple days to review. I'm particularly concerned about whether the documentation is sufficient for people to figure out how to use the model. There are ways we could make this more user-friendly, but I don't want to diverge too much from the original source code.

@calebrob6
Member

Just read through this and found myself wanting an example of how to use it (the same thing I hit previously when trying to use Scale-MAE).

E.g., even though the args in forward are documented, what do they mean:

[screenshot of the documented forward() arguments]

Maybe it'd be nice to put an example in the docstring? (This also applies to other pre-trained models actually)

@adamjstewart
Collaborator

Agreed, we actually got a similar request for DOFA: zhu-xlab/DOFA#14

These newer models (Copernicus-FM, Panopticon) add a lot more (optional) metadata, so are even more confusing to use. Not sure if this should be API documentation or tutorials or what. I probably don't have a ton of time to work on this personally but @wangyi111 might.

@wangyi111
Contributor Author

> Agreed, we actually got a similar request for DOFA: zhu-xlab/DOFA#14
>
> These newer models (Copernicus-FM, Panopticon) add a lot more (optional) metadata, so are even more confusing to use. Not sure if this should be API documentation or tutorials or what. I probably don't have a ton of time to work on this personally but @wangyi111 might.

Should be easy for me to add the docstring. Regarding a tutorial, is there a place for demonstrating a pretrained model? I only see https://torchgeo.readthedocs.io/en/stable/tutorials/pretrained_weights.html

@adamjstewart
Collaborator

Yep, that's the right location. We could either expand that tutorial to cover additional models, or add a second tutorial specifically for using FMs.

adamjstewart
adamjstewart previously approved these changes Mar 20, 2025
Collaborator

@adamjstewart adamjstewart left a comment


I vote we merge this as is and @wangyi111 can open a separate PR to expand our tutorials on Scale-MAE, DOFA, Copernicus-FM, etc. Any objections?

@wangyi111
Contributor Author

> I vote we merge this as is and @wangyi111 can open a separate PR to expand our tutorials on Scale-MAE, DOFA, Copernicus-FM, etc. Any objections?

Oh, I just added a docstring to the CopernicusFM class.

@adamjstewart
Collaborator

How would you feel about renaming a few things for consistency:

  • img_feat -> image or x
  • meta_info -> metadata
  • wave_list, wvs -> wavelengths
  • wv_planes -> wavelength_dim
  • bandwidth -> bandwidths
  • hypernet -> input_mode

Could also split meta_info into 4 separate variables for ease of use. Don't want to diverge too much from the original implementation, but also want to make it user friendly and intuitive.

@wangyi111
Contributor Author

> How would you feel about renaming a few things for consistency:
>
> • img_feat -> image or x
> • meta_info -> metadata
> • wave_list, wvs -> wavelengths
> • wv_planes -> wavelength_dim
> • bandwidth -> bandwidths
> • hypernet -> input_mode
>
> Could also split meta_info into 4 separate variables for ease of use. Don't want to diverge too much from the original implementation, but also want to make it user friendly and intuitive.

Good for me. The only exception is wv_planes, which is not only the dim of the wavelength encoding but also of the bandwidth and language embeddings. Maybe something like hyper_dim? I kind of wanted to call it meta_dim, but metadata already means something else in this model.

@adamjstewart
Collaborator

Maybe in_dim or in_features?

@wangyi111
Contributor Author

These names could still be mistaken for input image features; maybe hyper_planes?

@adamjstewart
Collaborator

Finished renaming. Remaining ideas to improve usability:

  • Could remove input_mode and key it based on whether wavelengths/bandwidths or language_embed is provided
  • Could use type hints to make it clear that wavelengths/bandwidths or language_embed is required, not both nor neither
  • Could split metadata into lat/lon/time/area, makes it easier to skip certain variables

I don't want to spend too much time on this because we still need to finish Copernicus-Bench and Copernicus-Pretrain, but once this is merged it becomes harder to change without breaking backwards compatibility.
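The first bullet above, keying the mode off which optional inputs are provided instead of a separate input_mode argument, could be sketched like this (hypothetical helper and names, not the actual torchgeo API; spectral modalities pass wavelengths/bandwidths, non-spectral ones pass a language embedding, and exactly one of the two must be given):

```python
def infer_input_mode(wavelengths=None, bandwidths=None, language_embed=None) -> str:
    """Derive the hypernetwork mode from which optional inputs are provided,
    enforcing that wavelengths/bandwidths and language_embed are mutually
    exclusive and that at least one is supplied."""
    spectral = wavelengths is not None and bandwidths is not None
    language = language_embed is not None
    if spectral == language:
        raise ValueError(
            'Provide either wavelengths/bandwidths or language_embed, '
            'not both and not neither'
        )
    return 'spectral' if spectral else 'language'
```

A union type hint (second bullet) could express the same constraint statically, but a runtime check like this catches misuse regardless of type checking.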

adamjstewart
adamjstewart previously approved these changes Mar 24, 2025
@adamjstewart adamjstewart merged commit 81f8c0f into microsoft:main Mar 26, 2025
19 checks passed