Changes needed for jepa latent forecasting by shmh40 · Pull Request #2110 · ecmwf/WeatherGenerator

shmh40 · 2026-03-25T15:26:54Z

Description

We want to be able to do JEPA loss where the student has data at time t, and teacher data at t+1. This requires changes in multi_stream_data_sampler to shift the window for the teacher source data, and a light change to the SSL latent loss if we want to do this without spatial masking, as otherwise the masks for the student and teacher are the same in latent space, so no patches to compute the loss on.

Example config included. Potentially teacher_time_offset should be inside the target_input.

Issue Number

Closes #2109

Is this PR a draft? Mark it as draft.

Checklist before asking for review

I have performed a self-review of my code
My changes comply with basic sanity checks:
- I have fixed formatting issues with ./scripts/actions.sh lint
- I have run unit tests with ./scripts/actions.sh unit-test
- I have documented my code and I have updated the docstrings.
- I have added unit tests, if relevant
I have tried my changes with data and code:
- I have run the integration tests with ./scripts/actions.sh integration-test
- (bigger changes) I have run a full training and I have written in the comment the run_id(s): launch-slurm.py --time 60
- (bigger changes and experiments) I have shared a hegdedoc in the github issue with all the configurations and runs for this experiments
I have informed and aligned with people impacted by my change:
- for config changes: the MatterMost channels and/or a design doc
- for changes of dependencies: the MatterMost software development channel

…a-latent-forecast

…hmh40/dev/jepa-latent-forecast

clessig · 2026-03-29T08:57:55Z

@shmh40 : could you explain conceptually why a separate teacher offset is needed and why this is not covered by

forecast:
  offset: 1

shmh40 · 2026-03-30T10:00:24Z

As far as I can tell, in student-teacher mode, in msds we have source_select and target_select as only "network_input". This means that they only use _build_stream_data_input, since _build_stream_data_output only adds to the stream_data if the mode is "target_coords" or "target_values". forecast: offset: sets output_offset, which is only used for output_data (in _get_data_windows and _build_stream_data_output), and since this output is never fed to the student and teacher networks, it makes no difference. I have tested with some print statements.

teacher_time_offset explicitly offsets the input_data_target, which is then fed to the teacher.

…hmh40/dev/jepa-latent-forecast

…acky

shmh40 · 2026-04-10T16:42:51Z

Superseded by #2196 going into develop-ssl.

clessig · 2026-04-12T14:07:18Z

Superseded by #2196 going into develop-ssl.

We cannot have a branch develop-ssl. This will diverge the developments and make it very hard to merge things later.

changes needed for jepa latent forecasting

b431422

shmh40 requested a review from sophie-xhonneux March 25, 2026 15:26

shmh40 self-assigned this Mar 25, 2026

shmh40 added the model:pretrain label Mar 25, 2026

shmh40 added this to WeatherGen-dev Mar 25, 2026

github-actions bot added the model Related to model training or definition (not generic infra) label Mar 25, 2026

shmh40 added 2 commits March 26, 2026 13:12

Merge remote-tracking branch 'origin/develop' into develop-ssl

370887b

Merge remote-tracking branch 'origin/develop-ssl' into shmh40/dev/jep…

d4ffc85

…a-latent-forecast

shmh40 changed the base branch from develop to develop-ssl March 26, 2026 12:14

Merge remote-tracking branch 'origin/sophiex/dev-ssl/deep-ssl' into s…

4782049

…hmh40/dev/jepa-latent-forecast

shmh40 changed the base branch from develop-ssl to sophiex/dev-ssl/deep-ssl March 26, 2026 16:28

shmh40 added 2 commits March 26, 2026 17:41

add missing qk_norm_type to config

ee0c5f6

change ema halflife and 2d rope

b60596e

shmh40 added 6 commits March 30, 2026 18:02

Merge remote-tracking branch 'origin/sophiex/dev-ssl/deep-ssl' into s…

df06606

…hmh40/dev/jepa-latent-forecast

add layernorm

bd47dea

allow load optimiser state

9b38900

update to save teacher ema state

1f684ac

trainer loads ema for both training teacher and validation teacher, h…

d50276b

…acky

fix for teacher offset leaking into finetuning

adb4a08

shmh40 closed this Apr 10, 2026

github-project-automation bot moved this to Done in WeatherGen-dev Apr 10, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Changes needed for jepa latent forecasting#2110

Changes needed for jepa latent forecasting#2110
shmh40 wants to merge 12 commits intosophiex/dev-ssl/deep-sslfrom
shmh40/dev/jepa-latent-forecast

shmh40 commented Mar 25, 2026 •

edited

Loading

Uh oh!

clessig commented Mar 29, 2026

Uh oh!

shmh40 commented Mar 30, 2026

Uh oh!

shmh40 commented Apr 10, 2026

Uh oh!

clessig commented Apr 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

shmh40 commented Mar 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Issue Number

Checklist before asking for review

Uh oh!

clessig commented Mar 29, 2026

Uh oh!

shmh40 commented Mar 30, 2026

Uh oh!

shmh40 commented Apr 10, 2026

Uh oh!

clessig commented Apr 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

shmh40 commented Mar 25, 2026 •

edited

Loading