per cell lejepa implementation by shmh40 · Pull Request #2111 · ecmwf/WeatherGenerator

shmh40 · 2026-03-25T18:58:43Z

Description

Draft implementation of per-cell LeJEPA with SIGReg loss. LeJEPALoss should probably just go under the LatentLoss class, to do. Also needs checking.

We have a "SelfTeacher" since we don't do any stop grad/EMA etc. with LeJEPA.

Issue Number

Closes ???

Is this PR a draft? Mark it as draft.

Checklist before asking for review

I have performed a self-review of my code
My changes comply with basic sanity checks:
- I have fixed formatting issues with ./scripts/actions.sh lint
- I have run unit tests with ./scripts/actions.sh unit-test
- I have documented my code and I have updated the docstrings.
- I have added unit tests, if relevant
I have tried my changes with data and code:
- I have run the integration tests with ./scripts/actions.sh integration-test
- (bigger changes) I have run a full training and I have written in the comment the run_id(s): launch-slurm.py --time 60
- (bigger changes and experiments) I have shared a hegdedoc in the github issue with all the configurations and runs for this experiments
I have informed and aligned with people impacted by my change:
- for config changes: the MatterMost channels and/or a design doc
- for changes of dependencies: the MatterMost software development channel

…has its own class which is not good

shmh40 · 2026-03-25T19:00:00Z

…hmh40/dev/lejepa-per-cell

clessig · 2026-03-29T11:59:52Z

        self.latent_heads = nn.ModuleDict()
        self.latent_pre_norm = nn.LayerNorm(cf.ae_global_dim_embed)

+        ssl_loss_types = ("LossLatentSSLStudentTeacher", "LossLeJEPA")


I would prefer if we come up with a consistent naming, e.g. SSL_ for the ssl losses, so that we do not need to maintain an explicit list here.

clessig · 2026-03-29T12:02:26Z

    student_masks = student_masks.squeeze(dim=1)
    teacher_masks = teacher_masks.squeeze(dim=1)

    if temporal:


Why do we need this here? This should be apparent from the masking--forecasting has no mask on the target.

clessig · 2026-03-29T12:07:54Z

+    z = z.float()  # float32 for cos/sin precision
+    n, d = z.shape
+
+    # Trapezoidal quadrature weights with Gaussian window on [0, 3]


The quadrature should go to a separate function.

… into shmh40/dev/lejepa-per-cell

shmh40 added 2 commits March 25, 2026 19:55

first effort at lejepa, sigreg computed across cells, and lejepaloss …

843002b

…has its own class which is not good

example config

f0642ed

shmh40 self-assigned this Mar 25, 2026

shmh40 added this to WeatherGen-dev Mar 25, 2026

shmh40 added the model:pretrain label Mar 25, 2026

shmh40 requested a review from sophie-xhonneux March 25, 2026 19:00

github-actions bot added the model Related to model training or definition (not generic infra) label Mar 25, 2026

shmh40 added 2 commits March 26, 2026 21:46

merge in develop-ssl and conflict in model_interface

4a26b24

Merge remote-tracking branch 'origin/sophiex/dev-ssl/deep-ssl' into s…

6c0dee8

…hmh40/dev/lejepa-per-cell

github-actions bot added data Anything related to the datasets used in the project eval anything related to the model evaluation pipeline infra Issues related to infrastructure labels Mar 26, 2026

shmh40 added 2 commits March 26, 2026 22:32

gaussianity check and kwargs needed for lejepa loss after merge

be63b9b

updated lejepa loss

50b9d6b

clessig reviewed Mar 29, 2026

View reviewed changes

Merge remote-tracking branch 'origin/shmh40/dev/jepa-latent-forecast'…

b063751

… into shmh40/dev/lejepa-per-cell

shmh40 closed this Apr 13, 2026

github-project-automation bot moved this to Done in WeatherGen-dev Apr 13, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

per cell lejepa implementation#2111

per cell lejepa implementation#2111
shmh40 wants to merge 7 commits intoshmh40/dev/jepa-latent-forecastfrom
shmh40/dev/lejepa-per-cell

shmh40 commented Mar 25, 2026

Uh oh!

shmh40 commented Mar 25, 2026

Uh oh!

clessig Mar 29, 2026

Uh oh!

clessig Mar 29, 2026

Uh oh!

clessig Mar 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

shmh40 commented Mar 25, 2026

Description

Issue Number

Checklist before asking for review

Uh oh!

shmh40 commented Mar 25, 2026

Uh oh!

clessig Mar 29, 2026

Choose a reason for hiding this comment

Uh oh!

clessig Mar 29, 2026

Choose a reason for hiding this comment

Uh oh!

clessig Mar 29, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants