
Align jepa forecast finetuning #2149

Merged
shmh40 merged 55 commits into ecmwf:develop from csjfwang:align_jepa_forecast_finetuning
Mar 31, 2026

Conversation

@csjfwang (Contributor) commented Mar 31, 2026

Description

Align config_jepa_forecasting_finetuning.yml with config_forecasting.yml for fair comparison in the future.

Issue Number

Fixes #2150

Is this PR a draft? Mark it as draft.

Checklist before asking for review

  • I have performed a self-review of my code
  • My changes comply with basic sanity checks:
    • I have fixed formatting issues with ./scripts/actions.sh lint
    • I have run unit tests with ./scripts/actions.sh unit-test
    • I have documented my code and I have updated the docstrings.
    • I have added unit tests, if relevant
  • I have tried my changes with data and code:
    • I have run the integration tests with ./scripts/actions.sh integration-test
    • (bigger changes) I have run a full training and I have written in the comment the run_id(s): launch-slurm.py --time 60
    • (bigger changes and experiments) I have shared a HedgeDoc in the GitHub issue with all the configurations and runs for these experiments
  • I have informed and aligned with people impacted by my change:
    • for config changes: the MatterMost channels and/or a design doc
    • for changes of dependencies: the MatterMost software development channel

wang85 and others added 30 commits July 16, 2025 10:07
latent_noise_deterministic_latents: True

- freeze_modules: ".*encoder.*|.*latent_pre_norm.*|.*latent_heads.*"
+ freeze_modules: ""
@shmh40 (Contributor) Mar 31, 2026

We should keep these modules frozen as default also :)
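To illustrate the semantics being discussed: `freeze_modules` is a regex over module/parameter names, and the default pattern freezes anything matching one of the alternatives, while `""` freezes nothing. The helper and parameter names below are a hypothetical sketch, not WeatherGenerator code:

```python
import re

def select_frozen(param_names, freeze_modules):
    """Return the parameter names matched by the freeze_modules regex.

    An empty pattern ("") freezes nothing; a non-empty pattern is matched
    against each name. This is an illustrative sketch only.
    """
    if not freeze_modules:
        return set()
    pattern = re.compile(freeze_modules)
    return {name for name in param_names if pattern.match(name)}

# Hypothetical parameter names, for illustration only.
names = [
    "model.encoder.blocks.0.weight",
    "model.latent_pre_norm.bias",
    "model.decoder.blocks.0.weight",
]
frozen = select_frozen(names, ".*encoder.*|.*latent_pre_norm.*|.*latent_heads.*")
```

With the default pattern, the encoder and latent_pre_norm parameters above would be frozen and the decoder left trainable; with `""`, nothing is frozen.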

- fe_layer_norm_after_blocks: [] # Index starts at 0. Thus, [3] adds a LayerNorm after the fourth layer
- fe_impute_latent_noise_std: 0.0 # 1e-4
+ fe_layer_norm_after_blocks: [7] # Index starts at 0. Thus, [3] adds a LayerNorm after the fourth layer
+ fe_impute_latent_noise_std: 1e-4
Contributor

Sorry, could we actually leave the latent noise as 0 for now!
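The index semantics in the `fe_layer_norm_after_blocks` comment can be made concrete with a small sketch (illustrative labels only, not the real model code): indices start at 0, so `[3]` places a LayerNorm after the fourth block, and `[7]` after the eighth.

```python
def insert_layer_norms(num_blocks, layer_norm_after_blocks):
    """Sketch of the config's index semantics, with illustrative labels.

    Indices start at 0, so layer_norm_after_blocks=[3] adds a LayerNorm
    after the fourth block.
    """
    layout = []
    for i in range(num_blocks):
        layout.append(f"block{i}")
        if i in layer_norm_after_blocks:
            layout.append("layer_norm")
    return layout
```

For example, `insert_layer_norms(8, [7])` ends with a LayerNorm after block 7, matching the `[7]` value in the diff above.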

#####################################

- streams_directory: "./config/streams/era5_1deg/"
+ streams_directory: "./config/streams/era5_1deg_forecasting/"
Contributor

Great, thanks

lr_start: 1e-6
lr_max: 5e-5
- lr_final_decay: 1e-6
+ lr_final_decay: 2e-6
Contributor

great, thanks
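The three values `lr_start`, `lr_max`, and `lr_final_decay` suggest a warmup-then-decay schedule. As a rough, hypothetical sketch of how they could relate (the actual WeatherGenerator scheduler may well differ, e.g. in the decay shape):

```python
import math

def lr_at(step, total_steps, warmup_steps,
          lr_start=1e-6, lr_max=5e-5, lr_final_decay=2e-6):
    """Hypothetical warmup-then-cosine schedule relating the three
    config values: linear warmup from lr_start to lr_max, then cosine
    decay down to lr_final_decay. Illustrative only."""
    if step < warmup_steps:
        # Linear warmup: lr_start -> lr_max
        frac = step / warmup_steps
        return lr_start + frac * (lr_max - lr_start)
    # Cosine decay: lr_max -> lr_final_decay
    frac = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return lr_final_decay + 0.5 * (lr_max - lr_final_decay) * (1 + math.cos(math.pi * frac))
```

Under this sketch the run starts at `lr_start`, peaks at `lr_max` when warmup ends, and finishes at `lr_final_decay`, so raising `lr_final_decay` from 1e-6 to 2e-6 only changes the floor the decay ends on.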

training_mode: ["masking"]

- num_mini_epochs: 32
+ num_mini_epochs: 64
@shmh40 (Contributor) Mar 31, 2026

@sophie-xhonneux probably this one we can leave as 32 epochs?

@shmh40 (Contributor) left a comment

Great, thanks! Just a few changes needed and then let's wait for @sophie-xhonneux and @MatKbauer to double check too.

@shmh40 shmh40 self-requested a review March 31, 2026 09:42
@shmh40 (Contributor) left a comment

Also can you create an issue and link to it in the PR so that the checks pass :)

@github-project-automation github-project-automation bot moved this to In Progress in WeatherGen-dev Mar 31, 2026
@csjfwang (Contributor, Author)

> Also can you create an issue and link to it in the PR so that the checks pass :)

Thanks, will create one now!

@github-actions bot added labels: model (Related to model training or definition (not generic infra)), model:pretrain, science (Scientific questions) Mar 31, 2026
with_mixed_precision: True
with_flash_attention: True
compile_model: False
with_fsdp: False
@csjfwang (Contributor, Author) Mar 31, 2026

@shmh40 @sophie-xhonneux
Should we set with_fsdp: True, as in config_forecasting.yml?

Contributor

Let's leave it as False for now, thanks!

@shmh40 (Contributor) left a comment

Thank you @csjfwang !

@shmh40 shmh40 merged commit 5b14709 into ecmwf:develop Mar 31, 2026
5 checks passed
@github-project-automation github-project-automation bot moved this from In Progress to Done in WeatherGen-dev Mar 31, 2026

Labels

model:pretrain, model (Related to model training or definition (not generic infra)), science (Scientific questions)

Projects

Status: Done

Development

Successfully merging this pull request may close these issues.

Align JEPA forecasting fine-tuning config with forecasting config

2 participants