Add changes for pushforward trick by SavvasMel · Pull Request #1997 · ecmwf/WeatherGenerator

SavvasMel · 2026-03-06T13:06:48Z

Description

This PR adds the necessary changes for the pushforward trick.

Issue Number

Closes #1740

Is this PR a draft? Mark it as draft.

Checklist before asking for review

I have performed a self-review of my code
My changes comply with basic sanity checks:
- I have fixed formatting issues with ./scripts/actions.sh lint
- I have run unit tests with ./scripts/actions.sh unit-test
- I have documented my code and I have updated the docstrings.
- I have added unit tests, if relevant
I have tried my changes with data and code:
- I have run the integration tests with ./scripts/actions.sh integration-test
- (bigger changes) I have run a full training and I have written in the comment the run_id(s): launch-slurm.py --time 60
- (bigger changes and experiments) I have shared a hegdedoc in the github issue with all the configurations and runs for this experiments
I have informed and aligned with people impacted by my change:
- for config changes: the MatterMost channels and/or a design doc
- for changes of dependencies: the MatterMost software development channel

clessig · 2026-03-15T18:18:57Z

src/weathergen/model/model_interface.py

+                # reshard_after_forward=False keeps FE parameters unsharded
+                # during the multi-step rollout loop.
+                # Needed for pushforward trick.
+                fully_shard(module, reshard_after_forward=False, **fsdp_kwargs)


@sophie-xhonneux : is this maybe related to the problem we are seeing with the EMATeacher where we need to reshard?

src/weathergen/model/model.py

Rebase

clessig

Essentially ready to merge, just some minor details.

clessig · 2026-03-29T11:26:10Z

src/weathergen/model/engines.py

+
+    def __init__(self):
+        super().__init__()
+        self.fe_blocks = torch.nn.ModuleList()


self.blocks = ... Also do we need this as all? If anything I would expect self.blocks = torch.nn.Identity() but since we implement forward it might not be needed

I actually added that for this:

num_params_fe = get_num_parameters(self.forecast_engine.fe_blocks)

See below.

clessig · 2026-03-29T11:30:06Z

src/weathergen/model/model.py

        tokens = tokens.reshape(shape).sum(axis=1)

+        # Allow for pushforward trick
+        p_fwd = self.cf.get("pushforward_trick", False)


I would expect that the push-forward trick config is part of forecast, i.e.

training_config : .... forecast: num_steps: 3 push_forward: True

github-project-automation bot added this to WeatherGen-dev Mar 6, 2026

github-actions bot added model Related to model training or definition (not generic infra) science Scientific questions labels Mar 9, 2026

SavvasMel requested a review from clessig March 12, 2026 11:57

clessig reviewed Mar 15, 2026

View reviewed changes

SavvasMel force-pushed the SavvasMel/develop/pushf_trick branch from 056d4be to 8ad0cff Compare March 28, 2026 10:58

github-actions bot added data Anything related to the datasets used in the project eval anything related to the model evaluation pipeline infra Issues related to infrastructure labels Mar 28, 2026

SavvasMel added 6 commits March 28, 2026 12:06

Add changes for pushforward trick

3bee53b

Rebase

Remove comments

fac6735

Linting

44275f1

Introduce an Identity Engine

95e55cf

apply changes based on review

6d3ac78

Lots of Linting

02a0d46

SavvasMel force-pushed the SavvasMel/develop/pushf_trick branch from 2f89054 to 02a0d46 Compare March 28, 2026 11:09

SavvasMel requested a review from clessig March 28, 2026 11:11

clessig reviewed Mar 29, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add changes for pushforward trick#1997

Add changes for pushforward trick#1997
SavvasMel wants to merge 6 commits intoecmwf:developfrom
SavvasMel:SavvasMel/develop/pushf_trick

SavvasMel commented Mar 6, 2026

Uh oh!

clessig Mar 15, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

clessig left a comment

Uh oh!

clessig Mar 29, 2026

Uh oh!

SavvasMel Mar 29, 2026

Uh oh!

clessig Mar 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

SavvasMel commented Mar 6, 2026

Description

Issue Number

Checklist before asking for review

Uh oh!

clessig Mar 15, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

clessig left a comment

Choose a reason for hiding this comment

Uh oh!

clessig Mar 29, 2026

Choose a reason for hiding this comment

Uh oh!

SavvasMel Mar 29, 2026

Choose a reason for hiding this comment

Uh oh!

clessig Mar 29, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants