[task] fix: fix bug for args.train.accelerator.fsdp_config.mixed_precision.enable #638
Conversation
Code Review
This pull request updates various training scripts to use the mixed precision configuration object instead of a boolean flag. However, the review identifies a critical issue where the build_parallelize_model function expects a MixedPrecisionConfig object rather than a boolean, meaning the current implementation ignores the user's configuration. The reviewer suggests passing the full mixed_precision configuration object to ensure the settings are correctly applied.
  enable_full_shard=args.train.accelerator.fsdp_config.full_shard,
  enable_reshard_after_forward=args.train.accelerator.fsdp_config.reshard_after_forward,
- enable_mixed_precision=args.train.enable_mixed_precision,
+ enable_mixed_precision=args.train.accelerator.fsdp_config.mixed_precision.enable,
The build_parallelize_model function expects a mixed_precision parameter of type MixedPrecisionConfig, not a boolean enable_mixed_precision. Passing a boolean to a non-existent parameter name will cause the function to use its default MixedPrecisionConfig(enable=True), effectively ignoring the user's configuration for mixed precision.
Suggested change:
- enable_mixed_precision=args.train.accelerator.fsdp_config.mixed_precision.enable,
+ mixed_precision=args.train.accelerator.fsdp_config.mixed_precision,
  enable_full_shard=args.train.accelerator.fsdp_config.full_shard,
  enable_reshard_after_forward=args.train.accelerator.fsdp_config.reshard_after_forward,
- enable_mixed_precision=args.train.enable_mixed_precision,
+ enable_mixed_precision=args.train.accelerator.fsdp_config.mixed_precision.enable,
The build_parallelize_model function expects a mixed_precision parameter of type MixedPrecisionConfig, not a boolean enable_mixed_precision. Passing a boolean to a non-existent parameter name will cause the function to use its default MixedPrecisionConfig(enable=True), effectively ignoring the user's configuration for mixed precision.
Suggested change:
- enable_mixed_precision=args.train.accelerator.fsdp_config.mixed_precision.enable,
+ mixed_precision=args.train.accelerator.fsdp_config.mixed_precision,
  enable_full_shard=args.train.accelerator.fsdp_config.full_shard,
  enable_reshard_after_forward=args.train.accelerator.fsdp_config.reshard_after_forward,
- enable_mixed_precision=args.train.enable_mixed_precision,
+ enable_mixed_precision=args.train.accelerator.fsdp_config.mixed_precision.enable,
The build_parallelize_model function expects a mixed_precision parameter of type MixedPrecisionConfig, not a boolean enable_mixed_precision. Passing a boolean to a non-existent parameter name will cause the function to use its default MixedPrecisionConfig(enable=True), effectively ignoring the user's configuration for mixed precision.
Suggested change:
- enable_mixed_precision=args.train.accelerator.fsdp_config.mixed_precision.enable,
+ mixed_precision=args.train.accelerator.fsdp_config.mixed_precision,
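The silent-default behavior the reviewer describes would only occur if build_parallelize_model accepts a **kwargs catch-all; otherwise an unknown keyword would raise a TypeError. Below is a minimal sketch of that pitfall and of the suggested fix. The MixedPrecisionConfig fields and the **kwargs signature are assumptions for illustration, not the project's actual definitions.

```python
from dataclasses import dataclass


@dataclass
class MixedPrecisionConfig:
    # Hypothetical fields; the real config lives in the project's config module.
    enable: bool = True
    param_dtype: str = "bfloat16"


def build_parallelize_model(model, mixed_precision=None, **kwargs):
    # If the real function accepts **kwargs (e.g. to forward extra options),
    # a misspelled keyword like enable_mixed_precision= is silently swallowed
    # here instead of raising a TypeError.
    if mixed_precision is None:
        mixed_precision = MixedPrecisionConfig()  # default wins
    return model, mixed_precision


# Buggy call: the boolean lands in **kwargs, so the default
# MixedPrecisionConfig(enable=True) is used regardless of the user's setting.
_, mp_buggy = build_parallelize_model("model", enable_mixed_precision=False)
assert mp_buggy.enable is True  # user's False was ignored

# Suggested fix: pass the whole config object under the correct parameter name.
_, mp_fixed = build_parallelize_model(
    "model", mixed_precision=MixedPrecisionConfig(enable=False)
)
assert mp_fixed.enable is False  # user's setting is now honored
```

Passing the full config object (rather than re-deriving a boolean at each call site) also keeps any other fields on the config, such as dtype choices, flowing through to the parallelization logic.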
Hi, can you help fix the lint error with …
I will do it immediately.
done
What does this PR do?
Checklist Before Starting
- PR title follows the [{modules}] {type}: {description} format (see check_pr_title.yml for the full list of allowed modules and types)
- Breaking API changes are marked with a [BREAKING] prefix, e.g. [BREAKING][parallel, model] feat: dynamic batching

Test
API and Usage Example
Design & Code Changes
Checklist Before Submitting
- If tasks/training scripts were moved or renamed: updated docs/examples and verified python3 scripts/ci/check_doc_task_paths.py passes (also enforced by the Check doc task paths CI workflow)