ByteDance-Seed / VeOmni Public

Notifications You must be signed in to change notification settings
Fork 178
Star 1.8k

Code
Issues 54
Pull requests 30
Discussions
Actions
Projects
Security and quality
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security and quality
Insights

Pull requests: ByteDance-Seed/VeOmni

Labels 21 Milestones 0

New pull request New

30 Open 445 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

fix: NPU Hang/Deadlock during DTensor parameter loading in FSDP2 ascend

everything about Ascend support

fix

#642 opened Apr 10, 2026 by First-Frost-code

Loading…

[WIP]: support npu qwen3p5 and qwen3p5-vl ascend

everything about Ascend support

wip

#641 opened Apr 10, 2026 by yicheng-gong Contributor

Loading…

[ci] fix: disable pin_memory in multisource dataset test for NPU ascend

everything about Ascend support

ci fix

#639 opened Apr 9, 2026 by TimYangst Collaborator

Loading…

4 tasks

[arguments]fix bug for args.train.accelerator.fsdp_config.mixed_precision.enable bug

Something isn't working

fix

#638 opened Apr 9, 2026 by UserChen666

Loading…

[misc] fix: revert fsdp basic modules, fix it with VL models only fix misc

Every misc

#634 opened Apr 8, 2026 by heidongxianhua Contributor

Loading…

[docker] feat: update to torch2.11 + cu130 docker

#629 opened Apr 2, 2026 by FoolPlayer Collaborator

Loading…

[model]feat: add NPU support for Qwen3.5 ascend

everything about Ascend support

#628 opened Apr 2, 2026 by yanghw116

Loading…

[agent] feat: add profile skill

#626 opened Apr 1, 2026 by FoolPlayer Collaborator

Loading…

[model]feat: Qwen3.5 is compatible with NPU ascend

everything about Ascend support

#600 opened Mar 23, 2026 by wang-hua-2019 Contributor

Loading…

[model] feat: [transformers-v5] Introduce new registration based kernel replacement.

#569 opened Mar 16, 2026 by piyifan123 Collaborator

Loading…

[model] feat: [transformers v5] support qwen3vl for transformer v5 hf_v5

Related for transformers v5

#527 opened Mar 2, 2026 by yiwzhao Collaborator

Loading…

[data] feat: add MultiSourceDataset for weighted sampling

#522 opened Feb 28, 2026 by hjshi84 Collaborator • Draft

6 tasks

[parallel] feat: Vision Data Parallel — O(1) communication alternative to patch-level SP

#505 opened Feb 24, 2026 by aoshen524

Loading…

2 of 3 tasks

[models] chore: Change transformers v5 support for qwen3_moe to use HF v5 style expert weight layout and add a converter impl. hf_v5

Related for transformers v5

misc

Every misc

#500 opened Feb 24, 2026 by piyifan123 Collaborator • Draft

[misc] fix: use dedicated Gloo process group for HF safetensor save to avoid NCCL timeouts ckpt

Checkpoint related.

fix misc

Every misc

#492 opened Feb 18, 2026 by Ziyi-Wang Collaborator

Loading…

[task] feat: support sequence classification tasks

#470 opened Feb 11, 2026 by yiwzhao Collaborator

Loading…

[model] fix: Incorrect usage of the 'check_model_inputs' decorator fix

#457 opened Feb 5, 2026 by HSYZhang Contributor

Loading…

6 tasks done

[misc] chore: add_copy_right misc

Every misc

#438 opened Jan 30, 2026 by FoolPlayer Collaborator

Loading…

Draft [models] feat: Add a modeling patch gen sample for qwen3

#424 opened Jan 26, 2026 by piyifan123 Collaborator

Loading…

6 tasks

[docs]Update ascend_quick_start doc ascend

everything about Ascend support

#225 opened Nov 27, 2025 by Alter-A1ways

Loading…

4 of 6 tasks

[optim, config] feat: add support for Muon optimizer via dion doc

Improvements or additions to documentation

#216 opened Nov 25, 2025 by clarkipeng

Loading…

[fix] [model] auto-patch all Attention layers to ensure cu_seq_lens stays on CPU for NPU fused-attention. ascend

everything about Ascend support

#199 opened Nov 17, 2025 by A1waysBeenHere Contributor

Loading…

4 of 6 tasks

Add TensorBoard support for training metrics logging

#195 opened Nov 14, 2025 by iqiancheng Contributor

Loading…

train qwen3-vl-moe on ShareGPT4V-small with quick-start

#194 opened Nov 14, 2025 by iqiancheng Contributor

Loading…

enhance logs with tflops/mfu

#151 opened Oct 20, 2025 by ziqi-wlb Contributor

Loading…

Previous 1 2 Next

Previous Next

ProTip! Type g i on any issue or pull request to go back to the issue listing page.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!