Skip to content

Pull requests: ByteDance-Seed/VeOmni

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

fix: NPU Hang/Deadlock during DTensor parameter loading in FSDP2 ascend everything about Ascend support fix
#642 opened Apr 10, 2026 by First-Frost-code Loading…
[WIP]: support npu qwen3p5 and qwen3p5-vl ascend everything about Ascend support wip
#641 opened Apr 10, 2026 by yicheng-gong Contributor Loading…
[ci] fix: disable pin_memory in multisource dataset test for NPU ascend everything about Ascend support ci fix
#639 opened Apr 9, 2026 by TimYangst Collaborator Loading…
4 tasks
[docker] feat: update to torch2.11 + cu130 docker
#629 opened Apr 2, 2026 by FoolPlayer Collaborator Loading…
[model]feat: add NPU support for Qwen3.5 ascend everything about Ascend support
#628 opened Apr 2, 2026 by yanghw116 Loading…
[agent] feat: add profile skill
#626 opened Apr 1, 2026 by FoolPlayer Collaborator Loading…
[model]feat: Qwen3.5 is compatible with NPU ascend everything about Ascend support
#600 opened Mar 23, 2026 by wang-hua-2019 Contributor Loading…
[model] feat: [transformers v5] support qwen3vl for transformer v5 hf_v5 Related for transformers v5
#527 opened Mar 2, 2026 by yiwzhao Collaborator Loading…
[data] feat: add MultiSourceDataset for weighted sampling
#522 opened Feb 28, 2026 by hjshi84 Collaborator Draft
6 tasks
[misc] fix: use dedicated Gloo process group for HF safetensor save to avoid NCCL timeouts ckpt Checkpoint related. fix misc Every misc
#492 opened Feb 18, 2026 by Ziyi-Wang Collaborator Loading…
[task] feat: support sequence classification tasks
#470 opened Feb 11, 2026 by yiwzhao Collaborator Loading…
[model] fix: Incorrect usage of the 'check_model_inputs' decorator fix
#457 opened Feb 5, 2026 by HSYZhang Contributor Loading…
6 tasks done
[misc] chore: add_copy_right misc Every misc
#438 opened Jan 30, 2026 by FoolPlayer Collaborator Loading…
Draft [models] feat: Add a modeling patch gen sample for qwen3
#424 opened Jan 26, 2026 by piyifan123 Collaborator Loading…
6 tasks
[docs]Update ascend_quick_start doc ascend everything about Ascend support
#225 opened Nov 27, 2025 by Alter-A1ways Loading…
4 of 6 tasks
[optim, config] feat: add support for Muon optimizer via dion doc Improvements or additions to documentation
#216 opened Nov 25, 2025 by clarkipeng Loading…
[fix] [model] auto-patch all Attention layers to ensure cu_seq_lens stays on CPU for NPU fused-attention. ascend everything about Ascend support
#199 opened Nov 17, 2025 by A1waysBeenHere Contributor Loading…
4 of 6 tasks
Add TensorBoard support for training metrics logging
#195 opened Nov 14, 2025 by iqiancheng Contributor Loading…
train qwen3-vl-moe on ShareGPT4V-small with quick-start
#194 opened Nov 14, 2025 by iqiancheng Contributor Loading…
enhance logs with tflops/mfu
#151 opened Oct 20, 2025 by ziqi-wlb Contributor Loading…
ProTip! Type g i on any issue or pull request to go back to the issue listing page.