Pull requests: datajuicer/data-juicer
Pull requests list
fix: resolve 'multiple values for num_proc' error in TextFormatter
#905
opened Feb 4, 2026 by
cmgzn
[WIP] Feat: Add video_calibration_mapper and video_split_by_frame_mapper
#902
opened Feb 1, 2026 by
1van2ha0
3 tasks
[WIP] Add Camera Pose op
dj:multimodal
issues/PRs about multimodal data processing
dj:op
issues/PRs about some specific OPs
enhancement
New feature or request
#894
opened Jan 27, 2026 by
Qirui-jiao
Add Hand Reconstruction op (HaWoR)
dj:multimodal
issues/PRs about multimodal data processing
dj:op
issues/PRs about some specific OPs
enhancement
New feature or request
#893
opened Jan 27, 2026 by
Qirui-jiao
[feature] op-level isolated environment spec in ray mode
dj:dist
issues/PRs about distributed data processing
dj:op
issues/PRs about some specific OPs
enhancement
New feature or request
environment
related to third-party dependency, DJ-pypi, DJ-docker, etc.
#892
opened Jan 23, 2026 by
HYLcool
[WIP] feat: Support iceberg, hudi, delta, hdfs data source.
dj:core
issues/PRs about the core functions of Data-Juicer
dj:dataset
issues/PRs about the dj-dataset
#875
opened Jan 6, 2026 by
Dludora
[WIP] feat: Pr 839 s3 download checkpoint resume and unittest for s3 download
#870
opened Dec 25, 2025 by
Dludora
Depth seg new op
dj:op
issues/PRs about some specific OPs
#862
opened Dec 22, 2025 by
archernsy
Add Operator-Level Parallel Data Processing with Ray Actors
dj:dist
issues/PRs about distributed data processing
dj:efficiency
regarding to efficiency issues and enhancements
enhancement
New feature or request
#761
opened Aug 19, 2025 by
Cccccc0630
[NewOp] Add generate_challenging_qa_mapper based on MindGYM principles
#703
opened Jun 14, 2025 by
Bat-Reality
[WIP] Optimization framework
dj:core
issues/PRs about the core functions of Data-Juicer
dj:efficiency
regarding to efficiency issues and enhancements
#702
opened Jun 13, 2025 by
cyruszhang
[NewOp] Add domain_diversity_selector based on DaaR principles
#699
opened Jun 12, 2025 by
lingzhq
Add humanvbench operators
dj:multimodal
issues/PRs about multimodal data processing
dj:op
issues/PRs about some specific OPs
good first issue
Good for newcomers
#553
opened Jan 17, 2025 by
SYSUzhouting