-
Notifications
You must be signed in to change notification settings - Fork 578
Pull requests: EvolvingLMMs-Lab/lmms-eval
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fix: release accelerator model refs during cleanup
#1321
opened May 2, 2026 by
xk-huang
Loading…
1 of 7 tasks
feat: add HD-EPIC VQA benchmark (CVPR 2025)
#1316
opened Apr 30, 2026 by
aliazani
Loading…
1 task done
fix(api/task): handle None generation responses in process_results
#1311
opened Apr 26, 2026 by
dankit
Loading…
1 of 7 tasks
feat: add Bedrock and local vLLM providers for llm_judge
#1298
opened Apr 14, 2026 by
ShownX
Loading…
Fix missing Task import for type annotation in evaluator
#1291
opened Apr 10, 2026 by
luv-oct22
Loading…
2 tasks
feat: add physics reasoning benchmarks (PhysBench, ContPhy, PhysGame, PhysicsRW, PhysReason)
#1272
opened Mar 26, 2026 by
Luodian
Contributor
Loading…
4 tasks
feat: add VBench video generation evaluation benchmark
#1271
opened Mar 26, 2026 by
Luodian
Contributor
Loading…
3 tasks
feat: add MiniMax as LLM judge provider
#1263
opened Mar 22, 2026 by
octo-patch
Loading…
3 tasks done
ProTip!
Adding no:label will show everything without a label.