Skip to content

Pull requests: groq/openbench

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

feat(livemcpbench): Adding support for liveMCPBench
#127 opened Aug 28, 2025 by lvogel04 Draft
1 of 23 tasks
feat: mbpp
#117 opened Aug 21, 2025 by nmayorga7 Draft
7 of 23 tasks
feat: multichallenge
#116 opened Aug 20, 2025 by nmayorga7 Draft
7 of 23 tasks
feat: graphwalks token filter
#115 opened Aug 20, 2025 by nmayorga7 Loading…
9 of 24 tasks
feat: add LiveCodeBench
#112 opened Aug 19, 2025 by TheFloatingString Loading…
3 of 23 tasks
feat: add Slide VQA
#77 opened Aug 14, 2025 by TheFloatingString Draft
feat: add chartqa
#74 opened Aug 13, 2025 by TheFloatingString Loading…
feat: add mathvista
#73 opened Aug 13, 2025 by TheFloatingString Loading…
docs: update default model name
#67 opened Aug 13, 2025 by TheFloatingString Loading…
feat: add BBQ
#66 opened Aug 13, 2025 by TheFloatingString Loading…
feat: add BBH
#62 opened Aug 13, 2025 by TheFloatingString Loading…
math_500 initial commit
#60 opened Aug 13, 2025 by srao-groq Loading…
feat: add tau-bench
#57 opened Aug 13, 2025 by TheFloatingString Draft
feat: mtob
#56 opened Aug 13, 2025 by TheFloatingString Loading…
OpenBench: Matt Shumer changes
#55 opened Aug 12, 2025 by mshumer Loading…
feat: add IFEval
#51 opened Aug 12, 2025 by TheFloatingString Loading…
feat: add CRUXEval I/O w/ COT
#27 opened Aug 11, 2025 by aj-groq Loading…
ProTip! Exclude everything labeled bug with -label:bug.