Feedback Form Source Code for Java Projects

Provider-agnostic, open-source evaluation infrastructure for language models

openbench provides standardized, reproducible benchmarking for LLMs across 30+ evaluation suites (and growing) spanning knowledge, math, reasoning, coding, science, reading comprehension, health, long ...

Build Faster with Auto Claude, Open Source AI That Plans, Codes & Syncs with GitHub

Stay in flow with Auto Claude using multi-terminal tools and session restore, so you run tests and pick up where you left off ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Provider-agnostic, open-source evaluation infrastructure for language models

Build Faster with Auto Claude, Open Source AI That Plans, Codes & Syncs with GitHub

Trending now