LLMonitor Benchmarks

leaderboard | dataset | compare | about


"When a measure becomes a target, it ceases to be a good measure."

How this works:


Comments on rubrics:


Notes

Edit: as this got popular, I added an email form to receive notifications for future benchmark results:


(no spam, max 1 email per month)