Benchable (Open Beta)

Benchmarks

Public Benchmarks 9
Category filter options: All Categories, Coding, Translation, Summarization, Classification, Knowledge, Mathematics, Information Retrieval, Ethics, Reasoning, Instruction Following, Mixed Tasks, Other, None

| Benchmark | Visibility | Owner | Category | Stats | Executions |
| --- | --- | --- | --- | --- | --- |
| Coding (Baseline) | PUBLIC | Benchable | Coding | 100, 86, 290, 1 | 86 |
| Email Classification (Baseline) | PUBLIC | Benchable | Classification | 100, 93, 316, 0 | 93 |
| Ethics (Baseline) | PUBLIC | Benchable | Ethics | 100, 98, 305, 0 | 98 |
| General Knowledge (Baseline) | PUBLIC | Benchable | Knowledge | 200, 104, 313, 0 | 104 |
| Hallucinations (Baseline) | PUBLIC | Benchable | Other | 50, 15, 211, 0 | 15 |
| Instruction Following (Baseline) | PUBLIC | Benchable | Instruction Following | 100, 63, 283, 0 | 63 |
| Keyword Topic Relevance Classification | PUBLIC | jeremygf | Classification | 10, 6, 12, 0 | 6 |
| Mathematics (Baseline) | PUBLIC | Benchable | Mathematics | 100, 11, 176, 0 | 11 |
| Reasoning (Baseline) | PUBLIC | Benchable | Reasoning | 50, 25, 239, 0 | 25 |
Showing 1 to 9 of 9 benchmarks

© 2025 Benchable. All rights reserved.
