Benchmarks
Visibility | Owner | Stats | Actions | |||
---|---|---|---|---|---|---|
Mathematics (Baseline) | PUBLIC | Benchable | Mathematics | 100 11 176 |
|
|
Hallucinations (Baseline) | PUBLIC | Benchable | Other | 50 15 211 |
|
|
Instruction Following (Baseline) | PUBLIC | Benchable | Instruction Following | 100 63 283 |
|
|
Keyword Topic Relevance Classification | PUBLIC | jeremygf | Classification | 10 6 12 |
|
|
Coding (Baseline) | PUBLIC | Benchable | Coding | 100 86 290 |
|
|
Reasoning (Baseline) | PUBLIC | Benchable | Reasoning | 50 25 239 |
|
|
Ethics (Baseline) | PUBLIC | Benchable | Ethics | 100 98 305 |
|
|
General Knowledge (Baseline) | PUBLIC | Benchable | Knowledge | 200 104 313 |
|
|
Email Classification (Baseline) | PUBLIC | Benchable | Classification | 100 93 316 |
|
Showing 1 to
9 of
9 benchmarks