Benchmarks
| Name | Visibility | Owner | Category | Stats |
|---|---|---|---|---|
| inai | PUBLIC | renish11 | Coding | 0 1 0 |
| Mathematics (Baseline) | PUBLIC | Benchable | Mathematics | 100 38 201 |
| Hallucinations (Baseline) | PUBLIC | Benchable | Other | 50 40 239 |
| Instruction Following (Baseline) | PUBLIC | Benchable | Instruction Following | 100 88 312 |
| Keyword Topic Relevance Classification | PUBLIC | jeremygf | Classification | 10 6 12 |
| Coding (Baseline) | PUBLIC | Benchable | Coding | 100 112 319 |
| Reasoning (Baseline) | PUBLIC | Benchable | Reasoning | 50 50 267 |
| Ethics (Baseline) | PUBLIC | Benchable | Ethics | 100 123 334 |
| General Knowledge (Baseline) | PUBLIC | Benchable | Knowledge | 200 129 341 |
| Email Classification (Baseline) | PUBLIC | Benchable | Classification | 100 118 344 |
Showing 1 to 10 of 10 benchmarks