Benchmark Details

Strawberry challenge
Benchmark Information

Count how many R's are in the provided word.

Category: Reasoning
Visibility: PUBLIC
Max Completion Tokens: 1
Created: Loading...
Updated: Loading...
Input Tokens:

23

Est. Output Tokens:

1

System Prompt
Count how many R's are in the provided word. You can only answer with a number.
Validation Rules
Response Contains Text All Text
Benchmark Steps
# User Prompt Response
1
Strawberry
3