Reka Edge

Image input Video input Text input Text output Unavailable
Author's Description

Reka Edge is an extremely efficient 7B multimodal vision-language model that accepts image/video+text inputs and generates text outputs. This model is optimized specifically to deliver industry-leading performance in image understanding, video analysis, object detection, and agentic tool-use.

Key Specifications
Cost
$$
Context
16K
Released
Mar 20, 2026
Supported Parameters

This model supports the following parameters:

Tool Choice Tools Temperature Max Tokens Structured Outputs Presence Penalty Stop Top P Seed Frequency Penalty
Features

This model supports the following features:

Structured Outputs Tools
Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.1
Completion $0.1

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Reka
Reka | rekaai/reka-edge-2603 16K $0.1 / 1M tokens $0.1 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by rekaai