Google: Gemini 1.5 Pro

Text input Image input Text output
Author's Description

Google's latest multimodal model, supports image and video[0] in text or chat prompts. Optimized for language tasks including: - Code generation - Text generation - Text editing - Problem solving - Recommendations - Information extraction - Data extraction or generation - AI agents Usage of Gemini is subject to Google's [Gemini Terms of Use](https://ai.google.dev/terms). * [0]: Video input is not available through OpenRouter at this time.

Key Specifications
Cost
$$$$
Context
2M
Parameters
1T (Rumoured)
Released
Apr 08, 2024
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Stop Presence Penalty Tool Choice Top P Temperature Seed Tools Structured Outputs Response Format Frequency Penalty Max Tokens
Features

This model supports the following features:

Tools Structured Outputs Response Format
Performance Summary

Google's Gemini 1.5 Pro, a multimodal model launched on April 8, 2024, demonstrates strong overall performance, consistently ranking among the fastest models available and offering highly competitive pricing. Its reliability is exceptional, with a 100th percentile ranking indicating minimal technical failures. In terms of benchmark performance, Gemini 1.5 Pro exhibits notable strengths in several areas. It achieved perfect accuracy in the Ethics (Baseline) benchmark, standing out as the most accurate model at its price point and speed. Its General Knowledge (Baseline) performance was also very strong at 99.5% accuracy, placing it in the 83rd percentile. The model showed solid capabilities in Email Classification (98.0% accuracy) and Reasoning (70.0% accuracy). While its Coding (Baseline) accuracy of 87.0% is respectable, it falls within the 73rd percentile. A significant weakness was observed in the Instruction Following (Baseline) benchmark, where it registered 0.0% accuracy, indicating a critical area for improvement. Despite this, its overall cost-effectiveness and speed across most tasks make it a compelling option for various language-centric applications, including code generation, text editing, and information extraction.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $1.25
Completion $5

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Google
Google | google/gemini-pro-1.5 2M $1.25 / 1M tokens $5 / 1M tokens
Google AI Studio
Google AI Studio | google/gemini-pro-1.5 2M $1.25 / 1M tokens $5 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Free Executions Accuracy Cost Duration
Other Models by google