Author's Description
The 2B version of the InternVL3 series, offering even higher inference speed with very reasonable performance. InternVL3 is an advanced multimodal large language model (MLLM) series that demonstrates superior overall performance. Compared to InternVL 2.5, InternVL3 exhibits superior multimodal perception and reasoning capabilities, while further extending its multimodal capabilities to encompass tool usage, GUI agents, industrial image analysis, 3D vision perception, and more.
Performance Summary
The OpenGVLab: InternVL3 2B model, created on April 30, 2025, is positioned as a multimodal large language model that emphasizes speed and reasonable performance. It is faster than most models, ranking around the 70th percentile for speed across benchmarks, and its pricing is highly competitive, placing it in the 88th percentile.

Across benchmark categories, InternVL3 2B shows significant weaknesses in knowledge-based tasks. Its accuracy on General Knowledge is notably low at 15.0% (12th percentile), and its Ethics accuracy is 27.0% (12th percentile). On both benchmarks the model keeps competitive costs and moderate durations despite the low accuracy. Its Email Classification accuracy of 79.0% is also low relative to other models (8th percentile), although cost is very competitive and duration is relatively fast.

Overall, the model's primary strengths are speed and cost-efficiency. This makes it an attractive option for applications where those factors are paramount, even at the expense of accuracy in knowledge-based and classification tasks.
Model Pricing
Current Pricing
| Feature | Price (per 1M tokens) |
|---|---|
| Prompt | $0.05 |
| Completion | $0.1 |
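To make the per-token pricing concrete, the following is a minimal sketch of how a request's cost could be estimated from the rates in the table above. The function name `estimate_cost` and the example token counts are illustrative assumptions, not part of any provider API; actual billing may also depend on how images are tokenized, which is not covered here.

```python
# Rates taken from the pricing table above (USD per 1M tokens).
PROMPT_PRICE_PER_M = 0.05
COMPLETION_PRICE_PER_M = 0.10


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes cost scales linearly with token counts at the listed rates.
    """
    if prompt_tokens < 0 or completion_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        prompt_tokens * PROMPT_PRICE_PER_M
        + completion_tokens * COMPLETION_PRICE_PER_M
    )
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 10,000-token prompt with a 2,000-token completion.
    print(f"${estimate_cost(10_000, 2_000):.6f}")
```

Under these assumptions, one million prompt tokens plus one million completion tokens would come to about $0.15.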
Available Endpoints
| Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
|---|---|---|---|---|
| Nineteen | opengvlab/internvl3-2b | 12K | $0.05 / 1M tokens | $0.1 / 1M tokens |
Benchmark Results
| Benchmark | Category | Reasoning | Strategy | Free | Executions | Accuracy | Cost | Duration |
|---|---|---|---|---|---|---|---|---|
Other Models by opengvlab
| Model | Released | Params | Context | Modalities | Speed | Ability | Cost |
|---|---|---|---|---|---|---|---|
| OpenGVLab: InternVL3 78B | Sep 15, 2025 | 78B | 32K | Image input, Text input, Text output | ★★ | ★★★★ | $ |
| OpenGVLab: InternVL3 14B (unavailable) | Apr 30, 2025 | 14B | 12K | Image input, Text input, Text output | ★★★★★ | ★★★★★ | $$$ |