Arcee AI: Virtuoso Large

Text input Text output
Author's Description

Virtuoso‑Large is Arcee's top‑tier general‑purpose LLM at 72 B parameters, tuned to tackle cross‑domain reasoning, creative writing and enterprise QA. Unlike many 70 B peers, it retains the 128 k context inherited from Qwen 2.5, letting it ingest books, codebases or financial filings wholesale. Training blended DeepSeek R1 distillation, multi‑epoch supervised fine‑tuning and a final DPO/RLHF alignment stage, yielding strong performance on BIG‑Bench‑Hard, GSM‑8K and long‑context Needle‑In‑Haystack tests. Enterprises use Virtuoso‑Large as the "fallback" brain in Conductor pipelines when other SLMs flag low confidence. Despite its size, aggressive KV‑cache optimizations keep first‑token latency in the low‑second range on 8× H100 nodes, making it a practical production‑grade powerhouse.

Key Specifications
Cost
$$$$
Context
131K
Parameters
72B (Rumoured)
Released
May 05, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Stop Presence Penalty Logit Bias Tool Choice Top P Temperature Min P Tools Frequency Penalty Max Tokens
Features

This model supports the following features:

Tools
Performance Summary

Arcee AI's Virtuoso-Large, a 72B parameter LLM, demonstrates strong overall performance, particularly excelling in speed and reliability. It consistently ranks among the fastest models, achieving the 93rd percentile across six benchmarks, and offers competitive pricing, placing in the 44th percentile. Notably, its reliability is exceptional, with a perfect 100th percentile ranking, indicating minimal technical failures and consistent response delivery. Across specific benchmarks, Virtuoso-Large shows a balanced profile. It achieved perfect accuracy in Ethics and near-perfect scores in Email Classification (99.0%) and General Knowledge (99.5%), often being the most accurate among models of comparable speed. Its Instruction Following capabilities are strong at 63.0% accuracy, securing the top spot for speed in this category. While its Coding (84.0%) and Reasoning (72.0%) scores are solid, they are not its absolute strongest areas, though still competitive. The model's 128k context length, inherited from Qwen 2.5, is a significant strength, enabling it to process extensive documents. Despite its size, aggressive KV-cache optimizations ensure practical production-grade latency. Virtuoso-Large is positioned as a robust "fallback" brain for complex enterprise tasks.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.75
Completion $1.2

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Together
Together | arcee-ai/virtuoso-large 131K $0.75 / 1M tokens $1.2 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Free Executions Accuracy Cost Duration
Other Models by arcee-ai