Loading...
Loading...
Virtuoso‑Large is Arcee's top‑tier general‑purpose LLM at 72 B parameters, tuned to tackle cross‑domain reasoning, creative writing and enterprise QA. Unlike many 70 B peers, it retains the 128 k context inherited from Qwen 2.5, letting it ingest books, codebases or financial filings wholesale. Training blended DeepSeek R1 distillation, multi‑epoch supervised fine‑tuning and a final DPO/RLHF alignment stage, yielding strong performance on BIG‑Bench‑Hard, GSM‑8K and long‑context Needle‑In‑Haystack tests. Enterprises use Virtuoso‑Large as the "fallback" brain in Conductor pipelines when other SLMs flag low confidence. Despite its size, aggressive KV‑cache optimizations keep first‑token latency in the low‑second range on 8× H100 nodes, making it a practical production‑grade powerhouse.
Preço/1M
$0.86
462nd mais barato
178% acima da mediana
Top 68%
Janela de Contexto
131K
145th maior
Top 63%
Entrada
$0.75
por 1M tokens
Saída
$1.20
por 1M tokens
Combinado
$0.86
por 1M tokens
Mais barato que 32% dos modelos. Preço mediano é $0.31/1M tokens.
Diário
$0.86
Mensal
$25.88
Janela de Contexto
131K
tokens
Maior que 37% dos modelos
Saída Máxima
64K
tokens
49% do contexto
Comparação de Janela de Contexto