A text-based Mixture-of-Experts (MoE) model with 21B total parameters, of which 3B are activated per token. It uses heterogeneous MoE structures with modality-isolated routing for multimodal understanding and generation, and supports a 131K-token context length. Inference is made efficient through multi-expert parallel collaboration and quantization. Post-training includes SFT, DPO, and UPO, and specialized routing and balancing losses are used to improve task handling across diverse applications.
Price/1M
$0.12
241st cheapest
60% below the median
Top 36%
Context Window
120K
267th largest
Top 76%
Input
$0.07
per 1M tokens
Output
$0.28
per 1M tokens
Combined
$0.12
per 1M tokens
Cheaper than 64% of models. The median price is $0.31/1M tokens.
Daily
$0.12
Monthly
$3.68
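The page does not say how the combined, daily, and monthly figures are derived. As a rough sketch, one set of assumptions that reproduces the listed numbers is a 3:1 input-to-output token blend, a usage of 1M tokens per day, and a 30-day month. The weights, usage volume, and month length below are assumptions chosen for illustration, not values stated by the listing.

```python
from decimal import Decimal, ROUND_HALF_UP

# Prices listed on the page, in USD per 1M tokens.
INPUT_PRICE = Decimal("0.07")
OUTPUT_PRICE = Decimal("0.28")
MEDIAN_PRICE = Decimal("0.31")

# Assumptions (not stated on the page): blend ratio, daily volume, month length.
INPUT_WEIGHT = 3
OUTPUT_WEIGHT = 1
MILLION_TOKENS_PER_DAY = 1
DAYS_PER_MONTH = 30


def blended_price(input_price, output_price, input_weight, output_weight):
    """Weighted average of input and output prices per 1M tokens."""
    total_weight = input_weight + output_weight
    return (input_price * input_weight + output_price * output_weight) / total_weight


def to_cents(amount):
    """Round a dollar amount to two decimal places, half up."""
    return amount.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)


combined = blended_price(INPUT_PRICE, OUTPUT_PRICE, INPUT_WEIGHT, OUTPUT_WEIGHT)
daily = combined * MILLION_TOKENS_PER_DAY
monthly = daily * DAYS_PER_MONTH
below_median = (MEDIAN_PRICE - combined) / MEDIAN_PRICE * 100
below_median_pct = below_median.quantize(Decimal("1"), rounding=ROUND_HALF_UP)

print(f"combined: ${to_cents(combined)}")       # combined: $0.12
print(f"daily:    ${to_cents(daily)}")          # daily:    $0.12
print(f"monthly:  ${to_cents(monthly)}")        # monthly:  $3.68
print(f"below median: {below_median_pct}%")     # below median: 60%
```

Under these assumptions the unrounded blend is $0.1225 per 1M tokens, which explains why the monthly figure is $3.68 rather than 30 × $0.12 = $3.60. A different workload mix (for example, output-heavy generation) would give a higher effective price.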
Context Window
120K
tokens
Larger than 24% of models
Max Output
8K
tokens
7% of context
Context Window Comparison