Quality Index
41.6
36th of 440
Top 8%
Coding Index
34.7
50th of 350
Top 15%
Price/1M
$1.10
491st cheapest
255% above median
Top 73%
Speed
152 tok/s
Top 12%
TTFT
1.01s
Context Window
262K
61st largest
Top 25%
Input
$0.40
per 1M tokens
Output
$3.20
per 1M tokens
Blended
$1.10
per 1M tokens
Cheaper than 27% of models. Median price is $0.31/1M tokens.
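The blended figure is consistent with a 3:1 input-to-output token weighting, though the page does not state the ratio it uses. The sketch below is a minimal illustration under that assumption; the function names and the default ratio are hypothetical and not taken from the page.

```python
# Per-1M-token prices as listed on the page.
INPUT_PRICE = 0.40
OUTPUT_PRICE = 3.20
MEDIAN_PRICE = 0.31


def blended_price(input_price: float, output_price: float,
                  input_parts: int = 3, output_parts: int = 1) -> float:
    """Weighted average of input and output prices.

    The 3:1 input:output default is an assumption; it happens to
    reproduce the listed blended price.
    """
    total_parts = input_parts + output_parts
    return (input_parts * input_price + output_parts * output_price) / total_parts


def percent_above(value: float, baseline: float) -> float:
    """How far `value` sits above `baseline`, in percent."""
    return (value / baseline - 1.0) * 100.0


blended = blended_price(INPUT_PRICE, OUTPUT_PRICE)
print(f"blended: ${blended:.2f} per 1M tokens")                      # blended: $1.10 per 1M tokens
print(f"above median: {percent_above(blended, MEDIAN_PRICE):.0f}%")  # above median: 255%
```

A workload that is heavier on output would shift the blended price toward the $3.20 output rate, so the weighting is worth adjusting to match real traffic.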
Daily
$1.10
Monthly
$33.00
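The daily and monthly figures match a usage of 1M blended tokens per day over a 30-day month. The page does not state its usage assumption, so both values in this rough sketch are assumptions, and `usage_cost` is a hypothetical helper.

```python
BLENDED_PRICE_PER_1M = 1.10  # blended price listed on the page


def usage_cost(tokens_per_day: int, days: int = 1,
               price_per_1m: float = BLENDED_PRICE_PER_1M) -> float:
    """Cost in dollars for a steady daily token volume over `days` days."""
    return tokens_per_day / 1_000_000 * price_per_1m * days


# Assumed workload: 1M tokens per day, 30-day month.
daily = usage_cost(1_000_000)
monthly = usage_cost(1_000_000, days=30)
print(f"daily: ${daily:.2f}, monthly: ${monthly:.2f}")  # daily: $1.10, monthly: $33.00
```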
Speed
152 tokens/sec
Faster than 88% of models
TTFT
1.01 seconds
Faster than 30% of models
Market Median
46 tok/s
231% faster
Median TTFT
0.43s
135% slower
Throughput/Dollar
138
tok/s per $/1M
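The relative figures above can be approximately reproduced from the rounded values shown on the page. This is a sketch only: the formulas are an assumption about how the page derives them, and the constant names are hypothetical.

```python
# Rounded values as displayed on the page.
SPEED_TOK_S = 152.0
MEDIAN_SPEED_TOK_S = 46.0
TTFT_S = 1.01
MEDIAN_TTFT_S = 0.43
BLENDED_PRICE_PER_1M = 1.10


def percent_change(value: float, baseline: float) -> float:
    """Relative difference of `value` against `baseline`, in percent."""
    return (value / baseline - 1.0) * 100.0


speed_gain = percent_change(SPEED_TOK_S, MEDIAN_SPEED_TOK_S)   # throughput vs. median
ttft_penalty = percent_change(TTFT_S, MEDIAN_TTFT_S)           # latency vs. median
throughput_per_dollar = SPEED_TOK_S / BLENDED_PRICE_PER_1M     # tok/s per $/1M

print(f"{speed_gain:.1f}% faster than the median speed")
print(f"{ttft_penalty:.1f}% slower than the median TTFT")
print(f"{throughput_per_dollar:.0f} tok/s per $/1M")
```

With these rounded inputs the speed gain comes out slightly below the listed 231%, which suggests the page computes it from unrounded measurements.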
Context Window
262K tokens
Larger than 75% of models
Max Output
66K tokens
25% of context
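For prompt budgeting, a rough sketch using the rounded figures above. It assumes that prompt and generated tokens share the same context window, which the page does not confirm; `max_prompt_tokens` is a hypothetical helper.

```python
# Rounded figures as displayed on the page; exact limits may differ.
CONTEXT_WINDOW = 262_000
MAX_OUTPUT = 66_000


def max_prompt_tokens(context_window: int, reserved_output: int) -> int:
    """Tokens left for the prompt after reserving room for the response.

    Assumes prompt and output tokens count against one shared window.
    """
    if reserved_output > context_window:
        raise ValueError("reserved output exceeds the context window")
    return context_window - reserved_output


output_share = MAX_OUTPUT / CONTEXT_WINDOW * 100
prompt_budget = max_prompt_tokens(CONTEXT_WINDOW, MAX_OUTPUT)
print(f"max output is {output_share:.0f}% of the context window")
print(f"prompt budget with full output reserved: {prompt_budget:,} tokens")
```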
524.8K
Downloads (30d)
72
Likes
apache-2.0
Permissive license allowing commercial use, modification, and distribution.
Hardware Requirements
Quantization Available
Check HuggingFace for GGUF, GPTQ, and AWQ quantized versions that require significantly less VRAM.
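To see why quantized versions need less VRAM, here is a back-of-the-envelope estimate of weight memory at different bit widths. The page does not list a parameter count, so `HYPOTHETICAL_PARAMS` is a placeholder for illustration only. The estimate covers weights alone and ignores KV cache, activations, and quantization metadata.

```python
def weight_memory_gib(num_params: float, bits_per_weight: float) -> float:
    """Approximate memory for model weights alone, in GiB."""
    return num_params * bits_per_weight / 8 / 2**30


# Placeholder value, NOT this model's actual parameter count.
HYPOTHETICAL_PARAMS = 30e9

for label, bits in [("16-bit", 16), ("8-bit", 8), ("4-bit", 4)]:
    gib = weight_memory_gib(HYPOTHETICAL_PARAMS, bits)
    print(f"{label}: ~{gib:.1f} GiB for weights")
```

Halving the bits per weight halves the weight footprint, so a 4-bit quantization needs roughly a quarter of the memory of 16-bit weights, before runtime overhead.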