Quality Index
20.0
186th of 440
Top 43%
Coding Index
19.4
152nd of 350
Top 43%
Math Index
29.0
187th of 268
Top 70%
Price/1M
$0.90
472nd cheapest
190% above median
Top 70%
Speed
26 tok/s
Top 60%
TTFT
1.45s
Context Window
160K
144th largest
Top 41%
Input
$0.45
per 1M tokens
Output
$2.25
per 1M tokens
Blended
$0.90
per 1M tokens
Cheaper than 30% of models. Median price is $0.31/1M tokens.
Daily
$0.90
Monthly
$27.00
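The pricing figures above are mutually consistent under two assumptions that the page does not state: the blended price weights input and output tokens 3:1, and the daily and monthly costs assume 1M blended tokens per day over a 30-day month. A minimal Python sketch of that arithmetic, with hypothetical function names chosen for illustration:

```python
def blended_price(input_price, output_price, input_weight=3, output_weight=1):
    """Weighted average price per 1M tokens.

    The 3:1 input:output weighting is an assumption; it happens to
    reproduce the $0.90 blended figure from $0.45 input and $2.25 output.
    """
    total_weight = input_weight + output_weight
    return (input_price * input_weight + output_price * output_weight) / total_weight


def percent_above(value, median):
    """How far `value` sits above `median`, as a percentage of the median."""
    return (value - median) / median * 100


blended = blended_price(0.45, 2.25)       # price per 1M tokens
daily_cost = blended * 1                  # assumes 1M tokens per day
monthly_cost = daily_cost * 30            # assumes a 30-day month
above_median = percent_above(blended, 0.31)

print(f"Blended: ${blended:.2f} per 1M tokens")
print(f"Monthly: ${monthly_cost:.2f}")
print(f"Above median: {above_median:.0f}%")
```

If the actual usage volume differs from 1M tokens per day, scale `daily_cost` by the real daily volume in millions of tokens.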
26
tokens/sec
Faster than 40% of models
1.45
seconds
Faster than 18% of models
Market Median
46 tok/s
43% slower
Median TTFT
0.43s
237% slower
Throughput/Dollar
29
tok/s per $/1M
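The speed comparisons above can be reproduced with simple ratios. The sketch below assumes that the throughput gap is measured relative to the median speed, that the TTFT gap is measured relative to the median latency, and that throughput per dollar divides speed by the blended price. These formulas are inferred from the displayed numbers rather than documented by the page, and the function names are hypothetical.

```python
def percent_slower_throughput(speed, median_speed):
    """Throughput shortfall relative to the median (higher speed is better)."""
    return (median_speed - speed) / median_speed * 100


def percent_slower_latency(ttft, median_ttft):
    """Extra time to first token relative to the median (lower TTFT is better)."""
    return (ttft - median_ttft) / median_ttft * 100


def throughput_per_dollar(speed, blended_price):
    """Tokens per second obtained per dollar of blended price per 1M tokens."""
    return speed / blended_price


speed_gap = percent_slower_throughput(26, 46)
ttft_gap = percent_slower_latency(1.45, 0.43)
value = throughput_per_dollar(26, 0.90)

print(f"Throughput: {speed_gap:.1f}% slower than median")
print(f"TTFT: {ttft_gap:.1f}% slower than median")
print(f"Throughput/Dollar: {value:.1f} tok/s per $/1M")
```

Note that the two gaps use different baselines in direction: a lower tok/s and a higher TTFT both count as slower.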
Speed Comparison
Context Window
160K
tokens
Larger than 59% of models
Max Output
33K
tokens
20% of context
Downloads (30d)
1.0M
Likes
981
License
apache-2.0
Permissive license allowing commercial use, modification, and distribution.
Hardware Requirements
Quantization Available
Check Hugging Face for GGUF, GPTQ, and AWQ quantized versions, which require significantly less VRAM.