Quality Index: 37.1 (60th of 440, top 14%)
Coding Index: 30.3 (78th of 350, top 22%)
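The "top N%" figures for the two indices appear to be the rank divided by the number of ranked models. The page does not state its method, so the following is a minimal sketch that assumes simple rounding to a whole percent; `top_percent` is a hypothetical helper name.

```python
def top_percent(rank: int, total: int) -> int:
    """Share of the ranked field at or above this rank, as a whole percent."""
    # Assumption: the page rounds to the nearest whole percent.
    return round(100 * rank / total)

quality_top = top_percent(60, 440)   # Quality Index: 60th of 440
coding_top = top_percent(78, 350)    # Coding Index: 78th of 350
print(quality_top, coding_top)       # prints: 14 22
```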
Price/1M: $0.69 (418th cheapest, 122% above median, top 62%)
Speed: 112 tok/s (top 24%)
TTFT: 1.02s
Context Window: 262K (61st largest, top 25%)
Input: $0.25 per 1M tokens
Output: $2.00 per 1M tokens
Blended: $0.69 per 1M tokens
Cheaper than 38% of models. Median price is $0.31/1M tokens.
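The blended price is consistent with weighting input and output prices 3:1, a common convention on model comparison sites. The page does not state its weighting, so the ratio below is an assumption, and `blended_price` is a hypothetical helper name. A minimal sketch:

```python
# Listed prices in USD per 1M tokens.
INPUT_PRICE = 0.25
OUTPUT_PRICE = 2.00
MEDIAN_PRICE = 0.31

# Assumption: 3 input tokens for every 1 output token.
INPUT_WEIGHT, OUTPUT_WEIGHT = 3, 1

def blended_price(inp: float, out: float,
                  w_in: int = INPUT_WEIGHT, w_out: int = OUTPUT_WEIGHT) -> float:
    """Weighted average of input and output prices."""
    return (w_in * inp + w_out * out) / (w_in + w_out)

blended = blended_price(INPUT_PRICE, OUTPUT_PRICE)
pct_above_median = 100 * (blended - MEDIAN_PRICE) / MEDIAN_PRICE
print(f"${blended:.2f}", f"{pct_above_median:.0f}%")  # prints: $0.69 122%
```

Under this assumption the unrounded blended price is $0.6875, which reproduces both the displayed $0.69 and the "122% above median" figure.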
Daily: $0.69
Monthly: $20.64
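The daily and monthly figures match a workload of about 1M blended tokens per day over a 30-day month. Both the workload and the month length are assumptions, since the page does not state them, and `monthly_cost` is a hypothetical helper. The price of $0.688 used below is also a guess: it is the 3:1 blended price ($0.6875) rounded to three decimals, which happens to reproduce the displayed totals.

```python
def monthly_cost(price_per_million: float, tokens_per_day: int, days: int = 30):
    """Return (daily, monthly) cost in USD for a steady daily token volume."""
    daily = price_per_million * tokens_per_day / 1_000_000
    return daily, daily * days

# Assumptions: 1M tokens per day, 30-day month, blended price of $0.688.
daily, monthly = monthly_cost(0.688, 1_000_000)
print(f"${daily:.2f} per day, ${monthly:.2f} per month")
```

To estimate a different workload, change `tokens_per_day`; the cost scales linearly.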
Speed: 112 tokens/sec (faster than 76% of models)
TTFT: 1.02 seconds (faster than 28% of models)
Market median speed: 46 tok/s (this model is 144% faster)
Median TTFT: 0.43s (this model is 138% slower)
Throughput/Dollar: 163 tok/s per $/1M
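The comparisons against the median and the throughput-per-dollar figure can be approximated from the displayed values. This is a sketch under the assumption that each percentage is a plain relative difference and that throughput per dollar is speed divided by the blended price; `pct_diff` is a hypothetical helper. The results land within about a point of the page's numbers, which presumably come from unrounded measurements.

```python
def pct_diff(value: float, baseline: float) -> float:
    """Relative difference of value from baseline, in percent."""
    return 100 * (value - baseline) / baseline

# Speed: higher is better, so a positive result means faster than the median.
speed_vs_median = pct_diff(112, 46)

# TTFT: lower is better, so a positive result means slower than the median.
ttft_vs_median = pct_diff(1.02, 0.43)

# Assumption: throughput per dollar = tok/s divided by blended $ per 1M tokens.
throughput_per_dollar = 112 / 0.69

print(f"{speed_vs_median:.1f}% faster, {ttft_vs_median:.1f}% slower, "
      f"{throughput_per_dollar:.1f} tok/s per $/1M")
```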
Context Window: 262K tokens (larger than 75% of models)
Max Output: 66K tokens (25% of context)
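The "25% of context" figure is exact if the rounded values stand for powers of two. That is an assumption (262K as 262,144 and 66K as 65,536), as is the idea that output tokens count against the same window as the prompt. A minimal sketch of the arithmetic:

```python
# Assumption: the displayed 262K and 66K are rounded powers of two.
CONTEXT_TOKENS = 2**18      # 262,144
MAX_OUTPUT_TOKENS = 2**16   # 65,536

output_share = MAX_OUTPUT_TOKENS / CONTEXT_TOKENS

# Assumption: prompt and output share one window, so reserving the full
# output budget leaves this much room for the prompt.
max_prompt_tokens = CONTEXT_TOKENS - MAX_OUTPUT_TOKENS

print(f"{output_share:.0%} of context, {max_prompt_tokens:,} tokens left for the prompt")
```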
Downloads (30d): 1.9M
Likes: 119
License: apache-2.0
Permissive license allowing commercial use, modification, and distribution.
Hardware Requirements
Quantization Available
Check HuggingFace for GGUF, GPTQ, and AWQ quantized versions that require significantly less VRAM.