Quality Index
17.2
227th of 440
Top 52%
Math Index
63.0
110th of 268
Top 41%
Price/1M
$0.27
323rd cheapest
13% below median
Top 48%
Speed
61 tok/s
Top 41%
TTFT
0.47s
Context Window
33K
290th largest
Top 91%
Input
$0.27
per 1M tokens
Output
$0.27
per 1M tokens
Blended
$0.27
per 1M tokens
Cheaper than 52% of models. Median price is $0.31/1M tokens.
Daily
$0.27
Monthly
$8.10
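The daily and monthly figures above are consistent with a workload of 1M tokens per day over a 30-day month, although the page does not state the usage it assumes. The following is a minimal sketch under that assumption; the function names `daily_cost` and `monthly_cost` are hypothetical and not part of any published API.

```python
def daily_cost(price_per_million: float, tokens_per_day: float) -> float:
    """Cost of one day of usage, given a price per 1M tokens."""
    return price_per_million * tokens_per_day / 1_000_000


def monthly_cost(price_per_million: float, tokens_per_day: float, days: int = 30) -> float:
    """Cost of `days` days of usage (30-day month is an assumption)."""
    return daily_cost(price_per_million, tokens_per_day) * days


# Assumed workload: 1M tokens per day at the blended price of $0.27 per 1M tokens.
blended_price = 0.27
tokens_per_day = 1_000_000

print(f"daily ${daily_cost(blended_price, tokens_per_day):.2f}")      # daily $0.27
print(f"monthly ${monthly_cost(blended_price, tokens_per_day):.2f}")  # monthly $8.10
```

Because input and output are priced identically here, the blended price does not depend on the input/output token mix; for a model with different input and output prices, the two would need to be weighted separately.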
61
tokens/sec
Faster than 59% of models
0.47
seconds
Faster than 47% of models
Market Median
46 tok/s
33% faster
Median TTFT
0.43s
10% slower
Throughput/Dollar
226
tok/s per $/1M
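The throughput-per-dollar and "faster than median" figures above can be reproduced from the rounded values shown on the page. This is a sketch that assumes throughput per dollar is simply tokens per second divided by the blended price per 1M tokens, and that the median comparison is a relative difference; the helper names are hypothetical.

```python
def throughput_per_dollar(tokens_per_sec: float, price_per_million: float) -> float:
    """Tokens per second obtained per dollar of price per 1M tokens."""
    return tokens_per_sec / price_per_million


def percent_vs_median(value: float, median: float) -> float:
    """Relative difference from the median, in percent (positive = above median)."""
    return (value - median) / median * 100


speed = 61            # tok/s, from the page
median_speed = 46     # tok/s, market median from the page
blended_price = 0.27  # $ per 1M tokens, from the page

print(round(throughput_per_dollar(speed, blended_price)))  # 226
print(round(percent_vs_median(speed, median_speed)))       # 33
```

Applying the same relative-difference formula to the rounded TTFT values (0.47s against a 0.43s median) gives roughly 9%, so the 10% shown above may have been computed from unrounded measurements.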
Speed Comparison
Context Window
33K
tokens
Larger than 9% of models
Max Output
33K
tokens
100% of context
987.8K
Downloads (30d)
1.5K
Likes
MIT License
Very permissive license with minimal restrictions.
Hardware Requirements
Quantization Available
Check HuggingFace for GGUF, GPTQ, and AWQ quantized versions that require significantly less VRAM.