Loading...
Loading...
Quality Index
17.2
227th of 440
Top 52%
Coding Index
15.6
187th of 350
Top 54%
Math Index
68.3
99th of 268
Top 37%
Price/1M
$1.23
503rd cheapest
295% above median
Top 75%
Speed
77 tok/s
Top 35%
TTFT
1.01s
Context Window
131K
145th largest
Top 63%
Input
$0.70
per 1M tokens
Output
$2.80
per 1M tokens
Blended
$1.23
per 1M tokens
Cheaper than 25% of models. Median price is $0.31/1M tokens.
Daily
$1.23
Monthly
$36.75
77
tokens/sec
Faster than 65% of models
1.01
seconds
Faster than 29% of models
Market Median
46 tok/s
68% faster
Median TTFT
0.43s
135% slower
Throughput/Dollar
63
tok/s per $/1M
Speed Comparison
Context Window
131K
tokens
Larger than 37% of models
Max Output
33K
tokens
25% of context
1.0M
Downloads (30d)
190
Likes
apache-2.0
Permissive license allowing commercial use, modification, and distribution.
Hardware Requirements
Quantization Available
Check HuggingFace for GGUF, GPTQ, and AWQ quantized versions that require significantly less VRAM.