Quality Index: 16.1 (242nd of 440, top 55%)
Coding Index: 14.3 (203rd of 350, top 58%)
Math Index: 72.3 (89th of 268, top 34%)
Price/1M: $0.35 (346th cheapest, 13% above median, top 52%)
Speed: 123 tok/s (top 22%)
TTFT: 0.99s
Context Window: 131K (145th largest, top 63%)
Input: $0.20 per 1M tokens
Output: $0.80 per 1M tokens
Blended: $0.35 per 1M tokens
Cheaper than 48% of models. Median price is $0.31/1M tokens.
Daily: $0.35
Monthly: $10.50
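The blended, daily, and monthly figures above can be reproduced with simple arithmetic. The sketch below is illustrative only: the page does not state its formula, so the 3:1 input-to-output weighting, the 1M tokens per day usage, and the 30-day month are assumptions chosen because they match the displayed numbers. The function names are hypothetical.

```python
def blended_price(input_price, output_price, input_weight=3, output_weight=1):
    """Weighted average price per 1M tokens.

    The 3:1 input:output weighting is an assumption, not stated by the page.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


def monthly_cost(price_per_million, millions_per_day=1.0, days=30):
    """Estimated cost, assuming a fixed daily volume (1M tokens/day by default)."""
    return price_per_million * millions_per_day * days


blended = blended_price(0.20, 0.80)
print(f"Blended: ${blended:.2f} per 1M tokens")
print(f"Monthly: ${monthly_cost(blended):.2f}")
```

Under these assumptions the result agrees with the listed $0.35 blended price and $10.50 monthly estimate. A workload with a different input/output mix or daily volume would need different weights.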
Output speed: 123 tokens/sec (faster than 78% of models)
Time to first token (TTFT): 0.99 seconds (faster than 31% of models)
Market median speed: 46 tok/s (this model is 169% faster)
Median TTFT: 0.43s (this model is 129% slower)
Throughput/Dollar: 352 tok/s per $/1M
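The relative comparisons above can be approximated from the displayed values. This is a rough sketch with hypothetical function names. Because the page shows rounded inputs, the results differ slightly from the listed percentages (for example, roughly 167% rather than 169% faster); the page presumably computes them from unrounded data, which is an assumption.

```python
def percent_difference(value, baseline):
    """How much larger `value` is than `baseline`, as a percentage."""
    return (value / baseline - 1.0) * 100.0


def throughput_per_dollar(tokens_per_second, price_per_million):
    """Tokens per second divided by the blended price per 1M tokens."""
    return tokens_per_second / price_per_million


# Inputs are the rounded figures shown on the page.
print(f"Speed vs. median: {percent_difference(123, 46):.0f}% faster")
print(f"TTFT vs. median: {percent_difference(0.99, 0.43):.0f}% slower")
print(f"Throughput/Dollar: {throughput_per_dollar(123, 0.35):.0f}")
```

Note that for TTFT a larger value means a longer wait, so a positive difference reads as "slower" rather than "faster".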
Context Window: 131K tokens (larger than 37% of models)
Max Output: 33K tokens (25% of context)
Downloads (30d): 3.1M
Likes: 551
License: apache-2.0. Permissive license allowing commercial use, modification, and distribution.
Hardware Requirements
Quantization Available
Check HuggingFace for GGUF, GPTQ, and AWQ quantized versions that require significantly less VRAM.