Quality Index: 33.3 (79th of 440, top 18%)
Coding Index: 28.6 (88th of 350, top 25%)
Math Index: 93.4 (14th of 268, top 5%)
Price/1M: $0.26 (316th cheapest, 15% below median, top 47%)
Speed: 277 tok/s (top 3%)
TTFT: 0.49s
Context Window: 131K (145th largest, top 63%)
Input: $0.15 per 1M tokens
Output: $0.60 per 1M tokens
Blended: $0.26 per 1M tokens
Cheaper than 53% of models. Median price is $0.31/1M tokens.
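The listing does not say how the blended price is derived from the input and output prices. A minimal sketch, assuming a 3:1 input-to-output token weighting (an assumption, not something the page states), reproduces the listed $0.26 and the "15% below median" figure. The function names are illustrative only.

```python
# Prices taken from the listing, in USD per 1M tokens.
INPUT_PRICE = 0.15
OUTPUT_PRICE = 0.60
MEDIAN_PRICE = 0.31


def blended_price(input_price, output_price, input_weight=3, output_weight=1):
    """Weighted average price. The 3:1 default weighting is an assumption."""
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


def percent_below(value, reference):
    """How far `value` sits below `reference`, as a percentage of `reference`."""
    return (reference - value) / reference * 100


blended = blended_price(INPUT_PRICE, OUTPUT_PRICE)
print(f"blended: ${blended:.2f} per 1M tokens")
print(f"below median: {percent_below(blended, MEDIAN_PRICE):.0f}%")
```

A workload with a different input-to-output ratio would see a different effective price, so the weights are worth adjusting to match real traffic.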
Daily: $0.26
Monthly: $7.89
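The page does not state the usage behind these daily and monthly figures. A rough sketch, assuming 1M tokens per day at the listed blended price and a 30-day month (both assumptions), gives a daily cost matching the listing and a monthly cost slightly below the listed $7.89. The gap suggests the page uses an unrounded price or a different month length.

```python
def estimated_cost(tokens_per_day, price_per_million, days=30):
    """Return (daily, monthly) cost in USD for a given daily token volume."""
    daily = tokens_per_day / 1_000_000 * price_per_million
    return daily, daily * days


# Assumed workload: 1M tokens per day at the listed $0.26 per 1M tokens.
daily, monthly = estimated_cost(1_000_000, 0.26)
print(f"daily: ${daily:.2f}, monthly: ${monthly:.2f}")
```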
Speed: 277 tokens/sec (faster than 97% of models)
TTFT: 0.49 seconds (faster than 45% of models)
Market median speed: 46 tok/s (this model is 505% faster)
Median TTFT: 0.43s (this model is 15% slower)
Throughput/Dollar: 1054 tok/s per $/1M
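These comparison figures can be approximately recomputed from the rounded values shown on the page. The sketch below assumes each percentage is a simple relative difference against the median, and that throughput per dollar is speed divided by the blended price; the page confirms neither formula. The results land close to, but not exactly on, the listed 505%, 15%, and 1054, presumably because the page works from unrounded inputs.

```python
def percent_difference(value, reference):
    """Relative difference of `value` against `reference`, in percent."""
    return (value - reference) / reference * 100


# Rounded values as shown in the listing.
speed, median_speed = 277, 46        # tokens per second
ttft, median_ttft = 0.49, 0.43       # seconds to first token
blended_price = 0.26                 # USD per 1M tokens

speed_gain = percent_difference(speed, median_speed)   # how much faster than median
ttft_penalty = percent_difference(ttft, median_ttft)   # how much slower than median
throughput_per_dollar = speed / blended_price          # tok/s per $/1M

print(f"speed vs median: {speed_gain:.0f}% faster")
print(f"TTFT vs median: {ttft_penalty:.0f}% slower")
print(f"throughput per dollar: {throughput_per_dollar:.0f}")
```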
Context Window: 131K tokens (larger than 37% of models)
Downloads (30d): 4.6M
Likes: 4.6K
License: apache-2.0. Permissive license allowing commercial use, modification, and distribution.
Hardware Requirements
Quantization Available
Check HuggingFace for GGUF, GPTQ, and AWQ quantized versions that require significantly less VRAM.