Loading...
Loading...
Price/1M
$0.16
275th cheapest
48% below median
Top 41%
Context Window
131K
145th largest
Top 63%
Input
$0.08
per 1M tokens
Output
$0.40
per 1M tokens
Blended
$0.16
per 1M tokens
Cheaper than 59% of models. Median price is $0.31/1M tokens.
Daily
$0.16
Monthly
$4.80
Context Window
131K
tokens
Larger than 37% of models
Max Output
131K
tokens
100% of context
Context Window Comparison
1.1M
Downloads (30d)
370
Likes
apache-2.0
Permissive license allowing commercial use, modification, and distribution.
Hardware Requirements
Quantization Available
Check HuggingFace for GGUF, GPTQ, and AWQ quantized versions that require significantly less VRAM.