Loading...
Loading...
Price/1M
$0.03
191st cheapest
91% below median
Top 28%
Context Window
16K
324th largest
Top 93%
Input
$0.02
per 1M tokens
Output
$0.05
per 1M tokens
Blended
$0.03
per 1M tokens
Cheaper than 72% of models. Median price is $0.31/1M tokens.
Daily
$0.03
Monthly
$0.82
Context Window
16K
tokens
Larger than 7% of models
Max Output
16K
tokens
100% of context
Context Window Comparison
1.3M
Downloads (30d)
2.1K
Likes
llama3.1
Meta's license allowing commercial use with restrictions for large-scale deployments.
Hardware Requirements
Quantization Available
Check HuggingFace for GGUF, GPTQ, and AWQ quantized versions that require significantly less VRAM.