As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, with stronger coding capabilities, long-horizon task planning, and tool collaboration. It has achieved leading performance among open-source models of the same size on several current public benchmark leaderboards.
Quality Index: 30.1 (102nd of 442, top 23%)
Coding Index: 25.9 (99th of 352, top 28%)
Price/1M: $0.15 (272nd cheapest, 51% below median, top 40%)
Speed: 91 tok/s (top 31%)
TTFT: 0.67s
Context Window: 203K (105th largest, top 30%)
Input: $0.07 per 1M tokens
Output: $0.40 per 1M tokens
Blended: $0.15 per 1M tokens
Cheaper than 60% of models. Median price is $0.31/1M tokens.
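The blended figure can be reproduced from the input and output prices with a weighted average. The sketch below assumes a 3:1 input-to-output token weighting; the page does not state its weighting, so that ratio is an assumption that happens to match the displayed $0.15. The function name `blended_price` is illustrative only.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per 1M tokens.

    The default 3:1 input:output weighting is an assumption,
    not something the page states.
    """
    total_weight = input_weight + output_weight
    return (input_price * input_weight + output_price * output_weight) / total_weight


INPUT_PRICE = 0.07    # $ per 1M input tokens (from the page)
OUTPUT_PRICE = 0.40   # $ per 1M output tokens (from the page)
MEDIAN_PRICE = 0.31   # $ per 1M tokens, market median (from the page)

blended = blended_price(INPUT_PRICE, OUTPUT_PRICE)

# Share by which the blended price sits below the market median.
below_median_pct = (MEDIAN_PRICE - blended) / MEDIAN_PRICE * 100

print(f"Blended: ${blended:.2f} per 1M tokens")
print(f"Below median: {below_median_pct:.0f}%")
```

Under that assumption the unrounded blended price is $0.1525, which rounds to the displayed $0.15 and sits about 51% below the $0.31 median.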
Daily: $0.15
Monthly: $4.56
91 tokens/sec (faster than 69% of models)
0.67 seconds (faster than 38% of models)
22.76 seconds (faster than 10% of models)
Market Median: 46 tok/s (this model is 98% faster)
Median TTFT: 0.42s (this model is 60% slower)
Throughput/Dollar: 595 tok/s per $/1M
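The relative figures above follow from simple ratios against the market medians. The sketch below recomputes them from the rounded values shown on the page; the helper name `percent_difference` is illustrative, and treating Throughput/Dollar as speed divided by the blended price is an assumption based on its unit.

```python
def percent_difference(value: float, baseline: float) -> float:
    """Signed difference of value relative to baseline, in percent."""
    return (value - baseline) / baseline * 100


SPEED = 91.0          # tok/s (from the page)
MEDIAN_SPEED = 46.0   # tok/s, market median (from the page)
TTFT = 0.67           # seconds (from the page)
MEDIAN_TTFT = 0.42    # seconds, market median (from the page)
BLENDED_PRICE = 0.15  # $ per 1M tokens, as displayed (rounded)

# Higher throughput is better, so a positive value means "faster".
speed_gain_pct = percent_difference(SPEED, MEDIAN_SPEED)

# Higher TTFT is worse, so a positive value means "slower".
ttft_penalty_pct = percent_difference(TTFT, MEDIAN_TTFT)

# Assumed definition: tokens per second per dollar of blended price.
throughput_per_dollar = SPEED / BLENDED_PRICE

print(f"Speed vs. median: {speed_gain_pct:.0f}% faster")
print(f"TTFT vs. median: {ttft_penalty_pct:.0f}% slower")
print(f"Throughput/Dollar: {throughput_per_dollar:.0f} tok/s per $/1M")
```

The speed and TTFT ratios reproduce the 98% and 60% figures. The throughput-per-dollar result from rounded inputs comes out slightly above the displayed 595, which suggests the page divides unrounded speed and price values; that is an inference, not something the page states.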
Context Window: 203K tokens (larger than 70% of models)