About

Grok 4 Fast is xAI's latest multimodal model with SOTA cost-efficiency and a 2M token context window. It comes in two flavors: non-reasoning and reasoning. Read more about the model on xAI's [news post](http://x.ai/news/grok-4-fast). Reasoning can be enabled/disabled using the `reasoning` `enabled` parameter in the API. [Learn more in our docs](https://openrouter.ai/docs/use-cases/reasoning-tokens#controlling-reasoning-tokens)

Model Family

xAI: Grok 4.20 Beta2026-03-12 xAI: Grok 4.20 Multi-Agent Beta2026-03-12 Grok 4.20 Beta 0309 (Reasoning)2026-03-10 Grok 4.20 Beta 0309 (Non-reasoning)2026-03-10 Grok Voice Agent2025-12-17 Grok 4.1 Fast (Reasoning)2025-11-19 Grok 4.1 Fast (Non-reasoning)2025-11-19 Grok 4 Fast (Reasoning)2025-09-19

Benchmarks

MMLU-Pro

73.0%

GPQA Diamond

60.6%

HLE

5.0%

LiveCodeBench

40.1%

SciCode

32.9%

TerminalBench Hard

12.1%

MATH-500Not evaluated

AIMENot evaluated

AIME 2025

41.3%

IFBench

37.7%

Long Context Recall

20.0%

Tau2

63.7%

Market AverageTop Score

Grok 4 Fast (Non-reasoning)

About

Model Family

Market Position

Pricing

Cost Calculator

vs. Similar Models

Performance

Benchmarks

Quick Compare

Similar Models