About

Kimi K2 Thinking is Moonshot AI’s most advanced open reasoning model to date, extending the K2 series into agentic, long-horizon reasoning. Built on the trillion-parameter Mixture-of-Experts (MoE) architecture introduced in Kimi K2, it activates 32 billion parameters per forward pass and supports 256 k-token context windows. The model is optimized for persistent step-by-step thought, dynamic tool invocation, and complex reasoning workflows that span hundreds of turns. It interleaves step-by-step reasoning with tool use, enabling autonomous research, coding, and writing that can persist for hundreds of sequential actions without drift. It sets new open-source benchmarks on HLE, BrowseComp, SWE-Multilingual, and LiveCodeBench, while maintaining stable multi-agent behavior through 200–300 tool calls. Built on a large-scale MoE architecture with MuonClip optimization, it combines strong reasoning depth with high inference efficiency for demanding agentic and analytical tasks.

Model Family

Kimi K2.5 (Reasoning)2026-01-27 Kimi K2.5 (Non-reasoning)2026-01-27 Kimi Linear 48B A3B Instruct2025-10-30 Kimi K2 09052025-09-05 Kimi K22025-07-11 Kimi VL A3B Instruct2025-04-09

Benchmarks

MMLU-Pro

84.8%

GPQA Diamond

83.8%

HLE

22.3%

LiveCodeBench

85.3%

SciCode

42.4%

TerminalBench Hard

31.1%

MATH-500Not evaluated

AIMENot evaluated

AIME 2025

94.7%

IFBench

68.1%

Long Context Recall

66.3%

Tau2

93.0%

Market AverageTop Score

Kimi K2 Thinking

About

Model Family

Market Position

Pricing

Cost Calculator

vs. Similar Models

Performance

Benchmarks

Open Source

Quick Compare

Similar Models