gpt-oss-20b is an open-weight 21B parameter model released by OpenAI under the Apache 2.0 license. It uses a Mixture-of-Experts (MoE) architecture with 3.6B active parameters per forward pass, optimized for lower-latency inference and deployability on consumer or single-GPU hardware. The model is trained in OpenAI’s Harmony response format and supports reasoning level configuration, fine-tuning, and agentic capabilities including function calling, tool use, and structured outputs.
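Since the model expects prompts in the Harmony format, a minimal sketch of that layout may help; this reproduces the role/channel/special-token conventions from the openai-harmony documentation and should be checked against the official spec before use:

```
<|start|>system<|message|>You are a helpful assistant.
Reasoning: high<|end|>
<|start|>user<|message|>What is 2 + 2?<|end|>
<|start|>assistant<|channel|>analysis<|message|>Trivial arithmetic.<|end|>
<|start|>assistant<|channel|>final<|message|>4<|return|>
```

Each turn is wrapped in `<|start|>…<|end|>` tokens; assistant output is routed through channels (`analysis` for chain-of-thought, `final` for the user-visible answer), and the reasoning level is set in the system message.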
Quality Index: 24.5 (144th of 442, Top 33%)
Coding Index: 18.5 (158th of 352, Top 45%)
Math Index: 89.3 (29th of 268, Top 12%)
Price/1M: $0.09 (222nd cheapest, 70% below median, Top 33%)
Speed: 304 tok/s (Top 2%)
TTFT: 0.45s
Context Window: 131K (145th largest, Top 63%)
Pricing
Input: $0.06 per 1M tokens
Output: $0.20 per 1M tokens
Blended: $0.09 per 1M tokens
Cheaper than 67% of models; median price is $0.31/1M tokens.
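The blended figure is consistent with the common 3:1 input:output token weighting used for blended-price listings; a minimal sketch (the 3:1 mix is an assumption, not stated on the page):

```python
# Blended price per 1M tokens, assuming a 3:1 input:output token mix
# (a common convention for blended-price listings; the weighting is
# an assumption, not confirmed by the page).
input_price = 0.06   # $ per 1M input tokens
output_price = 0.20  # $ per 1M output tokens

blended = (3 * input_price + 1 * output_price) / 4
print(f"${blended:.3f} per 1M tokens")  # $0.095, matching the listed $0.09 after rounding
```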
Estimated Cost
Daily: $0.09
Monthly: $2.82
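The daily and monthly figures are consistent with roughly 1M blended tokens per day priced at an unrounded blended rate near $0.094/1M; both quantities are assumptions chosen to reproduce the listed numbers:

```python
# Rough reconstruction of the daily/monthly cost figures. Assumptions:
# ~1M blended tokens per day, and an unrounded blended rate of about
# $0.094 per 1M tokens (neither is stated on the page).
blended_rate = 0.094   # $ per 1M tokens (assumption)
daily_tokens_m = 1.0   # millions of blended tokens per day (assumption)

daily = blended_rate * daily_tokens_m
monthly = daily * 30

print(round(daily, 2), round(monthly, 2))  # 0.09 2.82
```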
Speed
Throughput: 304 tokens/sec (faster than 98% of models)
Time to first token: 0.45 seconds (faster than 48% of models)
Response time: 7.02 seconds (faster than 23% of models)
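The 7.02 s figure is consistent with a simple latency model, TTFT plus output tokens divided by throughput, for a roughly 2,000-token response; the workload length is an assumption chosen to match, not stated by the page:

```python
# End-to-end latency model: TTFT + output_tokens / throughput.
# The ~2,000-token output length is an assumption that reproduces the
# listed 7.02 s; the page does not state the benchmark workload.
ttft = 0.45          # seconds to first token
throughput = 304.0   # tokens per second
output_tokens = 2000

total = ttft + output_tokens / throughput
print(f"{total:.2f} s")  # 7.03 s, within rounding of the listed 7.02 s
```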
Speed Comparison
Market Median: 46 tok/s (this model is 567% faster)
Median TTFT: 0.42s (this model is 6% slower)
Throughput/Dollar: 3238 tok/s per $/1M
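The throughput-per-dollar ratio is simply throughput divided by the blended price; the listed 3238 appears to use an unrounded blended price near $0.094 rather than the displayed $0.09 (the unrounded rate is an assumption):

```python
# Throughput per dollar: tokens/sec divided by blended $ per 1M tokens.
# Using the displayed $0.09 would give ~3378, so the listed 3238 was
# likely computed from an unrounded blended price (assumed ~$0.094).
throughput = 304.0
blended = 0.094  # unrounded blended price per 1M tokens (assumption)

print(round(throughput / blended))  # 3234, close to the listed 3238
```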
Context Window: 131K tokens (larger than 37% of models)
Max Output: 131K tokens (100% of context)
7.1M
4.5K
Memory: 24-48 GB (A6000 / M3 Ultra)
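The 24-48 GB hardware band is consistent with a weights-only estimate at 8- to 16-bit precision for 21B parameters; a rough sketch (the precision choices are assumptions, and real deployments also need KV-cache and activation headroom):

```python
# Back-of-envelope weight-memory estimate for a 21B-parameter model at
# common precisions. KV cache and activation overhead are not included;
# these are rough figures, not vendor-published requirements.
PARAMS = 21e9

estimates = {}
for name, bytes_per_param in [("bf16", 2.0), ("int8", 1.0), ("mxfp4", 0.5)]:
    estimates[name] = PARAMS * bytes_per_param / 1e9  # gigabytes
    print(f"{name}: ~{estimates[name]:.1f} GB")
```

At bf16 the weights alone are ~42 GB (the top of the listed band), while 4-bit quantization (the MXFP4 format gpt-oss ships in) brings them near ~10.5 GB, which is why single-GPU deployment is feasible.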