BestLLMfor EN Your hardware. Your LLM. Your call.
APIOpen data Find my LLM
Editorial ranking · 2026

Best local LLM for radeon rx 7900 xt

Top 7 open-source picks for radeon rx 7900 xt, ranked by benchmark performance and real-world fit. Updated monthly.

#1

gpt-oss 20B

21B · OpenAI · Apache 2.0

OpenAI's compact open-weight MoE with 3.6B active out of 21B total parameters. Matches o3-mini on a laptop-class GPU under Apache 2.0.

VRAM Q4: 13 GB · Context: 125k
Read full fiche →
#2

ERNIE 4.5 21B-A3B Thinking

21B · Baidu · Apache 2.0

Baidu's compact reasoning MoE with 3B active parameters out of 21B total. Fast inference thanks to the small active set, with Chinese-language strength.

VRAM Q4: 13 GB · Context: 128k
Read full fiche →
#3

LLaDA 2.0 Uni 16B

16B · Ant Group / inclusionAI · Apache 2.0

Ant Group's first open Apache 2.0 diffusion LLM: a 16B/1B MoE paired with a 6.2B diffusion decoder, unifying text and vision generation and editing. Released April 2026.

VRAM Q4: 18 GB · Context: 8k
Read full fiche →
#4

Mistral Small 3

24B · Mistral AI · Apache 2.0

Mistral AI's 24B dense model that closes most of the gap with 70B-class models. Best quality-per-parameter we've measured at this size in 2025.

VRAM Q4: 14 GB · Context: 32k
Read full fiche →
#5

Mistral Small 3.1 24B

24B · Mistral AI · Apache 2.0

Mistral AI's Small 3.1 — Small 3 plus a vision encoder, a 128k context, and ~150 tok/s inference under Apache 2.0. Small 3.2 (June 2025) is a drop-in upgrade.

VRAM Q4: 14 GB · Context: 125k
Read full fiche →
#6

Devstral Small 2 24B

24B · Mistral AI · Apache 2.0

Mistral AI's 24B coding specialist co-developed with All Hands AI, scoring 72.2% on SWE-Bench under Apache 2.0. Fits on a single RTX 4090.

VRAM Q4: 14 GB · Context: 250k
Read full fiche →
#7

Mistral Small 3.2 24B

24B · Mistral AI · Apache 2.0

Mistral AI's June 2025 refresh of Small 3.1: a 24B Apache 2.0 dense model with vision input, sharper function calling, and roughly half the rate of runaway generations seen in 3.1.

VRAM Q4: 14 GB · Context: 125k
Read full fiche →