BestLLMfor EN Your hardware. Your LLM. Your call.
APIOpen data Find my LLM
Editorial ranking · 2026

Best local LLM for rtx 3080

Top 7 open-source picks for rtx 3080, ranked by benchmark performance and real-world fit. Updated monthly.

#1

Qwen 2.5 VL 7B

7B · Alibaba · Apache 2.0

A 7B vision-language model from Alibaba with state-of-the-art results in its class, scoring 95.7 on DocVQA. Handles hour-long video, bounding-box grounding, and multilingual OCR.

VRAM Q4: 6 GB · Context: 125k
Read full fiche →
#2

Qwen 2.5 Omni 7B

7B · Alibaba · Apache 2.0

Alibaba's first true omni-modal open model — text, image, audio, and video in, with text and speech out. A research-grade preview rather than a production-ready release.

VRAM Q4: 6 GB · Context: 32k
Read full fiche →
#3

Qwen 3.5 9B

9B · Alibaba · Apache 2.0

Alibaba's next-generation dense 9B model with a 262K native context window and an improved toggleable thinking mode. Apache 2.0 licensed.

VRAM Q4: 6 GB · Context: 255k
Read full fiche →
#4

Qwen 3 VL 8B

8B · Alibaba · Apache 2.0

The dense 8B entry in Qwen 3 VL, offering strong OCR and document analysis with a remarkable 256k multimodal context for its size.

VRAM Q4: 6 GB · Context: 256k
Read full fiche →
#5

Apertus 8B

8B · Swiss AI · Apache 2.0

The compact Swiss AI release trained on the Alps supercomputer, covering 1000+ languages including Swiss German and Romansh. Apache 2.0.

VRAM Q4: 6 GB · Context: 64k
Read full fiche →
#6

InternVL 3.5 8B

8B · OpenGVLab · Apache 2.0

OpenGVLab's 8B vision-language model leading MMMU among open models. Built at Shanghai AI Lab and released under Apache 2.0.

VRAM Q4: 6 GB · Context: 32k
Read full fiche →
#7

Granite 4.0 H-Tiny 7B-A1B

7B · IBM · Apache 2.0

IBM's edge-class hybrid MoE with 7B total and only 1B active parameters — Apache 2.0 licensed and built for embedded and low-cost serving.

VRAM Q4: 4 GB · Context: 125k
Read full fiche →