Best local LLM for rtx 2080 ti
Top 7 open-source picks for rtx 2080 ti, ranked by benchmark performance and real-world fit. Updated monthly.
Qwen 3 14B
A 14B dense model from Alibaba that matches Qwen 2.5 32B Base on STEM and code, with the same hybrid thinking system as the rest of the Qwen 3 family. The pragmatic sweet spot for a single 24GB GPU.
Phi-4 Reasoning 14B
Microsoft's 14B reasoner that beats R1-Distill-Llama-70B on AIME and GPQA with 50x fewer parameters. MIT-licensed, English-first, with a 32K context.
DeepSeek R1 Distill Qwen 14B
DeepSeek's R1 reasoning distilled into Qwen 14B under MIT. AIME24 69.7 and MATH-500 93.9 — beats o1-mini on most reasoning benchmarks.
Qwen 2.5 VL 7B
A 7B vision-language model from Alibaba with state-of-the-art results in its class, scoring 95.7 on DocVQA. Handles hour-long video, bounding-box grounding, and multilingual OCR.
Qwen 2.5 Omni 7B
Alibaba's first true omni-modal open model — text, image, audio, and video in, with text and speech out. A research-grade preview rather than a production-ready release.
Qwen 3.5 9B
Alibaba's next-generation dense 9B model with a 262K native context window and an improved toggleable thinking mode. Apache 2.0 licensed.
Qwen 3 VL 8B
The dense 8B entry in Qwen 3 VL, offering strong OCR and document analysis with a remarkable 256k multimodal context for its size.