Best local LLM for rtx 3080
Top 7 open-source picks for rtx 3080, ranked by benchmark performance and real-world fit. Updated monthly.
Qwen 2.5 VL 7B
A 7B vision-language model from Alibaba with state-of-the-art results in its class, scoring 95.7 on DocVQA. Handles hour-long video, bounding-box grounding, and multilingual OCR.
Qwen 2.5 Omni 7B
Alibaba's first true omni-modal open model — text, image, audio, and video in, with text and speech out. A research-grade preview rather than a production-ready release.
Qwen 3.5 9B
Alibaba's next-generation dense 9B model with a 262K native context window and an improved toggleable thinking mode. Apache 2.0 licensed.
Qwen 3 VL 8B
The dense 8B entry in Qwen 3 VL, offering strong OCR and document analysis with a remarkable 256k multimodal context for its size.
Apertus 8B
The compact Swiss AI release trained on the Alps supercomputer, covering 1000+ languages including Swiss German and Romansh. Apache 2.0.
InternVL 3.5 8B
OpenGVLab's 8B vision-language model leading MMMU among open models. Built at Shanghai AI Lab and released under Apache 2.0.
Granite 4.0 H-Tiny 7B-A1B
IBM's edge-class hybrid MoE with 7B total and only 1B active parameters — Apache 2.0 licensed and built for embedded and low-cost serving.