BestLLMfor EN Your hardware. Your LLM. Your call.
APIOpen data Find my LLM
Model fiche

dots.llm1 Instruct

By Rednote · China

chat general moe
Parameters
142B
License
MIT
Context
32k
VRAM (Q4)
85 GB
Released
April 2025

Overview

Xiaohongshu's first LLM under the Rednote brand — a 142B MoE with 14B active params trained without synthetic data, matching Qwen2.5-72B. Released under MIT.

When to pick this model

  • Creative and lifestyle content generation
  • Chinese-language social and consumer-facing products
  • Research on training without synthetic data
  • MIT-licensed alternative to Qwen for content-heavy use cases
  • Workloads where natural, non-generic prose matters

VRAM requirements by quantization

QuantizationVRAM required
Q4_K_M (recommended)85 GB
Q5_K_M102 GB
Q8_0152 GB
FP16 (no quantization)284 GB

VRAM figures include model weights plus a typical 8k KV cache and ~600 MB runtime overhead (Ollama / llama.cpp baseline). Add headroom for higher context lengths.

Strengths

  • 14B active params in a 142B MoE — efficient inference
  • MIT license
  • Strong creative and lifestyle content generation
  • No synthetic data in training — more natural outputs

Limitations

  • Roughly 85 GB VRAM in Q4 — multi-GPU territory
  • 32k context lags modern flagships
  • Output style optimized for Chinese social media — may not fit Western tone

Architecture & training

Architecture: MoE · 142B total / 14B active · Rednote (Xiaohongshu) · 32k ctx

Training: Rednote — strong in creative generation and lifestyle content.

Verdict

An MIT-licensed alternative for creative Chinese content; outside that niche, Qwen3 is the safer pick.

Quick start

ollama pull hf.co/rednote/dots-llm1-GGUF

Or use the open-source MCP server to query this model from Claude Desktop, Cursor, or any MCP-compatible client.

Tools

Is dots.llm1 Instruct the right pick for you?

Compute self-hosted ROI → Back to catalog