Model fiche
Ring-1T
By Ant Group · China
reasoning
moe
Overview
Ant Group's MIT-licensed open reasoner: 1T total parameters with 50B active, using a novel ring-all-reduce MoE architecture. Top of the open-reasoning leaderboards.
When to pick this model
- Datacenter-scale reasoning workloads
- Research into novel MoE architectures
- Frontier benchmarking against closed reasoners
- Long-context reasoning up to 131k tokens
VRAM requirements by quantization
| Quantization | VRAM required |
|---|---|
| Q4_K_M (recommended) | 600 GB |
| Q5_K_M | 720 GB |
| Q8_0 | 1080 GB |
| FP16 (no quantization) | 2000 GB |
VRAM figures include model weights plus a typical 8k KV cache and ~600 MB runtime overhead (Ollama / llama.cpp baseline). Add headroom for higher context lengths.
Strengths
- First trillion-parameter Chinese open-weight model
- MIT license with full commercial freedom
- Original ring-MoE all-reduce architecture
- 131k context window
Limitations
- Around 600 GB VRAM at Q4 — datacenter only
- Commercial licensing for downstream use is complex
- Operationally heavy to deploy and tune
Architecture & training
Architecture: MoE ring-all-reduce · 1T total · Ant Group · 131k context
Training: Ant Group — first Chinese trillion-parameter open-weight model, ring-MoE architecture.
Verdict
A frontier open reasoner with a permissive license — practical only for teams running real datacenter infrastructure.
Quick start
# Infrastructure data-center requise — non disponible en local standardOr use the open-source MCP server to query this model from Claude Desktop, Cursor, or any MCP-compatible client.