BestLLMfor EN Your hardware. Your LLM. Your call.
APIOpen data Find my LLM
Model fiche

Aya 23 8B

By Cohere For AI · Canada

chat general multilingual
Parameters
8B
License
CC-BY-NC 4.0
Context
8k
VRAM (Q4)
5 GB
Released
May 2024

Overview

Cohere For AI's pre-Expanse 8B multilingual model covering 23 languages, now superseded by Aya Expanse 8B and restricted to non-commercial use.

When to pick this model

  • Reproducing pre-Expanse multilingual baselines
  • Research comparisons of Cohere multilingual lineage
  • Non-commercial multilingual prototypes
  • Teaching examples for multilingual instruction tuning

VRAM requirements by quantization

QuantizationVRAM required
Q4_K_M (recommended)5 GB
Q5_K_M6 GB
Q8_09 GB
FP16 (no quantization)16 GB

VRAM figures include model weights plus a typical 8k KV cache and ~600 MB runtime overhead (Ollama / llama.cpp baseline). Add headroom for higher context lengths.

Strengths

  • Solid pre-Expanse multilingual coverage
  • 23 languages including French, Arabic, and Chinese
  • Compact 8B footprint for the breadth

Limitations

  • CC-BY-NC 4.0 license blocks commercial use
  • 8k context is restrictive by current standards
  • Superseded by Aya Expanse 8B in every dimension

Architecture & training

Architecture: Dense 8B ยท IFT on Aya Collection

Training: 23 languages.

Verdict

Skip for new work โ€” Aya Expanse 8B replaces this model with better quality under the same license.

Quick start

ollama run aya:8b

Or use the open-source MCP server to query this model from Claude Desktop, Cursor, or any MCP-compatible client.

Tools

Is Aya 23 8B the right pick for you?

Compute self-hosted ROI → Back to catalog