Skip to content

Models

We retain the right to change models so that we can provide models that suit your applications and our infrastructure best. You can, however, use one of the following model aliases for more reliable applications:

  • alias-ha (general model, high availability)
  • alias-huge (most powerful model)
  • alias-code (coding support)
  • alias-reasoning
  • alias-image-generation (to generate images from text)
  • alias-vision (to ask questions about images)

Most models also fall back to other models.

The status page lists all models and their current status.

Context Length

Here are the maximum context lengths for each chat model:

Model name Max tokens
google/gemma-4-31B-it 200704
meta-llama/Llama-3.1-8B-Instruct 32768
meta-llama/Llama-3.3-70B-Instruct 40000
MiniMaxAI/MiniMax-M2.5 196608
moonshotai/Kimi-K2.6 262144
openai/gpt-oss-120b 131072
openGPT-X/Teuken-7B-instruct-v0.6 4096
Qwen/Qwen3-Coder-30B-A3B-Instruct 128000
Qwen/Qwen3-VL-8B-Instruct 131072