Free Models

nvidia
Free/1M

NVIDIA: Nemotron 3.5 Content Safety (free)

NVIDIA Nemotron 3.5 Content Safety is a compact 4B-parameter multimodal guardrail model fr...

πŸ“ 128,000 ctx Compare →
nvidia
Free/1M

NVIDIA: Nemotron 3 Ultra (free)

NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA,...

πŸ“ 1,000,000 ctx Compare →
openrouter
Free/1M

Owl Alpha

Owl Alpha is a high-performance foundation model designed for agentic workloads. Natively ...

πŸ“ 1,048,756 ctx Compare →
nvidia
Free/1M

NVIDIA: Nemotron 3 Nano Omni (free)

NVIDIA Nemotronβ„’ 3 Nano Omni is a 30B-A3B open multimodal model designed to function as ...

πŸ“ 256,000 ctx Compare →
poolside
Free/1M

Poolside: Laguna XS.2 (free)

Laguna XS.2 is the second-generation model in the XS size class from [Poolside](https://po...

πŸ“ 262,144 ctx Compare →
poolside
Free/1M

Poolside: Laguna M.1 (free)

Laguna M.1 is the flagship coding agent model from [Poolside](https://poolside.ai), optimi...

πŸ“ 262,144 ctx Compare →
moonshotai
Free/1M

MoonshotAI: Kimi K2.6 (free)

Kimi K2.6 is Moonshot AI's next-generation multimodal model, designed for long-horizon cod...

πŸ“ 262,144 ctx Compare →
google
Free/1M

Google: Gemma 4 26B A4B (free)

Gemma 4 26B A4B IT is an instruction-tuned Mixture-of-Experts (MoE) model from Google Deep...

πŸ“ 262,144 ctx Compare →
google
Free/1M

Google: Gemma 4 31B (free)

Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and...

πŸ“ 262,144 ctx Compare →
google
Free/1M

Google: Lyria 3 Pro Preview

Full-length songs are priced at $0.08 per song. Lyria 3 is Google's family of music genera...

πŸ“ 1,048,576 ctx Compare →
google
Free/1M

Google: Lyria 3 Clip Preview

30 second duration clips are priced at $0.04 per clip. Lyria 3 is Google's family of music...

πŸ“ 1,048,576 ctx Compare →
nvidia
Free/1M

NVIDIA: Nemotron 3 Super (free)

NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model, activating just 12B par...

πŸ“ 1,000,000 ctx Compare →
openrouter
Free/1M

Free Models Router

The simplest way to get free inference. openrouter/free is a router that selects free mode...

πŸ“ 200,000 ctx Compare →
liquid
Free/1M

LiquidAI: LFM2.5-1.2B-Thinking (free)

LFM2.5-1.2B-Thinking is a lightweight reasoning-focused model optimized for agentic tasks,...

πŸ“ 32,768 ctx Compare →
liquid
Free/1M

LiquidAI: LFM2.5-1.2B-Instruct (free)

LFM2.5-1.2B-Instruct is a compact, high-performance instruction-tuned model built for fast...

πŸ“ 32,768 ctx Compare →
nvidia
Free/1M

NVIDIA: Nemotron 3 Nano 30B A3B (free)

NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficien...

πŸ“ 256,000 ctx Compare →
nvidia
Free/1M

NVIDIA: Nemotron Nano 12B 2 VL (free)

NVIDIA Nemotron Nano 2 VL is a 12-billion-parameter open multimodal reasoning model design...

πŸ“ 128,000 ctx Compare →
qwen
Free/1M

Qwen: Qwen3 Next 80B A3B Instruct (free)

Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series op...

πŸ“ 262,144 ctx Compare →
nvidia
Free/1M

NVIDIA: Nemotron Nano 9B V2 (free)

NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA,...

πŸ“ 128,000 ctx Compare →
openai
Free/1M

OpenAI: gpt-oss-120b (free)

gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model fro...

πŸ“ 131,072 ctx Compare →
openai
Free/1M

OpenAI: gpt-oss-20b (free)

gpt-oss-20b is an open-weight 21B parameter model released by OpenAI under the Apache 2.0 ...

πŸ“ 131,072 ctx Compare →
z-ai
Free/1M

Z.ai: GLM 4.5 Air (free)

GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-b...

πŸ“ 131,072 ctx Compare →
qwen
Free/1M

Qwen: Qwen3 Coder 480B A35B (free)

Qwen3-Coder-480B-A35B-Instruct is a Mixture-of-Experts (MoE) code generation model develop...

πŸ“ 1,048,576 ctx Compare →
cognitivecomputations
Free/1M

Venice: Uncensored (free)

Venice Uncensored Dolphin Mistral 24B Venice Edition is a fine-tuned variant of Mistral-Sm...

πŸ“ 32,768 ctx Compare →
meta-llama
Free/1M

Meta: Llama 3.3 70B Instruct (free)

The Meta Llama 3.3 multilingual large language model (LLM) is a pretrained and instruction...

πŸ“ 131,072 ctx Compare →
meta-llama
Free/1M

Meta: Llama 3.2 3B Instruct (free)

Llama 3.2 3B is a 3-billion-parameter multilingual large language model, optimized for adv...

πŸ“ 131,072 ctx Compare →
nousresearch
Free/1M

Nous: Hermes 3 405B Instruct (free)

Hermes 3 is a generalist language model with many improvements over Hermes 2, including ad...

πŸ“ 131,072 ctx Compare →