Free Models

liquid
Free/1M

LiquidAI: LFM2.5-1.2B-Thinking (free)

LFM2.5-1.2B-Thinking is a lightweight reasoning-focused model optimized for agentic tasks,...

📝 32,768 ctx Compare →
liquid
Free/1M

LiquidAI: LFM2.5-1.2B-Instruct (free)

LFM2.5-1.2B-Instruct is a compact, high-performance instruction-tuned model built for fast...

📝 32,768 ctx Compare →
allenai
Free/1M

AllenAI: Molmo2 8B (free)

Molmo2-8B is an open vision-language model developed by the Allen Institute for AI (Ai2) a...

📝 36,864 ctx Compare →
nvidia
Free/1M

NVIDIA: Nemotron 3 Nano 30B A3B (free)

NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficien...

📝 256,000 ctx Compare →
mistralai
Free/1M

Mistral: Devstral 2 2512 (free)

Devstral 2 is a state-of-the-art open-source model by Mistral AI specializing in agentic c...

📝 262,144 ctx Compare →
arcee-ai
Free/1M

Arcee AI: Trinity Mini (free)

Trinity Mini is a 26B-parameter (3B active) sparse mixture-of-experts language model featu...

📝 131,072 ctx Compare →
tngtech
Free/1M

TNG: R1T Chimera (free)

TNG-R1T-Chimera is an experimental LLM with a faible for creative storytelling and charact...

📝 163,840 ctx Compare →
nvidia
Free/1M

NVIDIA: Nemotron Nano 12B 2 VL (free)

NVIDIA Nemotron Nano 2 VL is a 12-billion-parameter open multimodal reasoning model design...

📝 128,000 ctx Compare →
qwen
Free/1M

Qwen: Qwen3 Next 80B A3B Instruct (free)

Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series op...

📝 262,144 ctx Compare →
nvidia
Free/1M

NVIDIA: Nemotron Nano 9B V2 (free)

NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA,...

📝 128,000 ctx Compare →
openai
Free/1M

OpenAI: gpt-oss-120b (free)

gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model fro...

📝 131,072 ctx Compare →
openai
Free/1M

OpenAI: gpt-oss-20b (free)

gpt-oss-20b is an open-weight 21B parameter model released by OpenAI under the Apache 2.0 ...

📝 131,072 ctx Compare →
z-ai
Free/1M

Z.AI: GLM 4.5 Air (free)

GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-b...

📝 131,072 ctx Compare →
qwen
Free/1M

Qwen: Qwen3 Coder 480B A35B (free)

Qwen3-Coder-480B-A35B-Instruct is a Mixture-of-Experts (MoE) code generation model develop...

📝 262,000 ctx Compare →
moonshotai
Free/1M

MoonshotAI: Kimi K2 0711 (free)

Kimi K2 Instruct is a large-scale Mixture-of-Experts (MoE) language model developed by Moo...

📝 32,768 ctx Compare →
cognitivecomputations
Free/1M

Venice: Uncensored (free)

Venice Uncensored Dolphin Mistral 24B Venice Edition is a fine-tuned variant of Mistral-Sm...

📝 32,768 ctx Compare →
google
Free/1M

Google: Gemma 3n 2B (free)

Gemma 3n E2B IT is a multimodal, instruction-tuned model developed by Google DeepMind, des...

📝 8,192 ctx Compare →
tngtech
Free/1M

TNG: DeepSeek R1T2 Chimera (free)

DeepSeek-TNG-R1T2-Chimera is the second-generation Chimera model from TNG Tech. It is a 67...

📝 163,840 ctx Compare →
deepseek
Free/1M

DeepSeek: R1 0528 (free)

May 28th update to the [original DeepSeek R1](/deepseek/deepseek-r1) Performance on par wi...

📝 163,840 ctx Compare →
google
Free/1M

Google: Gemma 3n 4B (free)

Gemma 3n E4B-it is optimized for efficient execution on mobile and low-resource devices, s...

📝 8,192 ctx Compare →
qwen
Free/1M

Qwen: Qwen3 4B (free)

Qwen3-4B is a 4 billion parameter dense language model from the Qwen3 series, designed to ...

📝 40,960 ctx Compare →
tngtech
Free/1M

TNG: DeepSeek R1T Chimera (free)

DeepSeek-R1T-Chimera is created by merging DeepSeek-R1 and DeepSeek-V3 (0324), combining t...

📝 163,840 ctx Compare →
mistralai
Free/1M

Mistral: Mistral Small 3.1 24B (free)

Mistral Small 3.1 24B Instruct is an upgraded variant of Mistral Small 3 (2501), featuring...

📝 128,000 ctx Compare →
google
Free/1M

Google: Gemma 3 4B (free)

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It ha...

📝 32,768 ctx Compare →
google
Free/1M

Google: Gemma 3 12B (free)

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It ha...

📝 32,768 ctx Compare →
google
Free/1M

Google: Gemma 3 27B (free)

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It ha...

📝 131,072 ctx Compare →
google
Free/1M

Google: Gemini 2.0 Flash Experimental (free)

Gemini Flash 2.0 offers a significantly faster time to first token (TTFT) compared to [Gem...

📝 1,048,576 ctx Compare →
meta-llama
Free/1M

Meta: Llama 3.3 70B Instruct (free)

The Meta Llama 3.3 multilingual large language model (LLM) is a pretrained and instruction...

📝 131,072 ctx Compare →
meta-llama
Free/1M

Meta: Llama 3.2 3B Instruct (free)

Llama 3.2 3B is a 3-billion-parameter multilingual large language model, optimized for adv...

📝 131,072 ctx Compare →
qwen
Free/1M

Qwen: Qwen2.5-VL 7B Instruct (free)

Qwen2.5 VL 7B is a multimodal LLM from the Qwen Team with the following key enhancements: ...

📝 32,768 ctx Compare →
nousresearch
Free/1M

Nous: Hermes 3 405B Instruct (free)

Hermes 3 is a generalist language model with many improvements over Hermes 2, including ad...

📝 131,072 ctx Compare →
meta-llama
Free/1M

Meta: Llama 3.1 405B Instruct (free)

The highly anticipated 400B class of Llama3 is here! Clocking in at 128k context with impr...

📝 131,072 ctx Compare →