Reasoning Models

liquid
Free/1M

LiquidAI: LFM2.5-1.2B-Thinking (free)

LFM2.5-1.2B-Thinking is a lightweight reasoning-focused model optimized for agentic tasks,...

πŸ“ 32,768 ctx Compare →
openai
$1.75/1M

OpenAI: GPT-5.2-Codex

GPT-5.2-Codex is an upgraded version of GPT-5.1-Codex optimized for software engineering a...

πŸ“ 400,000 ctx Compare →
allenai
$0.20/1M

AllenAI: Olmo 3.1 32B Instruct

Olmo 3.1 32B Instruct is a large-scale, 32-billion-parameter instruction-tuned language mo...

πŸ“ 65,536 ctx Compare →
minimax
$0.27/1M

MiniMax: MiniMax M2.1

MiniMax-M2.1 is a lightweight, state-of-the-art large language model optimized for coding,...

πŸ“ 196,608 ctx Compare →
z-ai
$0.40/1M

Z.AI: GLM 4.7

GLM-4.7 is Z.AI’s latest flagship model, featuring upgrades in two key areas: enhanced p...

πŸ“ 202,752 ctx Compare →
google
$0.50/1M

Google: Gemini 3 Flash Preview

Gemini 3 Flash Preview is a high speed, high value thinking model designed for agentic wor...

πŸ“ 1,048,576 ctx Compare →
allenai
$0.15/1M

AllenAI: Olmo 3.1 32B Think

Olmo 3.1 32B Think is a large-scale, 32-billion-parameter model designed for deep reasonin...

πŸ“ 65,536 ctx Compare →
xiaomi
$0.09/1M

Xiaomi: MiMo-V2-Flash

MiMo-V2-Flash is an open-source foundation language model developed by Xiaomi. It is a Mix...

πŸ“ 262,144 ctx Compare →
openai
$1.75/1M

OpenAI: GPT-5.2 Chat

GPT-5.2 Chat (AKA Instant) is the fast, lightweight member of the 5.2 family, optimized fo...

πŸ“ 128,000 ctx Compare →
openai
$21.00/1M

OpenAI: GPT-5.2 Pro

GPT-5.2 Pro is OpenAI’s most advanced model, offering major improvements in agentic codi...

πŸ“ 400,000 ctx Compare →
openai
$1.75/1M

OpenAI: GPT-5.2

GPT-5.2 is the latest frontier-grade model in the GPT-5 series, offering stronger agentic ...

πŸ“ 400,000 ctx Compare →
relace
$1.00/1M

Relace: Relace Search

The relace-search model uses 4-12 `view_file` and `grep` tools in parallel to explore a co...

πŸ“ 256,000 ctx Compare →
z-ai
$0.30/1M

Z.AI: GLM 4.6V

GLM-4.6V is a large multimodal model designed for high-fidelity visual understanding and l...

πŸ“ 131,072 ctx Compare →
essentialai
$0.15/1M

EssentialAI: Rnj 1 Instruct

Rnj-1 is an 8B-parameter, dense, open-weight model family developed by Essential AI and tr...

πŸ“ 32,768 ctx Compare →
openai
$1.25/1M

OpenAI: GPT-5.1-Codex-Max

GPT-5.1-Codex-Max is OpenAI’s latest agentic coding model, designed for long-running, hi...

πŸ“ 400,000 ctx Compare →
amazon
$0.30/1M

Amazon: Nova 2 Lite

Nova 2 Lite is a fast, cost-effective reasoning model for everyday workloads that can proc...

πŸ“ 1,000,000 ctx Compare →
arcee-ai
Free/1M

Arcee AI: Trinity Mini (free)

Trinity Mini is a 26B-parameter (3B active) sparse mixture-of-experts language model featu...

πŸ“ 131,072 ctx Compare →
arcee-ai
$0.05/1M

Arcee AI: Trinity Mini

Trinity Mini is a 26B-parameter (3B active) sparse mixture-of-experts language model featu...

πŸ“ 131,072 ctx Compare →
deepseek
$0.27/1M

DeepSeek: DeepSeek V3.2 Speciale

DeepSeek-V3.2-Speciale is a high-compute variant of DeepSeek-V3.2 optimized for maximum re...

πŸ“ 163,840 ctx Compare →
deepseek
$0.25/1M

DeepSeek: DeepSeek V3.2

DeepSeek-V3.2 is a large language model designed to harmonize high computational efficienc...

πŸ“ 163,840 ctx Compare →
prime-intellect
$0.20/1M

Prime Intellect: INTELLECT-3

INTELLECT-3 is a 106B-parameter Mixture-of-Experts model (12B active) post-trained from GL...

πŸ“ 131,072 ctx Compare →
tngtech
Free/1M

TNG: R1T Chimera (free)

TNG-R1T-Chimera is an experimental LLM with a faible for creative storytelling and charact...

πŸ“ 163,840 ctx Compare →
tngtech
$0.25/1M

TNG: R1T Chimera

TNG-R1T-Chimera is an experimental LLM with a faible for creative storytelling and charact...

πŸ“ 163,840 ctx Compare →
anthropic
$5.00/1M

Anthropic: Claude Opus 4.5

Claude Opus 4.5 is Anthropic’s frontier reasoning model optimized for complex software e...

πŸ“ 200,000 ctx Compare →
allenai
$0.15/1M

AllenAI: Olmo 3 32B Think

Olmo 3 32B Think is a large-scale, 32-billion-parameter model purpose-built for deep reaso...

πŸ“ 65,536 ctx Compare →
allenai
$0.12/1M

AllenAI: Olmo 3 7B Think

Olmo 3 7B Think is a research-oriented language model in the Olmo family designed for adva...

πŸ“ 65,536 ctx Compare →
google
$2.00/1M

Google: Nano Banana Pro (Gemini 3 Pro Image Preview)

Nano Banana Pro is Google’s most advanced image-generation and editing model, built on G...

πŸ“ 65,536 ctx Compare →
x-ai
$0.20/1M

xAI: Grok 4.1 Fast

Grok 4.1 Fast is xAI's best agentic tool calling model that shines in real-world use cases...

πŸ“ 2,000,000 ctx Compare →
google
$2.00/1M

Google: Gemini 3 Pro Preview

Gemini 3 Pro is Google’s flagship frontier model for high-precision multimodal reasoning...

πŸ“ 1,048,576 ctx Compare →
openai
$1.25/1M

OpenAI: GPT-5.1

GPT-5.1 is the latest frontier-grade model in the GPT-5 series, offering stronger general-...

πŸ“ 400,000 ctx Compare →
openai
$1.25/1M

OpenAI: GPT-5.1 Chat

GPT-5.1 Chat (AKA Instant is the fast, lightweight member of the 5.1 family, optimized for...

πŸ“ 128,000 ctx Compare →
openai
$1.25/1M

OpenAI: GPT-5.1-Codex

GPT-5.1-Codex is a specialized version of GPT-5.1 optimized for software engineering and c...

πŸ“ 400,000 ctx Compare →
moonshotai
$0.40/1M

MoonshotAI: Kimi K2 Thinking

Kimi K2 Thinking is Moonshot AI’s most advanced open reasoning model to date, extending ...

πŸ“ 262,144 ctx Compare →
amazon
$2.50/1M

Amazon: Nova Premier 1.0

Amazon Nova Premier is the most capable of Amazon’s multimodal models for complex reason...

πŸ“ 1,000,000 ctx Compare →
perplexity
$3.00/1M

Perplexity: Sonar Pro Search

Exclusively available on the OpenRouter API, Sonar Pro's new Pro Search mode is Perplexity...

πŸ“ 200,000 ctx Compare →
openai
$0.08/1M

OpenAI: gpt-oss-safeguard-20b

gpt-oss-safeguard-20b is a safety reasoning model from OpenAI built upon gpt-oss-20b. This...

πŸ“ 131,072 ctx Compare →
nvidia
Free/1M

NVIDIA: Nemotron Nano 12B 2 VL (free)

NVIDIA Nemotron Nano 2 VL is a 12-billion-parameter open multimodal reasoning model design...

πŸ“ 128,000 ctx Compare →
nvidia
$0.20/1M

NVIDIA: Nemotron Nano 12B 2 VL

NVIDIA Nemotron Nano 2 VL is a 12-billion-parameter open multimodal reasoning model design...

πŸ“ 131,072 ctx Compare →
minimax
$0.20/1M

MiniMax: MiniMax M2

MiniMax-M2 is a compact, high-efficiency large language model optimized for end-to-end cod...

πŸ“ 196,608 ctx Compare →
qwen
$0.50/1M

Qwen: Qwen3 VL 32B Instruct

Qwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-...

πŸ“ 262,144 ctx Compare →
deepcogito
$3.50/1M

Deep Cogito: Cogito V2 Preview Llama 405B

Cogito v2 405B is a dense hybrid reasoning model that combines direct answering capabiliti...

πŸ“ 32,768 ctx Compare →
anthropic
$1.00/1M

Anthropic: Claude Haiku 4.5

Claude Haiku 4.5 is Anthropic’s fastest and most efficient model, delivering near-fronti...

πŸ“ 200,000 ctx Compare →
qwen
$0.18/1M

Qwen: Qwen3 VL 8B Thinking

Qwen3-VL-8B-Thinking is the reasoning-optimized variant of the Qwen3-VL-8B multimodal mode...

πŸ“ 256,000 ctx Compare →
qwen
$0.08/1M

Qwen: Qwen3 VL 8B Instruct

Qwen3-VL-8B-Instruct is a multimodal vision-language model from the Qwen3-VL series, built...

πŸ“ 131,072 ctx Compare →
openai
$10.00/1M

OpenAI: GPT-5 Image

[GPT-5](https://openrouter.ai/openai/gpt-5) Image combines OpenAI's GPT-5 model with state...

πŸ“ 400,000 ctx Compare →
nvidia
$0.10/1M

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5

Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model...

πŸ“ 131,072 ctx Compare →
baidu
$0.07/1M

Baidu: ERNIE 4.5 21B A3B Thinking

ERNIE-4.5-21B-A3B-Thinking is Baidu's upgraded lightweight MoE model, refined to boost rea...

πŸ“ 131,072 ctx Compare →
qwen
$0.20/1M

Qwen: Qwen3 VL 30B A3B Thinking

Qwen3-VL-30B-A3B-Thinking is a multimodal model that unifies strong text generation with v...

πŸ“ 131,072 ctx Compare →
openai
$15.00/1M

OpenAI: GPT-5 Pro

GPT-5 Pro is OpenAI’s most advanced model, offering major improvements in reasoning, cod...

πŸ“ 400,000 ctx Compare →
z-ai
$0.35/1M

Z.AI: GLM 4.6

Compared with GLM-4.5, this generation brings several key improvements: Longer context wi...

πŸ“ 202,752 ctx Compare →
z-ai
$0.44/1M

Z.AI: GLM 4.6 (exacto)

Compared with GLM-4.5, this generation brings several key improvements: Longer context wi...

πŸ“ 204,800 ctx Compare →
anthropic
$3.00/1M

Anthropic: Claude Sonnet 4.5

Claude Sonnet 4.5 is Anthropic’s most advanced Sonnet model to date, optimized for real-...

πŸ“ 1,000,000 ctx Compare →
deepseek
$0.21/1M

DeepSeek: DeepSeek V3.2 Exp

DeepSeek-V3.2-Exp is an experimental large language model released by DeepSeek as an inter...

πŸ“ 163,840 ctx Compare →
google
$0.30/1M

Google: Gemini 2.5 Flash Preview 09-2025

Gemini 2.5 Flash Preview September 2025 Checkpoint is Google's state-of-the-art workhorse ...

πŸ“ 1,048,576 ctx Compare →
google
$0.10/1M

Google: Gemini 2.5 Flash Lite Preview 09-2025

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized...

πŸ“ 1,048,576 ctx Compare →
qwen
$0.45/1M

Qwen: Qwen3 VL 235B A22B Thinking

Qwen3-VL-235B-A22B Thinking is a multimodal model that unifies strong text generation with...

πŸ“ 262,144 ctx Compare →
qwen
$0.20/1M

Qwen: Qwen3 VL 235B A22B Instruct

Qwen3-VL-235B-A22B Instruct is an open-weight multimodal model that unifies strong text ge...

πŸ“ 262,144 ctx Compare →
qwen
$1.20/1M

Qwen: Qwen3 Max

Qwen3-Max is an updated release built on the Qwen3 series, offering major improvements in ...

πŸ“ 256,000 ctx Compare →
openai
$1.25/1M

OpenAI: GPT-5 Codex

GPT-5-Codex is a specialized version of GPT-5 optimized for software engineering and codin...

πŸ“ 400,000 ctx Compare →
deepseek
$0.21/1M

DeepSeek: DeepSeek V3.1 Terminus (exacto)

DeepSeek-V3.1 Terminus is an update to [DeepSeek V3.1](/deepseek/deepseek-chat-v3.1) that ...

πŸ“ 163,840 ctx Compare →
deepseek
$0.21/1M

DeepSeek: DeepSeek V3.1 Terminus

DeepSeek-V3.1 Terminus is an update to [DeepSeek V3.1](/deepseek/deepseek-chat-v3.1) that ...

πŸ“ 163,840 ctx Compare →
x-ai
$0.20/1M

xAI: Grok 4 Fast

Grok 4 Fast is xAI's latest multimodal model with SOTA cost-efficiency and a 2M token cont...

πŸ“ 2,000,000 ctx Compare →
alibaba
$0.09/1M

Tongyi DeepResearch 30B A3B

Tongyi DeepResearch is an agentic large language model developed by Tongyi Lab, with 30 bi...

πŸ“ 131,072 ctx Compare →
opengvlab
$0.10/1M

OpenGVLab: InternVL3 78B

The InternVL3 series is an advanced multimodal large language model (MLLM). Compared to In...

πŸ“ 32,768 ctx Compare →
qwen
$0.15/1M

Qwen: Qwen3 Next 80B A3B Thinking

Qwen3-Next-80B-A3B-Thinking is a reasoning-first chat model in the Qwen3-Next line that ou...

πŸ“ 128,000 ctx Compare →
qwen
Free/1M

Qwen: Qwen3 Next 80B A3B Instruct (free)

Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series op...

πŸ“ 262,144 ctx Compare →
qwen
$0.09/1M

Qwen: Qwen3 Next 80B A3B Instruct

Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series op...

πŸ“ 262,144 ctx Compare →
meituan
$0.20/1M

Meituan: LongCat Flash Chat

LongCat-Flash-Chat is a large-scale Mixture-of-Experts (MoE) model with 560B total paramet...

πŸ“ 131,072 ctx Compare →
qwen
$0.40/1M

Qwen: Qwen Plus 0728

Qwen Plus 0728, based on the Qwen3 foundation model, is a 1 million context hybrid reasoni...

πŸ“ 1,000,000 ctx Compare →
qwen
$0.40/1M

Qwen: Qwen Plus 0728 (thinking)

Qwen Plus 0728, based on the Qwen3 foundation model, is a 1 million context hybrid reasoni...

πŸ“ 1,000,000 ctx Compare →
nvidia
Free/1M

NVIDIA: Nemotron Nano 9B V2 (free)

NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA,...

πŸ“ 128,000 ctx Compare →
nvidia
$0.04/1M

NVIDIA: Nemotron Nano 9B V2

NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA,...

πŸ“ 131,072 ctx Compare →
moonshotai
$0.39/1M

MoonshotAI: Kimi K2 0905

Kimi K2 0905 is the September update of [Kimi K2 0711](moonshotai/kimi-k2). It is a large-...

πŸ“ 262,144 ctx Compare →
moonshotai
$0.60/1M

MoonshotAI: Kimi K2 0905 (exacto)

Kimi K2 0905 is the September update of [Kimi K2 0711](moonshotai/kimi-k2). It is a large-...

πŸ“ 262,144 ctx Compare →
deepcogito
$0.88/1M

Deep Cogito: Cogito V2 Preview Llama 70B

Cogito v2 70B is a dense hybrid reasoning model that combines direct answering capabilitie...

πŸ“ 32,768 ctx Compare →
deepcogito
$0.18/1M

Cogito V2 Preview Llama 109B

An instruction-tuned, hybrid-reasoning Mixture-of-Experts model built on Llama-4-Scout-17B...

πŸ“ 32,767 ctx Compare →
stepfun-ai
$0.57/1M

StepFun: Step3

Step3 is a cutting-edge multimodal reasoning modelβ€”built on a Mixture-of-Experts archite...

πŸ“ 65,536 ctx Compare →
qwen
$0.05/1M

Qwen: Qwen3 30B A3B Thinking 2507

Qwen3-30B-A3B-Thinking-2507 is a 30B parameter Mixture-of-Experts reasoning model optimize...

πŸ“ 32,768 ctx Compare →
x-ai
$0.20/1M

xAI: Grok Code Fast 1

Grok Code Fast 1 is a speedy and economical reasoning model that excels at agentic coding....

πŸ“ 256,000 ctx Compare →
nousresearch
$0.11/1M

Nous: Hermes 4 70B

Hermes 4 70B is a hybrid reasoning model from Nous Research, built on Meta-Llama-3.1-70B. ...

πŸ“ 131,072 ctx Compare →
nousresearch
$1.00/1M

Nous: Hermes 4 405B

Hermes 4 is a large-scale reasoning model built on Meta-Llama-3.1-405B and released by Nou...

πŸ“ 131,072 ctx Compare →
deepseek
$0.15/1M

DeepSeek: DeepSeek V3.1

DeepSeek-V3.1 is a large hybrid reasoning model (671B parameters, 37B active) that support...

πŸ“ 32,768 ctx Compare →
mistralai
$0.40/1M

Mistral: Mistral Medium 3.1

Mistral Medium 3.1 is an updated version of Mistral Medium 3, which is a high-performance ...

πŸ“ 131,072 ctx Compare →
baidu
$0.14/1M

Baidu: ERNIE 4.5 VL 28B A3B

A powerful multimodal Mixture-of-Experts chat model featuring 28B total parameters with 3B...

πŸ“ 30,000 ctx Compare →
z-ai
$0.60/1M

Z.AI: GLM 4.5V

GLM-4.5V is a vision-language foundation model for multimodal agent applications. Built on...

πŸ“ 65,536 ctx Compare →
openai
$1.25/1M

OpenAI: GPT-5

GPT-5 is OpenAI’s most advanced model, offering major improvements in reasoning, code qu...

πŸ“ 400,000 ctx Compare →
openai
$0.25/1M

OpenAI: GPT-5 Mini

GPT-5 Mini is a compact version of GPT-5, designed to handle lighter-weight reasoning task...

πŸ“ 400,000 ctx Compare →
openai
$0.05/1M

OpenAI: GPT-5 Nano

GPT-5-Nano is the smallest and fastest variant in the GPT-5 system, optimized for develope...

πŸ“ 400,000 ctx Compare →
openai
Free/1M

OpenAI: gpt-oss-120b (free)

gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model fro...

πŸ“ 131,072 ctx Compare →
openai
$0.04/1M

OpenAI: gpt-oss-120b

gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model fro...

πŸ“ 131,072 ctx Compare →
openai
$0.04/1M

OpenAI: gpt-oss-120b (exacto)

gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model fro...

πŸ“ 131,072 ctx Compare →
openai
Free/1M

OpenAI: gpt-oss-20b (free)

gpt-oss-20b is an open-weight 21B parameter model released by OpenAI under the Apache 2.0 ...

πŸ“ 131,072 ctx Compare →
openai
$0.02/1M

OpenAI: gpt-oss-20b

gpt-oss-20b is an open-weight 21B parameter model released by OpenAI under the Apache 2.0 ...

πŸ“ 131,072 ctx Compare →
anthropic
$15.00/1M

Anthropic: Claude Opus 4.1

Claude Opus 4.1 is an updated version of Anthropic’s flagship model, offering improved p...

πŸ“ 200,000 ctx Compare →
qwen
$0.08/1M

Qwen: Qwen3 30B A3B Instruct 2507

Qwen3-30B-A3B-Instruct-2507 is a 30.5B-parameter mixture-of-experts language model from Qw...

πŸ“ 262,144 ctx Compare →
z-ai
$0.35/1M

Z.AI: GLM 4.5

GLM-4.5 is our latest flagship foundation model, purpose-built for agent-based application...

πŸ“ 131,072 ctx Compare →
z-ai
Free/1M

Z.AI: GLM 4.5 Air (free)

GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-b...

πŸ“ 131,072 ctx Compare →
z-ai
$0.05/1M

Z.AI: GLM 4.5 Air

GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-b...

πŸ“ 131,072 ctx Compare →
qwen
$0.11/1M

Qwen: Qwen3 235B A22B Thinking 2507

Qwen3-235B-A22B-Thinking-2507 is a high-performance, open-weight Mixture-of-Experts (MoE) ...

πŸ“ 262,144 ctx Compare →
qwen
Free/1M

Qwen: Qwen3 Coder 480B A35B (free)

Qwen3-Coder-480B-A35B-Instruct is a Mixture-of-Experts (MoE) code generation model develop...

πŸ“ 262,000 ctx Compare →
qwen
$0.22/1M

Qwen: Qwen3 Coder 480B A35B

Qwen3-Coder-480B-A35B-Instruct is a Mixture-of-Experts (MoE) code generation model develop...

πŸ“ 262,144 ctx Compare →
qwen
$0.22/1M

Qwen: Qwen3 Coder 480B A35B (exacto)

Qwen3-Coder-480B-A35B-Instruct is a Mixture-of-Experts (MoE) code generation model develop...

πŸ“ 262,144 ctx Compare →
bytedance
$0.10/1M

ByteDance: UI-TARS 7B

UI-TARS-1.5 is a multimodal vision-language agent optimized for GUI-based environments, in...

πŸ“ 128,000 ctx Compare →
google
$0.10/1M

Google: Gemini 2.5 Flash Lite

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized...

πŸ“ 1,048,576 ctx Compare →
qwen
$0.07/1M

Qwen: Qwen3 235B A22B Instruct 2507

Qwen3-235B-A22B-Instruct-2507 is a multilingual, instruction-tuned mixture-of-experts lang...

πŸ“ 262,144 ctx Compare →
moonshotai
Free/1M

MoonshotAI: Kimi K2 0711 (free)

Kimi K2 Instruct is a large-scale Mixture-of-Experts (MoE) language model developed by Moo...

πŸ“ 32,768 ctx Compare →
moonshotai
$0.50/1M

MoonshotAI: Kimi K2 0711

Kimi K2 Instruct is a large-scale Mixture-of-Experts (MoE) language model developed by Moo...

πŸ“ 131,072 ctx Compare →
mistralai
$0.40/1M

Mistral: Devstral Medium

Devstral Medium is a high-performance code generation and agentic reasoning model develope...

πŸ“ 131,072 ctx Compare →
x-ai
$3.00/1M

xAI: Grok 4

Grok 4 is xAI's latest reasoning model with a 256k context window. It supports parallel to...

πŸ“ 256,000 ctx Compare →
google
Free/1M

Google: Gemma 3n 2B (free)

Gemma 3n E2B IT is a multimodal, instruction-tuned model developed by Google DeepMind, des...

πŸ“ 8,192 ctx Compare →
tencent
$0.14/1M

Tencent: Hunyuan A13B Instruct

Hunyuan-A13B is a 13B active parameter Mixture-of-Experts (MoE) language model developed b...

πŸ“ 131,072 ctx Compare →
tngtech
Free/1M

TNG: DeepSeek R1T2 Chimera (free)

DeepSeek-TNG-R1T2-Chimera is the second-generation Chimera model from TNG Tech. It is a 67...

πŸ“ 163,840 ctx Compare →
tngtech
$0.25/1M

TNG: DeepSeek R1T2 Chimera

DeepSeek-TNG-R1T2-Chimera is the second-generation Chimera model from TNG Tech. It is a 67...

πŸ“ 163,840 ctx Compare →
baidu
$0.42/1M

Baidu: ERNIE 4.5 VL 424B A47B

ERNIE-4.5-VL-424B-A47B is a multimodal Mixture-of-Experts (MoE) model from Baidu’s ERNIE...

πŸ“ 123,000 ctx Compare →
baidu
$0.28/1M

Baidu: ERNIE 4.5 300B A47B

ERNIE-4.5-300B-A47B is a 300B parameter Mixture-of-Experts (MoE) language model developed ...

πŸ“ 123,000 ctx Compare →
minimax
$0.40/1M

MiniMax: MiniMax M1

MiniMax-M1 is a large-scale, open-weight reasoning model designed for extended context and...

πŸ“ 1,000,000 ctx Compare →
google
$0.30/1M

Google: Gemini 2.5 Flash

Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for a...

πŸ“ 1,048,576 ctx Compare →
google
$1.25/1M

Google: Gemini 2.5 Pro

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, co...

πŸ“ 1,048,576 ctx Compare →
moonshotai
$0.29/1M

MoonshotAI: Kimi Dev 72B

Kimi-Dev-72B is an open-source large language model fine-tuned for software engineering an...

πŸ“ 131,072 ctx Compare →
openai
$20.00/1M

OpenAI: o3 Pro

The o-series of models are trained with reinforcement learning to think before they answer...

πŸ“ 200,000 ctx Compare →
x-ai
$0.30/1M

xAI: Grok 3 Mini

A lightweight model that thinks before responding. Fast, smart, and great for logic-based ...

πŸ“ 131,072 ctx Compare →
google
$1.25/1M

Google: Gemini 2.5 Pro Preview 06-05

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, co...

πŸ“ 1,048,576 ctx Compare →
deepseek
Free/1M

DeepSeek: R1 0528 (free)

May 28th update to the [original DeepSeek R1](/deepseek/deepseek-r1) Performance on par wi...

πŸ“ 163,840 ctx Compare →
deepseek
$0.40/1M

DeepSeek: R1 0528

May 28th update to the [original DeepSeek R1](/deepseek/deepseek-r1) Performance on par wi...

πŸ“ 163,840 ctx Compare →
anthropic
$3.00/1M

Anthropic: Claude Sonnet 4

Claude Sonnet 4 significantly enhances the capabilities of its predecessor, Sonnet 3.7, ex...

πŸ“ 1,000,000 ctx Compare →
nousresearch
$0.02/1M

Nous: DeepHermes 3 Mistral 24B Preview

DeepHermes 3 (Mistral 24B Preview) is an instruction-tuned language model by Nous Research...

πŸ“ 32,768 ctx Compare →
mistralai
$0.40/1M

Mistral: Mistral Medium 3

Mistral Medium 3 is a high-performance enterprise-grade language model designed to deliver...

πŸ“ 131,072 ctx Compare →
google
$1.25/1M

Google: Gemini 2.5 Pro Preview 05-06

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, co...

πŸ“ 1,048,576 ctx Compare →
arcee-ai
$0.90/1M

Arcee AI: Maestro Reasoning

Maestro Reasoning is Arcee's flagship analysis model: a 32β€―B‑parameter derivative of Q...

πŸ“ 131,072 ctx Compare →
arcee-ai
$0.75/1M

Arcee AI: Virtuoso Large

Virtuoso‑Large is Arcee's top‑tier general‑purpose LLM at 72β€―B parameters, tuned t...

πŸ“ 131,072 ctx Compare →
qwen
Free/1M

Qwen: Qwen3 4B (free)

Qwen3-4B is a 4 billion parameter dense language model from the Qwen3 series, designed to ...

πŸ“ 40,960 ctx Compare →
qwen
$0.06/1M

Qwen: Qwen3 30B A3B

Qwen3, the latest generation in the Qwen large language model series, features both dense ...

πŸ“ 40,960 ctx Compare →
qwen
$0.05/1M

Qwen: Qwen3 8B

Qwen3-8B is a dense 8.2B parameter causal language model from the Qwen3 series, designed f...

πŸ“ 32,000 ctx Compare →
qwen
$0.05/1M

Qwen: Qwen3 14B

Qwen3-14B is a dense 14.8B parameter causal language model from the Qwen3 series, designed...

πŸ“ 40,960 ctx Compare →
qwen
$0.08/1M

Qwen: Qwen3 32B

Qwen3-32B is a dense 32.8B parameter causal language model from the Qwen3 series, optimize...

πŸ“ 40,960 ctx Compare →
qwen
$0.20/1M

Qwen: Qwen3 235B A22B

Qwen3-235B-A22B is a 235B parameter mixture-of-experts (MoE) model developed by Qwen, acti...

πŸ“ 40,960 ctx Compare →
tngtech
Free/1M

TNG: DeepSeek R1T Chimera (free)

DeepSeek-R1T-Chimera is created by merging DeepSeek-R1 and DeepSeek-V3 (0324), combining t...

πŸ“ 163,840 ctx Compare →
tngtech
$0.30/1M

TNG: DeepSeek R1T Chimera

DeepSeek-R1T-Chimera is created by merging DeepSeek-R1 and DeepSeek-V3 (0324), combining t...

πŸ“ 163,840 ctx Compare →
openai
$1.10/1M

OpenAI: o4 Mini High

OpenAI o4-mini-high is the same model as [o4-mini](/openai/o4-mini) with reasoning_effort ...

πŸ“ 200,000 ctx Compare →
openai
$2.00/1M

OpenAI: o3

o3 is a well-rounded and powerful model across domains. It sets a new standard for math, s...

πŸ“ 200,000 ctx Compare →
openai
$1.10/1M

OpenAI: o4 Mini

OpenAI o4-mini is a compact reasoning model in the o-series, optimized for fast, cost-effi...

πŸ“ 200,000 ctx Compare →
qwen
$0.03/1M

Qwen: Qwen2.5 Coder 7B Instruct

Qwen2.5-Coder-7B-Instruct is a 7B parameter instruction-tuned language model optimized for...

πŸ“ 32,768 ctx Compare →
openai
$2.00/1M

OpenAI: GPT-4.1

GPT-4.1 is a flagship large language model optimized for advanced instruction following, r...

πŸ“ 1,047,576 ctx Compare →
eleutherai
$0.80/1M

EleutherAI: Llemma 7b

Llemma 7B is a language model for mathematics. It was initialized with Code Llama 7B weigh...

πŸ“ 4,096 ctx Compare →
x-ai
$0.30/1M

xAI: Grok 3 Mini Beta

Grok 3 Mini is a lightweight, smaller thinking model. Unlike traditional models that gener...

πŸ“ 131,072 ctx Compare →
nvidia
$0.60/1M

NVIDIA: Llama 3.1 Nemotron Ultra 253B v1

Llama-3.1-Nemotron-Ultra-253B-v1 is a large language model (LLM) optimized for advanced re...

πŸ“ 131,072 ctx Compare →
meta-llama
$0.15/1M

Meta: Llama 4 Maverick

Llama 4 Maverick 17B Instruct (128E) is a high-capacity multimodal language model from Met...

πŸ“ 1,048,576 ctx Compare →
meta-llama
$0.08/1M

Meta: Llama 4 Scout

Llama 4 Scout 17B Instruct (16E) is a mixture-of-experts (MoE) language model developed by...

πŸ“ 327,680 ctx Compare →
qwen
$0.05/1M

Qwen: Qwen2.5 VL 32B Instruct

Qwen2.5-VL-32B is a multimodal vision-language model fine-tuned through reinforcement lear...

πŸ“ 16,384 ctx Compare →
openai
$150.00/1M

OpenAI: o1-pro

The o1 series of models are trained with reinforcement learning to think before they answe...

πŸ“ 200,000 ctx Compare →
mistralai
Free/1M

Mistral: Mistral Small 3.1 24B (free)

Mistral Small 3.1 24B Instruct is an upgraded variant of Mistral Small 3 (2501), featuring...

πŸ“ 128,000 ctx Compare →
mistralai
$0.03/1M

Mistral: Mistral Small 3.1 24B

Mistral Small 3.1 24B Instruct is an upgraded variant of Mistral Small 3 (2501), featuring...

πŸ“ 131,072 ctx Compare →
allenai
$0.05/1M

AllenAI: Olmo 2 32B Instruct

OLMo-2 32B Instruct is a supervised instruction-finetuned variant of the OLMo-2 32B March ...

πŸ“ 128,000 ctx Compare →
google
Free/1M

Google: Gemma 3 4B (free)

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It ha...

πŸ“ 32,768 ctx Compare →
google
$0.02/1M

Google: Gemma 3 4B

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It ha...

πŸ“ 96,000 ctx Compare →
google
Free/1M

Google: Gemma 3 12B (free)

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It ha...

πŸ“ 32,768 ctx Compare →
google
$0.03/1M

Google: Gemma 3 12B

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It ha...

πŸ“ 131,072 ctx Compare →
google
Free/1M

Google: Gemma 3 27B (free)

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It ha...

πŸ“ 131,072 ctx Compare →
google
$0.04/1M

Google: Gemma 3 27B

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It ha...

πŸ“ 96,000 ctx Compare →
perplexity
$2.00/1M

Perplexity: Sonar Reasoning Pro

Note: Sonar Pro pricing includes Perplexity search pricing. See [details here](https://doc...

πŸ“ 128,000 ctx Compare →
perplexity
$3.00/1M

Perplexity: Sonar Pro

Note: Sonar Pro pricing includes Perplexity search pricing. See [details here](https://doc...

πŸ“ 200,000 ctx Compare →
perplexity
$2.00/1M

Perplexity: Sonar Deep Research

Sonar Deep Research is a research-focused model designed for multi-step retrieval, synthes...

πŸ“ 128,000 ctx Compare →
qwen
$0.15/1M

Qwen: QwQ 32B

QwQ is the reasoning model of the Qwen series. Compared with conventional instruction-tune...

πŸ“ 32,768 ctx Compare →
anthropic
$3.00/1M

Anthropic: Claude 3.7 Sonnet (thinking)

Claude 3.7 Sonnet is an advanced large language model with improved reasoning, coding, and...

πŸ“ 200,000 ctx Compare →
anthropic
$3.00/1M

Anthropic: Claude 3.7 Sonnet

Claude 3.7 Sonnet is an advanced large language model with improved reasoning, coding, and...

πŸ“ 200,000 ctx Compare →
openai
$1.10/1M

OpenAI: o3 Mini High

OpenAI o3-mini-high is the same model as [o3-mini](/openai/o3-mini) with reasoning_effort ...

πŸ“ 200,000 ctx Compare →
aion-labs
$4.00/1M

AionLabs: Aion-1.0

Aion-1.0 is a multi-model system designed for high performance across various tasks, inclu...

πŸ“ 131,072 ctx Compare →
aion-labs
$0.70/1M

AionLabs: Aion-1.0-Mini

Aion-1.0-Mini 32B parameter model is a distilled version of the DeepSeek-R1 model, designe...

πŸ“ 131,072 ctx Compare →
openai
$1.10/1M

OpenAI: o3 Mini

OpenAI o3-mini is a cost-efficient language model optimized for STEM reasoning tasks, part...

πŸ“ 200,000 ctx Compare →
deepseek
$0.70/1M

DeepSeek: R1

DeepSeek R1 is here: Performance on par with [OpenAI o1](/openai/o1), but open-sourced and...

πŸ“ 64,000 ctx Compare →
microsoft
$0.06/1M

Microsoft: Phi 4

[Microsoft Research](/microsoft) Phi-4 is designed to perform well in complex reasoning ta...

πŸ“ 16,384 ctx Compare →
openai
$15.00/1M

OpenAI: o1

The latest and strongest model family from OpenAI, o1 is designed to spend more time think...

πŸ“ 200,000 ctx Compare →
cohere
$0.04/1M

Cohere: Command R7B (12-2024)

Command R7B (12-2024) is a small, fast update of the Command R+ model, delivered in Decemb...

πŸ“ 128,000 ctx Compare →
amazon
$0.04/1M

Amazon: Nova Micro 1.0

Amazon Nova Micro 1.0 is a text-only model that delivers the lowest latency responses in t...

πŸ“ 128,000 ctx Compare →
mistralai
$2.00/1M

Mistral Large 2407

This is Mistral AI's flagship model, Mistral Large 2 (version mistral-large-2407). It's a ...

πŸ“ 131,072 ctx Compare →
qwen
$0.03/1M

Qwen2.5 Coder 32B Instruct

Qwen2.5-Coder is the latest series of Code-Specific Qwen large language models (formerly k...

πŸ“ 32,768 ctx Compare →
raifle
$4.50/1M

SorcererLM 8x22B

SorcererLM is an advanced RP and storytelling model, built as a Low-rank 16-bit LoRA fine-...

πŸ“ 16,000 ctx Compare →
mistralai
$0.10/1M

Mistral: Ministral 8B

Ministral 8B is an 8B parameter model featuring a unique interleaved sliding-window attent...

πŸ“ 131,072 ctx Compare →
mistralai
$0.04/1M

Mistral: Ministral 3B

Ministral 3B is a 3B parameter model optimized for on-device and edge computing. It excels...

πŸ“ 131,072 ctx Compare →
meta-llama
Free/1M

Meta: Llama 3.2 3B Instruct (free)

Llama 3.2 3B is a 3-billion-parameter multilingual large language model, optimized for adv...

πŸ“ 131,072 ctx Compare →
meta-llama
$0.02/1M

Meta: Llama 3.2 3B Instruct

Llama 3.2 3B is a 3-billion-parameter multilingual large language model, optimized for adv...

πŸ“ 131,072 ctx Compare →
meta-llama
$0.05/1M

Meta: Llama 3.2 11B Vision Instruct

Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle ...

πŸ“ 131,072 ctx Compare →
cohere
$0.15/1M

Cohere: Command R (08-2024)

command-r-08-2024 is an update of the [Command R](/models/cohere/command-r) with improved ...

πŸ“ 128,000 ctx Compare →
qwen
Free/1M

Qwen: Qwen2.5-VL 7B Instruct (free)

Qwen2.5 VL 7B is a multimodal LLM from the Qwen Team with the following key enhancements: ...

πŸ“ 32,768 ctx Compare →
qwen
$0.20/1M

Qwen: Qwen2.5-VL 7B Instruct

Qwen2.5 VL 7B is a multimodal LLM from the Qwen Team with the following key enhancements: ...

πŸ“ 32,768 ctx Compare →
nousresearch
$0.30/1M

Nous: Hermes 3 70B Instruct

Hermes 3 is a generalist language model with many improvements over [Hermes 2](/models/nou...

πŸ“ 65,536 ctx Compare →
nousresearch
Free/1M

Nous: Hermes 3 405B Instruct (free)

Hermes 3 is a generalist language model with many improvements over Hermes 2, including ad...

πŸ“ 131,072 ctx Compare →
nousresearch
$1.00/1M

Nous: Hermes 3 405B Instruct

Hermes 3 is a generalist language model with many improvements over Hermes 2, including ad...

πŸ“ 131,072 ctx Compare →
sao10k
$0.04/1M

Sao10K: Llama 3 8B Lunaris

Lunaris 8B is a versatile generalist and roleplaying model based on Llama 3. It's a strate...

πŸ“ 8,192 ctx Compare →
google
$0.65/1M

Google: Gemma 2 27B

Gemma 2 27B by Google is an open model built from the same research and technology used to...

πŸ“ 8,192 ctx Compare →
mistralai
$2.00/1M

Mistral: Mixtral 8x22B Instruct

Mistral's official instruct fine-tuned version of [Mixtral 8x22B](/models/mistralai/mixtra...

πŸ“ 65,536 ctx Compare →
mistralai
$2.00/1M

Mistral Large

This is Mistral AI's flagship model, Mistral Large 2 (version `mistral-large-2407`). It's ...

πŸ“ 128,000 ctx Compare →
mistralai
$0.25/1M

Mistral Tiny

Note: This model is being deprecated. Recommended replacement is the newer [Ministral 8B](...

πŸ“ 32,768 ctx Compare →
openai
$30.00/1M

OpenAI: GPT-4

OpenAI's flagship model, GPT-4 is a large-scale multimodal language model capable of solvi...

πŸ“ 8,191 ctx Compare →