Best AI Models for Reasoning

nvidia

Free/1M

NVIDIA: Nemotron 3 Ultra (free)

NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA,...

📝 1,000,000 ctx Compare →

nvidia

$0.50/1M

NVIDIA: Nemotron 3 Ultra

NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA,...

📝 1,000,000 ctx Compare →

anthropic

$5.00/1M

Anthropic: Claude Opus 4.8

Claude Opus 4.8 is Anthropic's most capable generally available model in the Opus family. ...

📝 1,000,000 ctx Compare →

google

$1.50/1M

Google: Gemini 3.5 Flash

Gemini 3.5 Flash is Google's high-efficiency multimodal model, bringing near-Pro level cod...

📝 1,048,576 ctx Compare →

perceptron

$0.15/1M

Perceptron: Perceptron Mk1

Perceptron Mk1 (Mark One) is Perceptron's highest-quality vision-language model for video ...

📝 32,768 ctx Compare →

x-ai

$1.25/1M

xAI: Grok 4.3

Grok 4.3 is a reasoning model from xAI. It accepts text and image inputs with text output,...

📝 1,000,000 ctx Compare →

poolside

Free/1M

Poolside: Laguna XS.2 (free)

Laguna XS.2 is the second-generation model in the XS size class from [Poolside](https://po...

📝 262,144 ctx Compare →

poolside

Free/1M

Poolside: Laguna M.1 (free)

Laguna M.1 is the flagship coding agent model from [Poolside](https://poolside.ai), optimi...

📝 262,144 ctx Compare →

openai

$30.00/1M

OpenAI: GPT-5.5 Pro

GPT-5.5 Pro is OpenAI’s high-capability model optimized for deep reasoning and accuracy ...

📝 1,050,000 ctx Compare →

openai

$5.00/1M

OpenAI: GPT-5.5

GPT-5.5 is OpenAI’s frontier model designed for complex professional workloads, building...

📝 1,050,000 ctx Compare →

deepseek

$0.44/1M

DeepSeek: DeepSeek V4 Pro

DeepSeek V4 Pro is a large-scale Mixture-of-Experts model from DeepSeek with 1.6T total pa...

📝 1,048,576 ctx Compare →

tencent

$0.06/1M

Tencent: Hy3 preview

Hy3 preview is a high-efficiency Mixture-of-Experts model from Tencent designed for agenti...

📝 262,144 ctx Compare →

openai

$8.00/1M

OpenAI: GPT-5.4 Image 2

[GPT-5.4](https://openrouter.ai/openai/gpt-5.4) Image 2 combines OpenAI's GPT-5.4 model wi...

📝 272,000 ctx Compare →

google

Free/1M

Google: Gemma 4 31B (free)

Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and...

📝 262,144 ctx Compare →

google

$0.12/1M

Google: Gemma 4 31B

Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and...

📝 262,144 ctx Compare →

arcee-ai

$0.22/1M

Arcee AI: Trinity Large Thinking

Trinity Large Thinking is a powerful open source reasoning model from the team at Arcee AI...

📝 262,144 ctx Compare →

x-ai

$1.25/1M

xAI: Grok 4.20

Grok 4.20 is a reasoning model from xAI with industry-leading speed and agentic tool calli...

📝 2,000,000 ctx Compare →

openai

$0.75/1M

OpenAI: GPT-5.4 Mini

GPT-5.4 mini brings the core capabilities of GPT-5.4 to a faster, more efficient model opt...

📝 400,000 ctx Compare →

mistralai

$0.15/1M

Mistral: Mistral Small 4

Mistral Small 4 is the next major release in the Mistral Small family, unifying the capabi...

📝 262,144 ctx Compare →

qwen

$0.04/1M

Qwen: Qwen3.5-9B

Qwen3.5-9B is a multimodal foundation model from the Qwen3.5 family, designed to deliver s...

📝 262,144 ctx Compare →

openai

$30.00/1M

OpenAI: GPT-5.4 Pro

GPT-5.4 Pro is OpenAI's most advanced model, building on GPT-5.4's unified architecture wi...

📝 1,050,000 ctx Compare →

inception

$0.25/1M

Inception: Mercury 2

Mercury 2 is an extremely fast reasoning LLM, and the first reasoning diffusion LLM (dLLM)...

📝 128,000 ctx Compare →

bytedance-seed

$0.10/1M

ByteDance Seed: Seed-2.0-Mini

Seed-2.0-mini targets latency-sensitive, high-concurrency, and cost-sensitive scenarios, e...

📝 262,144 ctx Compare →

openai

$1.75/1M

OpenAI: GPT-5.3-Codex

GPT-5.3-Codex is OpenAI’s most advanced agentic coding model, combining the frontier sof...

📝 400,000 ctx Compare →

google

$2.00/1M

Google: Gemini 3.1 Pro Preview

Gemini 3.1 Pro Preview is Google’s frontier reasoning model, delivering enhanced softwar...

📝 1,048,576 ctx Compare →

qwen

$0.78/1M

Qwen: Qwen3 Max Thinking

Qwen3-Max-Thinking is the flagship reasoning model in the Qwen3 series, designed for high-...

📝 262,144 ctx Compare →

liquid

Free/1M

LiquidAI: LFM2.5-1.2B-Thinking (free)

LFM2.5-1.2B-Thinking is a lightweight reasoning-focused model optimized for agentic tasks,...

📝 32,768 ctx Compare →

z-ai

$0.40/1M

Z.ai: GLM 4.7

GLM-4.7 is Z.ai’s latest flagship model, featuring upgrades in two key areas: enhanced p...

📝 202,752 ctx Compare →

google

$0.50/1M

Google: Gemini 3 Flash Preview

Gemini 3 Flash Preview is a high speed, high value thinking model designed for agentic wor...

📝 1,048,576 ctx Compare →

openai

$1.75/1M

OpenAI: GPT-5.2 Chat

GPT-5.2 Chat (AKA Instant) is the fast, lightweight member of the 5.2 family, optimized fo...

📝 128,000 ctx Compare →

openai

$21.00/1M

OpenAI: GPT-5.2 Pro

GPT-5.2 Pro is OpenAI’s most advanced model, offering major improvements in agentic codi...

📝 400,000 ctx Compare →

openai

$1.75/1M

OpenAI: GPT-5.2

GPT-5.2 is the latest frontier-grade model in the GPT-5 series, offering stronger agentic ...

📝 400,000 ctx Compare →

z-ai

$0.30/1M

Z.ai: GLM 4.6V

GLM-4.6V is a large multimodal model designed for high-fidelity visual understanding and l...

📝 131,072 ctx Compare →

essentialai

$0.15/1M

EssentialAI: Rnj 1 Instruct

Rnj-1 is an 8B-parameter, dense, open-weight model family developed by Essential AI and tr...

📝 32,768 ctx Compare →

openai

$1.25/1M

OpenAI: GPT-5.1-Codex-Max

GPT-5.1-Codex-Max is OpenAI’s latest agentic coding model, designed for long-running, hi...

📝 400,000 ctx Compare →

amazon

$0.30/1M

Amazon: Nova 2 Lite

Nova 2 Lite is a fast, cost-effective reasoning model for everyday workloads that can proc...

📝 1,000,000 ctx Compare →

arcee-ai

$0.05/1M

Arcee AI: Trinity Mini

Trinity Mini is a 26B-parameter (3B active) sparse mixture-of-experts language model featu...

📝 131,072 ctx Compare →

deepseek

$0.23/1M

DeepSeek: DeepSeek V3.2

DeepSeek-V3.2 is a large language model designed to harmonize high computational efficienc...

📝 131,072 ctx Compare →

anthropic

$5.00/1M

Anthropic: Claude Opus 4.5

Claude Opus 4.5 is Anthropic’s frontier reasoning model optimized for complex software e...

📝 200,000 ctx Compare →

allenai

$0.15/1M

AllenAI: Olmo 3 32B Think

Olmo 3 32B Think is a large-scale, 32-billion-parameter model purpose-built for deep reaso...

📝 65,536 ctx Compare →

google

$2.00/1M

Google: Nano Banana Pro (Gemini 3 Pro Image Preview)

Nano Banana Pro is Google’s most advanced image-generation and editing model, built on G...

📝 65,536 ctx Compare →

openai

$1.25/1M

OpenAI: GPT-5.1

GPT-5.1 is the latest frontier-grade model in the GPT-5 series, offering stronger general-...

📝 400,000 ctx Compare →

openai

$1.25/1M

OpenAI: GPT-5.1 Chat

GPT-5.1 Chat (AKA Instant is the fast, lightweight member of the 5.1 family, optimized for...

📝 128,000 ctx Compare →

moonshotai

$0.60/1M

MoonshotAI: Kimi K2 Thinking

Kimi K2 Thinking is Moonshot AI’s most advanced open reasoning model to date, extending ...

📝 262,144 ctx Compare →

amazon

$2.50/1M

Amazon: Nova Premier 1.0

Amazon Nova Premier is the most capable of Amazon’s multimodal models for complex reason...

📝 1,000,000 ctx Compare →

perplexity

$3.00/1M

Perplexity: Sonar Pro Search

Exclusively available on the OpenRouter API, Sonar Pro's new Pro Search mode is Perplexity...

📝 200,000 ctx Compare →

openai

$0.08/1M

OpenAI: gpt-oss-safeguard-20b

gpt-oss-safeguard-20b is a safety reasoning model from OpenAI built upon gpt-oss-20b. This...

📝 131,072 ctx Compare →

nvidia

Free/1M

NVIDIA: Nemotron Nano 12B 2 VL (free)

NVIDIA Nemotron Nano 2 VL is a 12-billion-parameter open multimodal reasoning model design...

📝 128,000 ctx Compare →

minimax

$0.26/1M

MiniMax: MiniMax M2

MiniMax-M2 is a compact, high-efficiency large language model optimized for end-to-end cod...

📝 204,800 ctx Compare →

qwen

$0.10/1M

Qwen: Qwen3 VL 32B Instruct

Qwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-...

📝 262,144 ctx Compare →

microsoft

$0.08/1M

Microsoft: Phi 4 Mini Instruct

Phi-4-mini-instruct is a lightweight open model built upon synthetic data and filtered pub...

📝 131,072 ctx Compare →

qwen

$0.12/1M

Qwen: Qwen3 VL 8B Thinking

Qwen3-VL-8B-Thinking is the reasoning-optimized variant of the Qwen3-VL-8B multimodal mode...

📝 256,000 ctx Compare →

qwen

$0.08/1M

Qwen: Qwen3 VL 8B Instruct

Qwen3-VL-8B-Instruct is a multimodal vision-language model from the Qwen3-VL series, built...

📝 256,000 ctx Compare →

openai

$10.00/1M

OpenAI: GPT-5 Image

[GPT-5](https://openrouter.ai/openai/gpt-5) Image combines OpenAI's GPT-5 model with state...

📝 400,000 ctx Compare →

nvidia

$0.10/1M

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5

Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model...

📝 131,072 ctx Compare →

qwen

$0.13/1M

Qwen: Qwen3 VL 30B A3B Thinking

Qwen3-VL-30B-A3B-Thinking is a multimodal model that unifies strong text generation with v...

📝 131,072 ctx Compare →

openai

$15.00/1M

OpenAI: GPT-5 Pro

GPT-5 Pro is OpenAI’s most advanced model, offering major improvements in reasoning, cod...

📝 400,000 ctx Compare →

google

$0.10/1M

Google: Gemini 2.5 Flash Lite Preview 09-2025

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized...

📝 1,048,576 ctx Compare →

qwen

$0.26/1M

Qwen: Qwen3 VL 235B A22B Thinking

Qwen3-VL-235B-A22B Thinking is a multimodal model that unifies strong text generation with...

📝 131,072 ctx Compare →

qwen

$0.78/1M

Qwen: Qwen3 Max

Qwen3-Max is an updated release built on the Qwen3 series, offering major improvements in ...

📝 262,144 ctx Compare →

qwen

$0.10/1M

Qwen: Qwen3 Next 80B A3B Thinking

Qwen3-Next-80B-A3B-Thinking is a reasoning-first chat model in the Qwen3-Next line that ou...

📝 262,144 ctx Compare →

qwen

Free/1M

Qwen: Qwen3 Next 80B A3B Instruct (free)

Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series op...

📝 262,144 ctx Compare →

qwen

$0.09/1M

Qwen: Qwen3 Next 80B A3B Instruct

Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series op...

📝 262,144 ctx Compare →

qwen

$0.26/1M

Qwen: Qwen Plus 0728 (thinking)

Qwen Plus 0728, based on the Qwen3 foundation model, is a 1 million context hybrid reasoni...

📝 1,000,000 ctx Compare →

qwen

$0.26/1M

Qwen: Qwen Plus 0728

Qwen Plus 0728, based on the Qwen3 foundation model, is a 1 million context hybrid reasoni...

📝 1,000,000 ctx Compare →

nvidia

Free/1M

NVIDIA: Nemotron Nano 9B V2 (free)

NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA,...

📝 128,000 ctx Compare →

nvidia

$0.04/1M

NVIDIA: Nemotron Nano 9B V2

NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA,...

📝 131,072 ctx Compare →

qwen

$0.08/1M

Qwen: Qwen3 30B A3B Thinking 2507

Qwen3-30B-A3B-Thinking-2507 is a 30B parameter Mixture-of-Experts reasoning model optimize...

📝 131,072 ctx Compare →

nousresearch

$0.13/1M

Nous: Hermes 4 70B

Hermes 4 70B is a hybrid reasoning model from Nous Research, built on Meta-Llama-3.1-70B. ...

📝 131,072 ctx Compare →

nousresearch

$1.00/1M

Nous: Hermes 4 405B

Hermes 4 is a large-scale reasoning model built on Meta-Llama-3.1-405B and released by Nou...

📝 131,072 ctx Compare →

deepseek

$0.21/1M

DeepSeek: DeepSeek V3.1

DeepSeek-V3.1 is a large hybrid reasoning model (671B parameters, 37B active) that support...

📝 163,840 ctx Compare →

openai

$1.25/1M

OpenAI: GPT-5

GPT-5 is OpenAI’s most advanced model, offering major improvements in reasoning, code qu...

📝 400,000 ctx Compare →

openai

$0.25/1M

OpenAI: GPT-5 Mini

GPT-5 Mini is a compact version of GPT-5, designed to handle lighter-weight reasoning task...

📝 400,000 ctx Compare →

openai

$0.05/1M

OpenAI: GPT-5 Nano

GPT-5-Nano is the smallest and fastest variant in the GPT-5 system, optimized for develope...

📝 400,000 ctx Compare →

openai

Free/1M

OpenAI: gpt-oss-120b (free)

gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model fro...

📝 131,072 ctx Compare →

openai

$0.04/1M

OpenAI: gpt-oss-120b

gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model fro...

📝 131,072 ctx Compare →

anthropic

$15.00/1M

Anthropic: Claude Opus 4.1

Claude Opus 4.1 is an updated version of Anthropic’s flagship model, offering improved p...

📝 200,000 ctx Compare →

qwen

$0.10/1M

Qwen: Qwen3 235B A22B Thinking 2507

Qwen3-235B-A22B-Thinking-2507 is a high-performance, open-weight Mixture-of-Experts (MoE) ...

📝 262,144 ctx Compare →

qwen

Free/1M

Qwen: Qwen3 Coder 480B A35B (free)

Qwen3-Coder-480B-A35B-Instruct is a Mixture-of-Experts (MoE) code generation model develop...

📝 1,048,576 ctx Compare →

qwen

$0.22/1M

Qwen: Qwen3 Coder 480B A35B

Qwen3-Coder-480B-A35B-Instruct is a Mixture-of-Experts (MoE) code generation model develop...

📝 1,048,576 ctx Compare →

google

$0.10/1M

Google: Gemini 2.5 Flash Lite

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized...

📝 1,048,576 ctx Compare →

tencent

$0.14/1M

Tencent: Hunyuan A13B Instruct

Hunyuan-A13B is a 13B active parameter Mixture-of-Experts (MoE) language model developed b...

📝 131,072 ctx Compare →

minimax

$0.40/1M

MiniMax: MiniMax M1

MiniMax-M1 is a large-scale, open-weight reasoning model designed for extended context and...

📝 1,000,000 ctx Compare →

google

$0.30/1M

Google: Gemini 2.5 Flash

Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for a...

📝 1,048,576 ctx Compare →

google

$1.25/1M

Google: Gemini 2.5 Pro

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, co...

📝 1,048,576 ctx Compare →

openai

$20.00/1M

OpenAI: o3 Pro

The o-series of models are trained with reinforcement learning to think before they answer...

📝 200,000 ctx Compare →

google

$1.25/1M

Google: Gemini 2.5 Pro Preview 06-05

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, co...

📝 1,048,576 ctx Compare →

deepseek

$0.50/1M

DeepSeek: R1 0528

May 28th update to the [original DeepSeek R1](/deepseek/deepseek-r1) Performance on par wi...

📝 163,840 ctx Compare →

anthropic

$3.00/1M

Anthropic: Claude Sonnet 4

Claude Sonnet 4 significantly enhances the capabilities of its predecessor, Sonnet 3.7, ex...

📝 1,000,000 ctx Compare →

mistralai

$0.40/1M

Mistral: Mistral Medium 3

Mistral Medium 3 is a high-performance enterprise-grade language model designed to deliver...

📝 131,072 ctx Compare →

google

$1.25/1M

Google: Gemini 2.5 Pro Preview 05-06

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, co...

📝 1,048,576 ctx Compare →

arcee-ai

$0.90/1M

Arcee AI: Maestro Reasoning

Maestro Reasoning is Arcee's flagship analysis model: a 32 B‑parameter derivative of Qwe...

📝 131,072 ctx Compare →

arcee-ai

$0.75/1M

Arcee AI: Virtuoso Large

Virtuoso‑Large is Arcee's top‑tier general‑purpose LLM at 72 B parameters, tuned to ...

📝 131,072 ctx Compare →

qwen

$0.09/1M

Qwen: Qwen3 30B A3B

Qwen3, the latest generation in the Qwen large language model series, features both dense ...

📝 131,072 ctx Compare →

qwen

$0.05/1M

Qwen: Qwen3 8B

Qwen3-8B is a dense 8.2B parameter causal language model from the Qwen3 series, designed f...

📝 131,072 ctx Compare →

qwen

$0.10/1M

Qwen: Qwen3 14B

Qwen3-14B is a dense 14.8B parameter causal language model from the Qwen3 series, designed...

📝 131,702 ctx Compare →

qwen

$0.08/1M

Qwen: Qwen3 32B

Qwen3-32B is a dense 32.8B parameter causal language model from the Qwen3 series, optimize...

📝 131,072 ctx Compare →

qwen

$0.46/1M

Qwen: Qwen3 235B A22B

Qwen3-235B-A22B is a 235B parameter mixture-of-experts (MoE) model developed by Qwen, acti...

📝 131,072 ctx Compare →

openai

$1.10/1M

OpenAI: o4 Mini High

OpenAI o4-mini-high is the same model as [o4-mini](/openai/o4-mini) with reasoning_effort ...

📝 200,000 ctx Compare →

openai

$2.00/1M

OpenAI: o3

o3 is a well-rounded and powerful model across domains. It sets a new standard for math, s...

📝 200,000 ctx Compare →

openai

$1.10/1M

OpenAI: o4 Mini

OpenAI o4-mini is a compact reasoning model in the o-series, optimized for fast, cost-effi...

📝 200,000 ctx Compare →

openai

$2.00/1M

OpenAI: GPT-4.1

GPT-4.1 is a flagship large language model optimized for advanced instruction following, r...

📝 1,047,576 ctx Compare →

openai

$150.00/1M

OpenAI: o1-pro

The o1 series of models are trained with reinforcement learning to think before they answe...

📝 200,000 ctx Compare →

mistralai

$0.35/1M

Mistral: Mistral Small 3.1 24B

Mistral Small 3.1 24B Instruct is an upgraded variant of Mistral Small 3 (2501), featuring...

📝 128,000 ctx Compare →

google

$0.04/1M

Google: Gemma 3 4B

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It ha...

📝 131,072 ctx Compare →

google

$0.04/1M

Google: Gemma 3 12B

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It ha...

📝 131,072 ctx Compare →

google

$0.08/1M

Google: Gemma 3 27B

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It ha...

📝 131,072 ctx Compare →

perplexity

$2.00/1M

Perplexity: Sonar Reasoning Pro

Note: Sonar Pro pricing includes Perplexity search pricing. See [details here](https://doc...

📝 128,000 ctx Compare →

perplexity

$3.00/1M

Perplexity: Sonar Pro

Note: Sonar Pro pricing includes Perplexity search pricing. See [details here](https://doc...

📝 200,000 ctx Compare →

perplexity

$2.00/1M

Perplexity: Sonar Deep Research

Sonar Deep Research is a research-focused model designed for multi-step retrieval, synthes...

📝 128,000 ctx Compare →

openai

$1.10/1M

OpenAI: o3 Mini High

OpenAI o3-mini-high is the same model as [o3-mini](/openai/o3-mini) with reasoning_effort ...

📝 200,000 ctx Compare →

aion-labs

$4.00/1M

AionLabs: Aion-1.0

Aion-1.0 is a multi-model system designed for high performance across various tasks, inclu...

📝 131,072 ctx Compare →

aion-labs

$0.70/1M

AionLabs: Aion-1.0-Mini

Aion-1.0-Mini 32B parameter model is a distilled version of the DeepSeek-R1 model, designe...

📝 131,072 ctx Compare →

openai

$1.10/1M

OpenAI: o3 Mini

OpenAI o3-mini is a cost-efficient language model optimized for STEM reasoning tasks, part...

📝 200,000 ctx Compare →

deepseek

$0.70/1M

DeepSeek: R1

DeepSeek R1 is here: Performance on par with [OpenAI o1](/openai/o1), but open-sourced and...

📝 163,840 ctx Compare →

microsoft

$0.07/1M

Microsoft: Phi 4

[Microsoft Research](/microsoft) Phi-4 is designed to perform well in complex reasoning ta...

📝 16,384 ctx Compare →

openai

$15.00/1M

OpenAI: o1

The latest and strongest model family from OpenAI, o1 is designed to spend more time think...

📝 200,000 ctx Compare →

cohere

$0.04/1M

Cohere: Command R7B (12-2024)

Command R7B (12-2024) is a small, fast update of the Command R+ model, delivered in Decemb...

📝 128,000 ctx Compare →

mistralai

$2.00/1M

Mistral Large 2407

This is Mistral AI's flagship model, Mistral Large 2 (version mistral-large-2407). It's a ...

📝 131,072 ctx Compare →

qwen

$0.66/1M

Qwen2.5 Coder 32B Instruct

Qwen2.5-Coder is the latest series of Code-Specific Qwen large language models (formerly k...

📝 128,000 ctx Compare →

meta-llama

Free/1M

Meta: Llama 3.2 3B Instruct (free)

Llama 3.2 3B is a 3-billion-parameter multilingual large language model, optimized for adv...

📝 131,072 ctx Compare →

meta-llama

$0.05/1M

Meta: Llama 3.2 3B Instruct

Llama 3.2 3B is a 3-billion-parameter multilingual large language model, optimized for adv...

📝 131,072 ctx Compare →

cohere

$0.15/1M

Cohere: Command R (08-2024)

command-r-08-2024 is an update of the [Command R](/models/cohere/command-r) with improved ...

📝 128,000 ctx Compare →

nousresearch

$0.30/1M

Nous: Hermes 3 70B Instruct

Hermes 3 is a generalist language model with many improvements over [Hermes 2](/models/nou...

📝 131,072 ctx Compare →

nousresearch

Free/1M

Nous: Hermes 3 405B Instruct (free)

Hermes 3 is a generalist language model with many improvements over Hermes 2, including ad...

📝 131,072 ctx Compare →

nousresearch

$1.00/1M

Nous: Hermes 3 405B Instruct

Hermes 3 is a generalist language model with many improvements over Hermes 2, including ad...

📝 131,072 ctx Compare →

sao10k

$0.04/1M

Sao10K: Llama 3 8B Lunaris

Lunaris 8B is a versatile generalist and roleplaying model based on Llama 3. It's a strate...

📝 8,192 ctx Compare →

mistralai

$2.00/1M

Mistral Large

This is Mistral AI's flagship model, Mistral Large 2 (version `mistral-large-2407`). It's ...

📝 128,000 ctx Compare →

openai

$30.00/1M

OpenAI: GPT-4

OpenAI's flagship model, GPT-4 is a large-scale multimodal language model capable of solvi...

📝 8,191 ctx Compare →

Reasoning Models

NVIDIA: Nemotron 3 Ultra (free)

NVIDIA: Nemotron 3 Ultra

Anthropic: Claude Opus 4.8

Google: Gemini 3.5 Flash

Perceptron: Perceptron Mk1

xAI: Grok 4.3

Poolside: Laguna XS.2 (free)

Poolside: Laguna M.1 (free)

OpenAI: GPT-5.5 Pro

OpenAI: GPT-5.5

DeepSeek: DeepSeek V4 Pro

Tencent: Hy3 preview

OpenAI: GPT-5.4 Image 2

Google: Gemma 4 31B (free)

Google: Gemma 4 31B

Arcee AI: Trinity Large Thinking

xAI: Grok 4.20

OpenAI: GPT-5.4 Mini

Mistral: Mistral Small 4

Qwen: Qwen3.5-9B

OpenAI: GPT-5.4 Pro

Inception: Mercury 2

ByteDance Seed: Seed-2.0-Mini

OpenAI: GPT-5.3-Codex

Google: Gemini 3.1 Pro Preview

Qwen: Qwen3 Max Thinking

LiquidAI: LFM2.5-1.2B-Thinking (free)

Z.ai: GLM 4.7

Google: Gemini 3 Flash Preview

OpenAI: GPT-5.2 Chat

OpenAI: GPT-5.2 Pro

OpenAI: GPT-5.2

Z.ai: GLM 4.6V

EssentialAI: Rnj 1 Instruct

OpenAI: GPT-5.1-Codex-Max

Amazon: Nova 2 Lite

Arcee AI: Trinity Mini

DeepSeek: DeepSeek V3.2

Anthropic: Claude Opus 4.5

AllenAI: Olmo 3 32B Think

Google: Nano Banana Pro (Gemini 3 Pro Image Preview)

OpenAI: GPT-5.1

OpenAI: GPT-5.1 Chat

MoonshotAI: Kimi K2 Thinking

Amazon: Nova Premier 1.0

Perplexity: Sonar Pro Search

OpenAI: gpt-oss-safeguard-20b

NVIDIA: Nemotron Nano 12B 2 VL (free)

MiniMax: MiniMax M2

Qwen: Qwen3 VL 32B Instruct

Microsoft: Phi 4 Mini Instruct

Qwen: Qwen3 VL 8B Thinking

Qwen: Qwen3 VL 8B Instruct

OpenAI: GPT-5 Image

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5

Qwen: Qwen3 VL 30B A3B Thinking

OpenAI: GPT-5 Pro

Google: Gemini 2.5 Flash Lite Preview 09-2025

Qwen: Qwen3 VL 235B A22B Thinking

Qwen: Qwen3 Max

Qwen: Qwen3 Next 80B A3B Thinking

Qwen: Qwen3 Next 80B A3B Instruct (free)

Qwen: Qwen3 Next 80B A3B Instruct

Qwen: Qwen Plus 0728 (thinking)

Qwen: Qwen Plus 0728

NVIDIA: Nemotron Nano 9B V2 (free)

NVIDIA: Nemotron Nano 9B V2

Qwen: Qwen3 30B A3B Thinking 2507

Nous: Hermes 4 70B

Nous: Hermes 4 405B

DeepSeek: DeepSeek V3.1

OpenAI: GPT-5

OpenAI: GPT-5 Mini

OpenAI: GPT-5 Nano

OpenAI: gpt-oss-120b (free)

OpenAI: gpt-oss-120b

Anthropic: Claude Opus 4.1

Qwen: Qwen3 235B A22B Thinking 2507

Qwen: Qwen3 Coder 480B A35B (free)