Reasoning Models
Google: Gemma 4 31B
Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and...
Arcee AI: Trinity Large Thinking
Trinity Large Thinking is a powerful open source reasoning model from the team at Arcee AI...
OpenAI: GPT-5.4 Mini
GPT-5.4 mini brings the core capabilities of GPT-5.4 to a faster, more efficient model opt...
Mistral: Mistral Small 4
Mistral Small 4 is the next major release in the Mistral Small family, unifying the capabi...
Qwen: Qwen3.5-9B
Qwen3.5-9B is a multimodal foundation model from the Qwen3.5 family, designed to deliver s...
OpenAI: GPT-5.4 Pro
GPT-5.4 Pro is OpenAI's most advanced model, building on GPT-5.4's unified architecture wi...
Inception: Mercury 2
Mercury 2 is an extremely fast reasoning LLM, and the first reasoning diffusion LLM (dLLM)...
ByteDance Seed: Seed-2.0-Mini
Seed-2.0-mini targets latency-sensitive, high-concurrency, and cost-sensitive scenarios, e...
OpenAI: GPT-5.3-Codex
GPT-5.3-Codex is OpenAIβs most advanced agentic coding model, combining the frontier sof...
Google: Gemini 3.1 Pro Preview
Gemini 3.1 Pro Preview is Googleβs frontier reasoning model, delivering enhanced softwar...
Qwen: Qwen3 Max Thinking
Qwen3-Max-Thinking is the flagship reasoning model in the Qwen3 series, designed for high-...
LiquidAI: LFM2.5-1.2B-Thinking (free)
LFM2.5-1.2B-Thinking is a lightweight reasoning-focused model optimized for agentic tasks,...
Z.ai: GLM 4.7
GLM-4.7 is Z.aiβs latest flagship model, featuring upgrades in two key areas: enhanced p...
Google: Gemini 3 Flash Preview
Gemini 3 Flash Preview is a high speed, high value thinking model designed for agentic wor...
OpenAI: GPT-5.2 Chat
GPT-5.2 Chat (AKA Instant) is the fast, lightweight member of the 5.2 family, optimized fo...
OpenAI: GPT-5.2 Pro
GPT-5.2 Pro is OpenAIβs most advanced model, offering major improvements in agentic codi...
OpenAI: GPT-5.2
GPT-5.2 is the latest frontier-grade model in the GPT-5 series, offering stronger agentic ...
Z.ai: GLM 4.6V
GLM-4.6V is a large multimodal model designed for high-fidelity visual understanding and l...
EssentialAI: Rnj 1 Instruct
Rnj-1 is an 8B-parameter, dense, open-weight model family developed by Essential AI and tr...
OpenAI: GPT-5.1-Codex-Max
GPT-5.1-Codex-Max is OpenAIβs latest agentic coding model, designed for long-running, hi...
Amazon: Nova 2 Lite
Nova 2 Lite is a fast, cost-effective reasoning model for everyday workloads that can proc...
Arcee AI: Trinity Mini (free)
Trinity Mini is a 26B-parameter (3B active) sparse mixture-of-experts language model featu...
Arcee AI: Trinity Mini
Trinity Mini is a 26B-parameter (3B active) sparse mixture-of-experts language model featu...
DeepSeek: DeepSeek V3.2 Speciale
DeepSeek-V3.2-Speciale is a high-compute variant of DeepSeek-V3.2 optimized for maximum re...
DeepSeek: DeepSeek V3.2
DeepSeek-V3.2 is a large language model designed to harmonize high computational efficienc...
Anthropic: Claude Opus 4.5
Claude Opus 4.5 is Anthropicβs frontier reasoning model optimized for complex software e...
AllenAI: Olmo 3 32B Think
Olmo 3 32B Think is a large-scale, 32-billion-parameter model purpose-built for deep reaso...
Google: Nano Banana Pro (Gemini 3 Pro Image Preview)
Nano Banana Pro is Googleβs most advanced image-generation and editing model, built on G...
xAI: Grok 4.1 Fast
Grok 4.1 Fast is xAI's best agentic tool calling model that shines in real-world use cases...
OpenAI: GPT-5.1
GPT-5.1 is the latest frontier-grade model in the GPT-5 series, offering stronger general-...
OpenAI: GPT-5.1 Chat
GPT-5.1 Chat (AKA Instant is the fast, lightweight member of the 5.1 family, optimized for...
MoonshotAI: Kimi K2 Thinking
Kimi K2 Thinking is Moonshot AIβs most advanced open reasoning model to date, extending ...
Amazon: Nova Premier 1.0
Amazon Nova Premier is the most capable of Amazonβs multimodal models for complex reason...
Perplexity: Sonar Pro Search
Exclusively available on the OpenRouter API, Sonar Pro's new Pro Search mode is Perplexity...
OpenAI: gpt-oss-safeguard-20b
gpt-oss-safeguard-20b is a safety reasoning model from OpenAI built upon gpt-oss-20b. This...
NVIDIA: Nemotron Nano 12B 2 VL (free)
NVIDIA Nemotron Nano 2 VL is a 12-billion-parameter open multimodal reasoning model design...
NVIDIA: Nemotron Nano 12B 2 VL
NVIDIA Nemotron Nano 2 VL is a 12-billion-parameter open multimodal reasoning model design...
MiniMax: MiniMax M2
MiniMax-M2 is a compact, high-efficiency large language model optimized for end-to-end cod...
Qwen: Qwen3 VL 32B Instruct
Qwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-...
Qwen: Qwen3 VL 8B Thinking
Qwen3-VL-8B-Thinking is the reasoning-optimized variant of the Qwen3-VL-8B multimodal mode...
Qwen: Qwen3 VL 8B Instruct
Qwen3-VL-8B-Instruct is a multimodal vision-language model from the Qwen3-VL series, built...
OpenAI: GPT-5 Image
[GPT-5](https://openrouter.ai/openai/gpt-5) Image combines OpenAI's GPT-5 model with state...
NVIDIA: Llama 3.3 Nemotron Super 49B V1.5
Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model...
Baidu: ERNIE 4.5 21B A3B Thinking
ERNIE-4.5-21B-A3B-Thinking is Baidu's upgraded lightweight MoE model, refined to boost rea...
Qwen: Qwen3 VL 30B A3B Thinking
Qwen3-VL-30B-A3B-Thinking is a multimodal model that unifies strong text generation with v...
OpenAI: GPT-5 Pro
GPT-5 Pro is OpenAIβs most advanced model, offering major improvements in reasoning, cod...
Google: Gemini 2.5 Flash Lite Preview 09-2025
Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized...
Qwen: Qwen3 VL 235B A22B Thinking
Qwen3-VL-235B-A22B Thinking is a multimodal model that unifies strong text generation with...
Qwen: Qwen3 Max
Qwen3-Max is an updated release built on the Qwen3 series, offering major improvements in ...
xAI: Grok 4 Fast
Grok 4 Fast is xAI's latest multimodal model with SOTA cost-efficiency and a 2M token cont...
Qwen: Qwen3 Next 80B A3B Thinking
Qwen3-Next-80B-A3B-Thinking is a reasoning-first chat model in the Qwen3-Next line that ou...
Qwen: Qwen3 Next 80B A3B Instruct (free)
Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series op...
Qwen: Qwen3 Next 80B A3B Instruct
Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series op...
Qwen: Qwen Plus 0728 (thinking)
Qwen Plus 0728, based on the Qwen3 foundation model, is a 1 million context hybrid reasoni...
Qwen: Qwen Plus 0728
Qwen Plus 0728, based on the Qwen3 foundation model, is a 1 million context hybrid reasoni...
NVIDIA: Nemotron Nano 9B V2 (free)
NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA,...
NVIDIA: Nemotron Nano 9B V2
NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA,...
Qwen: Qwen3 30B A3B Thinking 2507
Qwen3-30B-A3B-Thinking-2507 is a 30B parameter Mixture-of-Experts reasoning model optimize...
xAI: Grok Code Fast 1
Grok Code Fast 1 is a speedy and economical reasoning model that excels at agentic coding....
Nous: Hermes 4 70B
Hermes 4 70B is a hybrid reasoning model from Nous Research, built on Meta-Llama-3.1-70B. ...
Nous: Hermes 4 405B
Hermes 4 is a large-scale reasoning model built on Meta-Llama-3.1-405B and released by Nou...
DeepSeek: DeepSeek V3.1
DeepSeek-V3.1 is a large hybrid reasoning model (671B parameters, 37B active) that support...
OpenAI: GPT-5
GPT-5 is OpenAIβs most advanced model, offering major improvements in reasoning, code qu...
OpenAI: GPT-5 Mini
GPT-5 Mini is a compact version of GPT-5, designed to handle lighter-weight reasoning task...
OpenAI: GPT-5 Nano
GPT-5-Nano is the smallest and fastest variant in the GPT-5 system, optimized for develope...
OpenAI: gpt-oss-120b (free)
gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model fro...
OpenAI: gpt-oss-120b
gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model fro...
Anthropic: Claude Opus 4.1
Claude Opus 4.1 is an updated version of Anthropicβs flagship model, offering improved p...
Qwen: Qwen3 235B A22B Thinking 2507
Qwen3-235B-A22B-Thinking-2507 is a high-performance, open-weight Mixture-of-Experts (MoE) ...
Qwen: Qwen3 Coder 480B A35B (free)
Qwen3-Coder-480B-A35B-Instruct is a Mixture-of-Experts (MoE) code generation model develop...
Qwen: Qwen3 Coder 480B A35B
Qwen3-Coder-480B-A35B-Instruct is a Mixture-of-Experts (MoE) code generation model develop...
Google: Gemini 2.5 Flash Lite
Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized...
Mistral: Devstral Medium
Devstral Medium is a high-performance code generation and agentic reasoning model develope...
xAI: Grok 4
Grok 4 is xAI's latest reasoning model with a 256k context window. It supports parallel to...
Tencent: Hunyuan A13B Instruct
Hunyuan-A13B is a 13B active parameter Mixture-of-Experts (MoE) language model developed b...
MiniMax: MiniMax M1
MiniMax-M1 is a large-scale, open-weight reasoning model designed for extended context and...
Google: Gemini 2.5 Flash
Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for a...
Google: Gemini 2.5 Pro
Gemini 2.5 Pro is Googleβs state-of-the-art AI model designed for advanced reasoning, co...
OpenAI: o3 Pro
The o-series of models are trained with reinforcement learning to think before they answer...
xAI: Grok 3 Mini
A lightweight model that thinks before responding. Fast, smart, and great for logic-based ...
Google: Gemini 2.5 Pro Preview 06-05
Gemini 2.5 Pro is Googleβs state-of-the-art AI model designed for advanced reasoning, co...
DeepSeek: R1 0528
May 28th update to the [original DeepSeek R1](/deepseek/deepseek-r1) Performance on par wi...
Anthropic: Claude Sonnet 4
Claude Sonnet 4 significantly enhances the capabilities of its predecessor, Sonnet 3.7, ex...
Mistral: Mistral Medium 3
Mistral Medium 3 is a high-performance enterprise-grade language model designed to deliver...
Google: Gemini 2.5 Pro Preview 05-06
Gemini 2.5 Pro is Googleβs state-of-the-art AI model designed for advanced reasoning, co...
Arcee AI: Maestro Reasoning
Maestro Reasoning is Arcee's flagship analysis model: a 32 Bβparameter derivative of Qwe...
Arcee AI: Virtuoso Large
VirtuosoβLarge is Arcee's topβtier generalβpurpose LLM at 72 B parameters, tuned to ...
Qwen: Qwen3 30B A3B
Qwen3, the latest generation in the Qwen large language model series, features both dense ...
Qwen: Qwen3 8B
Qwen3-8B is a dense 8.2B parameter causal language model from the Qwen3 series, designed f...
Qwen: Qwen3 14B
Qwen3-14B is a dense 14.8B parameter causal language model from the Qwen3 series, designed...
Qwen: Qwen3 32B
Qwen3-32B is a dense 32.8B parameter causal language model from the Qwen3 series, optimize...
Qwen: Qwen3 235B A22B
Qwen3-235B-A22B is a 235B parameter mixture-of-experts (MoE) model developed by Qwen, acti...
OpenAI: o4 Mini High
OpenAI o4-mini-high is the same model as [o4-mini](/openai/o4-mini) with reasoning_effort ...
OpenAI: o3
o3 is a well-rounded and powerful model across domains. It sets a new standard for math, s...
OpenAI: o4 Mini
OpenAI o4-mini is a compact reasoning model in the o-series, optimized for fast, cost-effi...
Qwen: Qwen2.5 Coder 7B Instruct
Qwen2.5-Coder-7B-Instruct is a 7B parameter instruction-tuned language model optimized for...
OpenAI: GPT-4.1
GPT-4.1 is a flagship large language model optimized for advanced instruction following, r...
xAI: Grok 3 Mini Beta
Grok 3 Mini is a lightweight, smaller thinking model. Unlike traditional models that gener...
NVIDIA: Llama 3.1 Nemotron Ultra 253B v1
Llama-3.1-Nemotron-Ultra-253B-v1 is a large language model (LLM) optimized for advanced re...
Qwen: Qwen2.5 VL 32B Instruct
Qwen2.5-VL-32B is a multimodal vision-language model fine-tuned through reinforcement lear...
OpenAI: o1-pro
The o1 series of models are trained with reinforcement learning to think before they answe...
Mistral: Mistral Small 3.1 24B
Mistral Small 3.1 24B Instruct is an upgraded variant of Mistral Small 3 (2501), featuring...
AllenAI: Olmo 2 32B Instruct
OLMo-2 32B Instruct is a supervised instruction-finetuned variant of the OLMo-2 32B March ...
Google: Gemma 3 4B (free)
Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It ha...
Google: Gemma 3 4B
Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It ha...
Google: Gemma 3 12B (free)
Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It ha...
Google: Gemma 3 12B
Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It ha...
Google: Gemma 3 27B (free)
Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It ha...
Google: Gemma 3 27B
Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It ha...
Perplexity: Sonar Reasoning Pro
Note: Sonar Pro pricing includes Perplexity search pricing. See [details here](https://doc...
Perplexity: Sonar Pro
Note: Sonar Pro pricing includes Perplexity search pricing. See [details here](https://doc...
Perplexity: Sonar Deep Research
Sonar Deep Research is a research-focused model designed for multi-step retrieval, synthes...
Qwen: QwQ 32B
QwQ is the reasoning model of the Qwen series. Compared with conventional instruction-tune...
Anthropic: Claude 3.7 Sonnet
Claude 3.7 Sonnet is an advanced large language model with improved reasoning, coding, and...
Anthropic: Claude 3.7 Sonnet (thinking)
Claude 3.7 Sonnet is an advanced large language model with improved reasoning, coding, and...
OpenAI: o3 Mini High
OpenAI o3-mini-high is the same model as [o3-mini](/openai/o3-mini) with reasoning_effort ...
AionLabs: Aion-1.0
Aion-1.0 is a multi-model system designed for high performance across various tasks, inclu...
AionLabs: Aion-1.0-Mini
Aion-1.0-Mini 32B parameter model is a distilled version of the DeepSeek-R1 model, designe...
OpenAI: o3 Mini
OpenAI o3-mini is a cost-efficient language model optimized for STEM reasoning tasks, part...
DeepSeek: R1
DeepSeek R1 is here: Performance on par with [OpenAI o1](/openai/o1), but open-sourced and...
Microsoft: Phi 4
[Microsoft Research](/microsoft) Phi-4 is designed to perform well in complex reasoning ta...
OpenAI: o1
The latest and strongest model family from OpenAI, o1 is designed to spend more time think...
Cohere: Command R7B (12-2024)
Command R7B (12-2024) is a small, fast update of the Command R+ model, delivered in Decemb...
Mistral Large 2407
This is Mistral AI's flagship model, Mistral Large 2 (version mistral-large-2407). It's a ...
Qwen2.5 Coder 32B Instruct
Qwen2.5-Coder is the latest series of Code-Specific Qwen large language models (formerly k...
Meta: Llama 3.2 3B Instruct (free)
Llama 3.2 3B is a 3-billion-parameter multilingual large language model, optimized for adv...
Meta: Llama 3.2 3B Instruct
Llama 3.2 3B is a 3-billion-parameter multilingual large language model, optimized for adv...
Cohere: Command R (08-2024)
command-r-08-2024 is an update of the [Command R](/models/cohere/command-r) with improved ...
Nous: Hermes 3 70B Instruct
Hermes 3 is a generalist language model with many improvements over [Hermes 2](/models/nou...
Nous: Hermes 3 405B Instruct (free)
Hermes 3 is a generalist language model with many improvements over Hermes 2, including ad...
Nous: Hermes 3 405B Instruct
Hermes 3 is a generalist language model with many improvements over Hermes 2, including ad...
Sao10K: Llama 3 8B Lunaris
Lunaris 8B is a versatile generalist and roleplaying model based on Llama 3. It's a strate...
Mistral Large
This is Mistral AI's flagship model, Mistral Large 2 (version `mistral-large-2407`). It's ...
OpenAI: GPT-4
OpenAI's flagship model, GPT-4 is a large-scale multimodal language model capable of solvi...