
Qwen: Qwen3 235B A22B

Qwen3-235B-A22B is a 235B-parameter mixture-of-experts (MoE) model developed by Qwen that activates 22B parameters per forward pass. It supports seamless switching between a "thinking" mode for complex reasoning, math, and code tasks and a "non-thinking" mode for efficient general conversation. The model demonstrates strong reasoning ability, multilingual support (100+ languages and dialects), advanced instruction following, and agent tool-calling capabilities. It natively handles a 32K-token context window and extends up to 131K tokens using YaRN-based scaling.
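Most hosts expose the thinking/non-thinking switch per request. A minimal sketch, assuming an OpenAI-compatible endpoint and a provider that forwards Qwen's `enable_thinking` flag through `extra_body`; the base URL, key, and flag name are assumptions to verify against your provider's documentation:

```python
# Sketch: calling Qwen3-235B-A22B via an OpenAI-compatible API.
# base_url and the enable_thinking toggle are assumptions; check your host.
from openai import OpenAI

client = OpenAI(
    base_url="https://openrouter.ai/api/v1",  # assumed OpenAI-compatible host
    api_key="YOUR_API_KEY",
)

# Thinking mode: slower, suited to math, code, and multi-step reasoning.
reasoned = client.chat.completions.create(
    model="qwen/qwen3-235b-a22b",
    messages=[{"role": "user", "content": "Prove that sqrt(2) is irrational."}],
    extra_body={"enable_thinking": True},  # provider-specific toggle (assumption)
)

# Non-thinking mode: faster, cheaper general conversation.
chat = client.chat.completions.create(
    model="qwen/qwen3-235b-a22b",
    messages=[{"role": "user", "content": "Say hello in three languages."}],
    extra_body={"enable_thinking": False},
)

print(reasoned.choices[0].message.content)
print(chat.choices[0].message.content)
```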

Input Cost: $0.20 per 1M tokens
Output Cost: $0.60 per 1M tokens
Context Window: 40,960 tokens
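The 131K figure requires enabling YaRN at serving time when self-hosting the open weights; hosted endpoints handle this for you. A sketch, assuming vLLM and its `rope_scaling` engine argument; the factor-4.0 YaRN config follows the pattern published in Qwen's model card, but treat the exact argument name and checkpoint name as assumptions to verify:

```python
# Sketch: extending the native 32,768-token window toward 131,072 with
# YaRN when self-hosting. Passing rope_scaling as a vLLM engine argument
# is an assumption; verify against the vLLM version you deploy.
from vllm import LLM, SamplingParams

llm = LLM(
    model="Qwen/Qwen3-235B-A22B",  # open-weights checkpoint (assumed name)
    rope_scaling={
        "rope_type": "yarn",
        "factor": 4.0,  # 32,768 x 4 = 131,072
        "original_max_position_embeddings": 32768,
    },
    max_model_len=131072,
)

outputs = llm.generate(
    ["Summarize the following long document: ..."],
    SamplingParams(max_tokens=512),
)
print(outputs[0].outputs[0].text)
```

Static YaRN applies the same scaling factor to every request, so Qwen's model cards advise enabling it only when prompts actually exceed the native window.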
Developer ID: qwen/qwen3-235b-a22b
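Pricing is linear in token counts, so per-request cost is easy to estimate from the rates above. A small sketch; the token counts are illustrative only:

```python
# Estimate request cost from the listed per-1M-token rates.
INPUT_RATE = 0.20 / 1_000_000   # USD per input token
OUTPUT_RATE = 0.60 / 1_000_000  # USD per output token

def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Linear cost model: tokens times per-token rate, summed over directions."""
    return input_tokens * INPUT_RATE + output_tokens * OUTPUT_RATE

# Example: 8,000-token prompt with a 1,500-token reply (illustrative numbers).
print(f"${estimate_cost(8_000, 1_500):.6f}")  # -> $0.002500
```

Note that in thinking mode the model's reasoning tokens are typically billed as output tokens, so reasoning-heavy requests skew toward the higher output rate.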

Related Models

Qwen: Qwen3 30B A3B Thinking 2507 ($0.05/1M, 32,768-token context)

Qwen3-30B-A3B-Thinking-2507 is a 30B-parameter mixture-of-experts reasoning model optimized...

Qwen: Qwen3 30B A3B Instruct 2507 ($0.08/1M, 262,144-token context)

Qwen3-30B-A3B-Instruct-2507 is a 30.5B-parameter mixture-of-experts language model from Qwen...

Qwen: Qwen3 Coder 480B A35B (exacto) ($0.22/1M, 262,144-token context)

Qwen3-Coder-480B-A35B-Instruct is a mixture-of-experts (MoE) code generation model developed...