Pricing & Benchmarks for
300+ AI Models

The professional kit for developers and enterprises to compare AI model costs, context windows, and performance benchmarks.

Start Comparing

Featured Models

View All →

nvidia

Free/1M

NVIDIA: Nemotron 3.5 Content Safety (free)

NVIDIA Nemotron 3.5 Content Safety is a compact 4B-parameter multimodal guardrail model fr...

📝 128,000 ctx Compare →

nvidia

Free/1M

NVIDIA: Nemotron 3 Ultra (free)

NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA,...

📝 1,000,000 ctx Compare →

nvidia

$0.50/1M

NVIDIA: Nemotron 3 Ultra

NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA,...

📝 1,000,000 ctx Compare →

qwen

$0.40/1M

Qwen: Qwen3.7 Plus

Qwen3.7-Plus is a cost-effective model in Alibaba's Qwen3.7 series. It supports text and i...

📝 1,000,000 ctx Compare →

minimax

$0.30/1M

MiniMax: MiniMax M3

MiniMax-M3 is a multimodal foundation model from MiniMax. It supports text, image, and vid...

📝 1,048,576 ctx Compare →

stepfun

$0.20/1M

StepFun: Step 3.7 Flash

Step 3.7 Flash is StepFun's latest high-efficiency multimodal Mixture-of-Experts model. It...

📝 256,000 ctx Compare →

anthropic

$10.00/1M

Anthropic: Claude Opus 4.8 (Fast)

Fast-mode variant of [Opus 4.8](/anthropic/claude-opus-4.8) - identical capabilities with ...

📝 1,000,000 ctx Compare →

anthropic

$5.00/1M

Anthropic: Claude Opus 4.8

Claude Opus 4.8 is Anthropic's most capable generally available model in the Opus family. ...

📝 1,000,000 ctx Compare →

qwen

$1.25/1M

Qwen: Qwen3.7 Max

Qwen3.7-Max is the flagship model in Alibaba's Qwen3.7 series. It supports text input and ...

📝 1,000,000 ctx Compare →

x-ai

$1.00/1M

xAI: Grok Build 0.1

Grok Build 0.1 is xAI’s fast coding model trained specifically for agentic software engi...

📝 256,000 ctx Compare →

google

$1.50/1M

Google: Gemini 3.5 Flash

Gemini 3.5 Flash is Google's high-efficiency multimodal model, bringing near-Pro level cod...

📝 1,048,576 ctx Compare →

anthropic

$30.00/1M

Anthropic: Claude Opus 4.7 (Fast)

Fast-mode variant of [Opus 4.7](/anthropic/claude-opus-4.7) - identical capabilities with ...

📝 1,000,000 ctx Compare →

Browse by Provider

📦 Ai21 1 Models 📦 Aion-labs 4 Models 📦 Allenai 1 Models 📦 Amazon 5 Models 📦 Anthracite-org 1 Models 🛡️ Anthropic 15 Models 📦 Arcee-ai 6 Models 📦 Baidu 2 Models 📦 Bytedance 1 Models 📦 Bytedance-seed 4 Models 📦 Cognitivecomputations 1 Models 🏢 Cohere 4 Models 📦 Deepcogito 1 Models 📦 Deepseek 12 Models 📦 Essentialai 1 Models 🔍 Google 26 Models 📦 Gryphe 1 Models 📦 Ibm-granite 2 Models 📦 Inception 1 Models 📦 Inclusionai 3 Models 📦 Inflection 2 Models 📦 Kwaipilot 1 Models 📦 Liquid 3 Models 📦 Mancer 1 Models 📦 Meta-llama 14 Models 📦 Microsoft 3 Models 📦 Minimax 8 Models 📦 Mistralai 19 Models 📦 Moonshotai 6 Models 📦 Morph 2 Models 📦 Nex-agi 1 Models 📦 Nousresearch 5 Models 📦 Nvidia 12 Models ⚡ Openai 63 Models 📦 Openrouter 6 Models 📦 Perceptron 1 Models 📦 Perplexity 5 Models 📦 Poolside 2 Models 📦 Prime-intellect 1 Models 📦 Qwen 49 Models 📦 Rekaai 2 Models 📦 Relace 2 Models 📦 Sao10k 4 Models 📦 Stepfun 2 Models 📦 Switchpoint 1 Models 📦 Tencent 2 Models 📦 Thedrummer 4 Models 📦 Undi95 1 Models 📦 Upstage 1 Models 📦 Writer 1 Models 📦 X-ai 4 Models 📦 Xiaomi 3 Models 📦 Z-ai 13 Models 📦 ~anthropic 3 Models 📦 ~google 2 Models 📦 ~moonshotai 1 Models 📦 ~openai 2 Models

Frequently Asked Questions

What is "Writer: Palmyra X5" optimized for?

Palmyra X5 is Writer's most advanced model, purpose-built for building and scaling AI agents across the enterprise. It delivers industry-leading speed and efficiency on context windows up to 1 million tokens.

What are the input and output modalities of "OpenAI: GPT Audio"?

"OpenAI: GPT Audio" can process both text and audio as input, and can generate both text and audio as output.

What is the context length of "MiniMax: MiniMax M2-her"?

"MiniMax: MiniMax M2-her" has a context length of 32,768 tokens, making it suitable for multi-turn conversations.

Is there a free model for on-device AI?

Yes, "LiquidAI: LFM2.5-1.2B-Instruct (free)" is a compact, high-performance instruction-tuned model built for fast on-device AI and is free to use.

Browse by Use Case

💻 Coding 41 Models 🧠 Reasoning 129 Models 👁️ Vision 61 Models 📚 Long Context 302 Models

Browse by Use Case

💻 Coding 41 Models 🧠 Reasoning 129 Models 👁️ Vision 61 Models 📚 Long Context 302 Models

Pricing & Benchmarks for300+ AI Models

Featured Models

NVIDIA: Nemotron 3.5 Content Safety (free)

NVIDIA: Nemotron 3 Ultra (free)

NVIDIA: Nemotron 3 Ultra

Qwen: Qwen3.7 Plus

MiniMax: MiniMax M3

StepFun: Step 3.7 Flash

Anthropic: Claude Opus 4.8 (Fast)

Anthropic: Claude Opus 4.8

Qwen: Qwen3.7 Max

xAI: Grok Build 0.1

Google: Gemini 3.5 Flash

Anthropic: Claude Opus 4.7 (Fast)

Browse by Provider

Frequently Asked Questions

On this page:

What is "Writer: Palmyra X5" optimized for?

What are the input and output modalities of "OpenAI: GPT Audio"?

What is the context length of "MiniMax: MiniMax M2-her"?

Is there a free model for on-device AI?

Browse by Use Case

Browse by Use Case

Pricing & Benchmarks for
300+ AI Models