Pricing & Benchmarks for 300+ AI Models
The professional kit for developers and enterprises to compare AI model costs, context windows, and performance benchmarks.
Featured Models
NVIDIA: Nemotron 3.5 Content Safety (free)
NVIDIA Nemotron 3.5 Content Safety is a compact 4B-parameter multimodal guardrail model fr...
NVIDIA: Nemotron 3 Ultra (free)
NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA,...
NVIDIA: Nemotron 3 Ultra
NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA,...
Qwen: Qwen3.7 Plus
Qwen3.7-Plus is a cost-effective model in Alibaba's Qwen3.7 series. It supports text and i...
MiniMax: MiniMax M3
MiniMax-M3 is a multimodal foundation model from MiniMax. It supports text, image, and vid...
StepFun: Step 3.7 Flash
Step 3.7 Flash is StepFun's latest high-efficiency multimodal Mixture-of-Experts model. It...
Anthropic: Claude Opus 4.8 (Fast)
Fast-mode variant of [Opus 4.8](/anthropic/claude-opus-4.8) - identical capabilities with ...
Anthropic: Claude Opus 4.8
Claude Opus 4.8 is Anthropic's most capable generally available model in the Opus family. ...
Qwen: Qwen3.7 Max
Qwen3.7-Max is the flagship model in Alibaba's Qwen3.7 series. It supports text input and ...
xAI: Grok Build 0.1
Grok Build 0.1 is xAI's fast coding model trained specifically for agentic software engi...
Google: Gemini 3.5 Flash
Gemini 3.5 Flash is Google's high-efficiency multimodal model, bringing near-Pro level cod...
Anthropic: Claude Opus 4.7 (Fast)
Fast-mode variant of [Opus 4.7](/anthropic/claude-opus-4.7) - identical capabilities with ...
Frequently Asked Questions
What is "Writer: Palmyra X5" optimized for?
Palmyra X5 is Writer's most advanced model, purpose-built for creating and scaling AI agents across the enterprise. It delivers industry-leading speed and efficiency with a context window of up to 1 million tokens.
What are the input and output modalities of "OpenAI: GPT Audio"?
"OpenAI: GPT Audio" can process both text and audio as input, and can generate both text and audio as output.
What is the context length of "MiniMax: MiniMax M2-her"?
"MiniMax: MiniMax M2-her" has a context length of 32,768 tokens, making it suitable for multi-turn conversations.
Is there a free model for on-device AI?
Yes, "LiquidAI: LFM2.5-1.2B-Instruct (free)" is a compact, high-performance instruction-tuned model built for fast on-device AI and is free to use.