qwen
Qwen: Qwen3 VL 32B Instruct
Qwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-precision understanding and reasoning across text, images, and video. With 32 billion parameters, it combines deep visual perception with advanced text...
Input Cost
$0.10
per 1M tokens
Output Cost
$0.42
per 1M tokens
Context Window
131,072
tokens
Developer ID: qwen/qwen3-vl-32b-instruct
Related Models
qwen
$0.26/1M
Qwen: Qwen3 VL 235B A22B Thinking
Qwen3-VL-235B-A22B Thinking is a multimodal model that unifies strong text generation with...
qwen
$0.07/1M
Qwen: Qwen3.5-Flash
The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that in...
qwen
$0.07/1M
Qwen: Qwen3 235B A22B Instruct 2507
Qwen3-235B-A22B-Instruct-2507 is a multilingual, instruction-tuned mixture-of-experts lang...