meituan
Meituan: LongCat Flash Chat
LongCat-Flash-Chat is a large-scale Mixture-of-Experts (MoE) model with 560B total parameters, of which 18.6B–31.3B (≈27B on average) are dynamically activated per input. It introduces a shortcut-connected MoE design to reduce...
Input Cost: $0.20 per 1M tokens
Output Cost: $0.80 per 1M tokens
Context Window: 131,072 tokens
Developer ID: meituan/longcat-flash-chat
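As a rough illustration of how the listed per-token prices translate into a per-request cost, here is a minimal sketch. The constants are copied from the listing above; the function name `estimate_cost_usd` is hypothetical, and the check that prompt and completion tokens together fit inside the 131,072-token context window is an assumption about how the limit is applied, not something the listing states.

```python
# Prices and context limit copied from the listing above.
INPUT_USD_PER_M = 0.20      # USD per 1M input tokens
OUTPUT_USD_PER_M = 0.80     # USD per 1M output tokens
CONTEXT_WINDOW = 131_072    # tokens

TOKENS_PER_M = 1_000_000


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in USD (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # Assumption: input and output tokens share a single context window.
    if input_tokens + output_tokens > CONTEXT_WINDOW:
        raise ValueError("request exceeds the listed context window")
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / TOKENS_PER_M


if __name__ == "__main__":
    # 100,000 prompt tokens and 2,000 completion tokens.
    print(f"${estimate_cost_usd(100_000, 2_000):.4f}")
```

With these prices, a request with 100,000 input tokens and 2,000 output tokens comes to roughly $0.0216. Output tokens cost four times as much as input tokens, so long completions dominate the bill faster than long prompts do.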
Related Models
google
$0.13/1M
Google: Gemma 4 26B A4B
Gemma 4 26B A4B IT is an instruction-tuned Mixture-of-Experts (MoE) model from Google Deep...
google
$0.14/1M
Google: Gemma 4 31B
Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and...
qwen
Free/1M
Qwen: Qwen3.6 Plus (free)
Qwen 3.6 Plus builds on a hybrid architecture that combines efficient linear attention wit...