meituan

Meituan: LongCat Flash Chat

LongCat-Flash-Chat is a large-scale Mixture-of-Experts (MoE) model with 560B total parameters, of which 18.6B–31.3B (≈27B on average) are dynamically activated per input. It introduces a shortcut-connected MoE design to reduce...

Input Cost
$0.20
per 1M tokens
Output Cost
$0.80
per 1M tokens
Context Window
131,072
tokens
Compare vs GPT-4o
Developer ID: meituan/longcat-flash-chat

Related Models

google
$0.13/1M

Google: Gemma 4 26B A4B

Gemma 4 26B A4B IT is an instruction-tuned Mixture-of-Experts (MoE) model from Google Deep...

📝 262,144 ctx Compare →
google
$0.14/1M

Google: Gemma 4 31B

Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and...

📝 262,144 ctx Compare →
qwen
Free/1M

Qwen: Qwen3.6 Plus (free)

Qwen 3.6 Plus builds on a hybrid architecture that combines efficient linear attention wit...

📝 1,000,000 ctx Compare →