deepseek

DeepSeek: R1 Distill Llama 70B

DeepSeek R1 Distill Llama 70B is a distilled large language model based on [Llama-3.3-70B-Instruct](/meta-llama/llama-3.3-70b-instruct), using outputs from [DeepSeek R1](/deepseek/deepseek-r1). The model combines advanced distillation techniques to achieve high performance across...

Input Cost
$0.70
per 1M tokens
Output Cost
$0.80
per 1M tokens
Context Window
131,072
tokens
Compare vs GPT-4o
Developer ID: deepseek/deepseek-r1-distill-llama-70b

Related Models

deepseek
$0.40/1M

DeepSeek: DeepSeek V3.2 Speciale

DeepSeek-V3.2-Speciale is a high-compute variant of DeepSeek-V3.2 optimized for maximum re...

📝 163,840 ctx Compare →
deepseek
$0.20/1M

DeepSeek: DeepSeek V3 0324

DeepSeek V3, a 685B-parameter, mixture-of-experts model, is the latest iteration of the fl...

📝 163,840 ctx Compare →
deepseek
$0.21/1M

DeepSeek: DeepSeek V3.1 Terminus

DeepSeek-V3.1 Terminus is an update to [DeepSeek V3.1](/deepseek/deepseek-chat-v3.1) that ...

📝 163,840 ctx Compare →