deepseek
DeepSeek: R1 Distill Llama 70B
DeepSeek R1 Distill Llama 70B is a distilled large language model based on [Llama-3.3-70B-Instruct](/meta-llama/llama-3.3-70b-instruct) and fine-tuned on outputs from [DeepSeek R1](/deepseek/deepseek-r1). The model combines advanced distillation techniques to achieve high performance across...
Input Cost: $0.70 per 1M tokens
Output Cost: $0.80 per 1M tokens
Context Window: 131,072 tokens
Developer ID: deepseek/deepseek-r1-distill-llama-70b
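
As a rough illustration of how the per-token prices above translate into a per-request cost, here is a minimal Python sketch. The helper name `estimate_cost` is hypothetical and not part of any SDK. The check that input and output tokens together fit in the context window is an assumption about how the limit is applied, not something the listing states.

```python
# Figures copied from the listing above (USD per 1M tokens).
INPUT_USD_PER_M = 0.70
OUTPUT_USD_PER_M = 0.80
CONTEXT_WINDOW = 131_072  # tokens


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # Assumption: prompt and completion share the same context window.
    if input_tokens + output_tokens > CONTEXT_WINDOW:
        raise ValueError("request exceeds the context window")
    input_cost = input_tokens * INPUT_USD_PER_M / 1_000_000
    output_cost = output_tokens * OUTPUT_USD_PER_M / 1_000_000
    return input_cost + output_cost


if __name__ == "__main__":
    # 100,000 prompt tokens plus 10,000 completion tokens.
    print(f"${estimate_cost(100_000, 10_000):.4f}")
```

Actual billing may differ, for example through rounding, caching discounts, or provider-specific token counting, so treat the result as an estimate only.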
Related Models
DeepSeek: DeepSeek V3.1 (deepseek, $0.21/1M)
DeepSeek-V3.1 is a large hybrid reasoning model (671B parameters, 37B active) that support...
DeepSeek: DeepSeek V3.2 Exp (deepseek, $0.27/1M)
DeepSeek-V3.2-Exp is an experimental large language model released by DeepSeek as an inter...
DeepSeek: DeepSeek V4 Pro (deepseek, $0.44/1M)
DeepSeek V4 Pro is a large-scale Mixture-of-Experts model from DeepSeek with 1.6T total pa...