DeepSeek-V3.1
DeepSeek-V3.1 is a hybrid model supporting both thinking and non-thinking modes through different chat templates. Built on DeepSeek-V3.1-Base with a two-phase long context extension (32K phase: 630B t
DeepSeek-V3.1 is a hybrid model supporting both thinking and non-thinking modes through different chat templates. Built on DeepSeek-V3.1-Base with a two-phase long context extension (32K phase: 630B tokens, 128K phase: 209B tokens), it features 671B total parameters with 37B activated. Key improvements include smarter tool calling through post-training optimization, higher thinking efficiency achieving comparable quality to DeepSeek-R1-0528 while responding more quickly, and UE8M0 FP8 scale data…
Intelligence
Features
Pros
Cons
Use cases
API inference · Fine-tuning · Benchmarking
AI models used
DeepSeek-V3.1
FAQ
How much does DeepSeek-V3.1 cost?
Free tier
Does DeepSeek-V3.1 have a free plan?
Limited or no free tier
Is there an API?
Yes