DeepSeek-V4-Flash-Max
DeepSeek-V4-Flash-Max is the maximum reasoning effort mode of DeepSeek-V4-Flash, a 284B-parameter MoE model with 13B activated parameters and a 1M-token context window. Sharing the V4 series' hybrid a
DeepSeek-V4-Flash-Max is the maximum reasoning effort mode of DeepSeek-V4-Flash, a 284B-parameter MoE model with 13B activated parameters and a 1M-token context window. Sharing the V4 series' hybrid attention architecture (Compressed Sparse Attention combined with Heavily Compressed Attention), Manifold-Constrained Hyper-Connections, and Muon optimizer, V4-Flash-Max delivers reasoning performance comparable to V4-Pro when given a larger thinking budget while operating at a fraction of the parame…
Intelligence
Features
Pros
Cons
Use cases
API inference · Fine-tuning · Benchmarking
AI models used
DeepSeek-V4-Flash-Max
FAQ
How much does DeepSeek-V4-Flash-Max cost?
Free tier
Does DeepSeek-V4-Flash-Max have a free plan?
Limited or no free tier
Is there an API?
Yes