DeepSeek-V4-Flash-Max logo

DeepSeek-V4-Flash-Max

DeepSeek-V4-Flash-Max is the maximum reasoning effort mode of DeepSeek-V4-Flash, a 284B-parameter MoE model with 13B activated parameters and a 1M-token context window. Sharing the V4 series' hybrid a

DeepSeek-V4-Flash-Max is the maximum reasoning effort mode of DeepSeek-V4-Flash, a 284B-parameter MoE model with 13B activated parameters and a 1M-token context window. Sharing the V4 series' hybrid attention architecture (Compressed Sparse Attention combined with Heavily Compressed Attention), Manifold-Constrained Hyper-Connections, and Muon optimizer, V4-Flash-Max delivers reasoning performance comparable to V4-Pro when given a larger thinking budget while operating at a fraction of the parame

LLMs
Free tier

Intelligence

Popularity49/100
Monthly visits
Growth
Updated2026-05-21

Features

GPQA
MMLU
MMLU-Pro
AIME 2025
MATH
HumanEval

Pros

    Cons

      Use cases

      API inference · Fine-tuning · Benchmarking

      AI models used

      DeepSeek-V4-Flash-Max

      FAQ

      How much does DeepSeek-V4-Flash-Max cost?

      Free tier

      Does DeepSeek-V4-Flash-Max have a free plan?

      Limited or no free tier

      Is there an API?

      Yes