Does DeepSeek-V4-Flash-Max offer an API?

See API details on the tool page.

DeepSeek-V4-Flash-Max

DeepSeek-V4-Flash-Max is the maximum reasoning effort mode of DeepSeek-V4-Flash, a 284B-parameter MoE model with 13B activated parameters and a 1M-token context window. Sharing the V4 series' hybrid attention architecture (Compressed Sparse Attention combined with Heavily Compressed Attention), Manifold-Constrained Hyper-Connections, and Muon optimizer, V4-Flash-Max delivers reasoning performance comparable to V4-Pro when given a larger thinking budget while operating at a fraction of the parame…

LLMs

Free tier

Intelligence

Popularity49/100

Monthly visits—

Growth—

Updated2026-05-21

Features

GPQA

MMLU

MMLU-Pro

AIME 2025

MATH

HumanEval

Pros

Cons

Use cases

API inference · Fine-tuning · Benchmarking

AI models used

DeepSeek-V4-Flash-Max

FAQ

How much does DeepSeek-V4-Flash-Max cost?

Free tier

Does DeepSeek-V4-Flash-Max have a free plan?

Limited or no free tier

Is there an API?

Yes