Does Gemini 3.1 Flash-Lite offer an API?

See API details on the tool page.

Gemini 3.1 Flash-Lite

Gemini 3.1 Flash-Lite is the first Flash-Lite model in the Gemini 3 series. It is optimized for high-volume, latency-sensitive tasks like translation, content moderation, and classification. It delivers enhanced performance at a fraction of the cost of larger models, with 2.5x faster Time to First Answer Token and 45% increased output speed compared to 2.5 Flash. Supports text, image, video, audio, and PDF input with a 1 million-token context window.

LLMs

Free tier

Intelligence

Popularity49/100

Monthly visits—

Growth—

Updated2026-05-21

Features

GPQA

MMLU

MMLU-Pro

AIME 2025

MATH

HumanEval

Pros

Cons

Use cases

API inference · Fine-tuning · Benchmarking

AI models used

Gemini 3.1 Flash-Lite

FAQ

How much does Gemini 3.1 Flash-Lite cost?

Free tier

Does Gemini 3.1 Flash-Lite have a free plan?

Limited or no free tier

Is there an API?

Yes