Gemini 3.1 Flash-Lite
Gemini 3.1 Flash-Lite is the first Flash-Lite model in the Gemini 3 series. It is optimized for high-volume, latency-sensitive tasks like translation, content moderation, and classification. It delive
Gemini 3.1 Flash-Lite is the first Flash-Lite model in the Gemini 3 series. It is optimized for high-volume, latency-sensitive tasks like translation, content moderation, and classification. It delivers enhanced performance at a fraction of the cost of larger models, with 2.5x faster Time to First Answer Token and 45% increased output speed compared to 2.5 Flash. Supports text, image, video, audio, and PDF input with a 1 million-token context window.
Intelligence
Features
Pros
Cons
Use cases
API inference · Fine-tuning · Benchmarking
AI models used
Gemini 3.1 Flash-Lite
FAQ
How much does Gemini 3.1 Flash-Lite cost?
Free tier
Does Gemini 3.1 Flash-Lite have a free plan?
Limited or no free tier
Is there an API?
Yes