Google hosts 14 active AI models, with input pricing starting at $0.25 per 1M tokens, an average output throughput of 88 tok/s, and context windows of up to 1M tokens. Compare Google's API pricing, latency, and feature support against other LLM providers.
14 models hosted
~88 tok/s output
Input $0.25/1M
Output $0.25/1M
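As a rough illustration of what these headline figures mean for a single request, the sketch below estimates cost and streaming time from token counts. It assumes the $0.25 per 1M token prices and the ~88 tok/s average apply uniformly, which will not hold for every model in the catalog. The function names `estimate_cost_usd` and `estimate_generation_seconds` are hypothetical helpers, not part of any Google API.

```python
# Headline figures from the summary above. These are the lowest listed prices
# and an average throughput, not guarantees for any particular model.
INPUT_USD_PER_1M = 0.25
OUTPUT_USD_PER_1M = 0.25
OUTPUT_TOKENS_PER_SEC = 88.0


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated request cost in USD at per-1M-token prices."""
    return (
        input_tokens * INPUT_USD_PER_1M + output_tokens * OUTPUT_USD_PER_1M
    ) / 1_000_000


def estimate_generation_seconds(output_tokens: int) -> float:
    """Return the approximate time to stream the output, ignoring time to first token."""
    return output_tokens / OUTPUT_TOKENS_PER_SEC


if __name__ == "__main__":
    # Hypothetical request: 10,000 prompt tokens and 2,000 generated tokens.
    cost = estimate_cost_usd(input_tokens=10_000, output_tokens=2_000)
    seconds = estimate_generation_seconds(2_000)
    print(f"cost: ${cost:.4f}, generation time: {seconds:.1f}s")
```

Per-model prices and throughput can differ substantially from these floor and average values, so the constants should be replaced with the figures for the specific model being evaluated.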
Model catalog
Models available via Google's API.
- Veo 3.1
- Veo 3.1 Fast
- Veo 3.0
- Veo 3.0 Fast
- Veo 2.0
- Gemini 3.1 Flash Image
- Gemini 2.5 Flash Image (Nano Banana)
- Gemini 3.5 Flash
- Gemini 3.1 Flash-Lite (32K context)
- Gemini 3 Pro Image
- Gemini 3 Flash (32K context)
- Gemini 3.1 Pro
- Gemini 2.5 Pro (1M context)
- Gemini 2.5 Flash (1M context)
- Gemini 3 Pro (32K context)
- GPT-5.1
- Grok-4 Heavy
- DeepSeek-R1-0528
- GLM-4.6 (200K context)
- GPT OSS 120B