Gemma 4 E2B
Gemma 4 E2B is Google DeepMind's smallest multimodal model, with 2.3 billion effective parameters (5.1B with embeddings) and a 128K context window. It supports image, text, and audio inputs and is designed for on-device and edge deployment, using Per-Layer Embeddings for efficient inference.
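To put the two parameter counts in perspective, here is a rough back-of-the-envelope sketch of weight storage at a few common precisions. The helper name `weight_memory_gib` and the chosen bit widths are illustrative assumptions, not part of any official tooling, and the estimate ignores activations, KV cache, and runtime overhead.

```python
# Rough weight-storage estimate: parameters * bits per parameter, converted to GiB.
# Assumption: only raw weights are counted (no activations, KV cache, or overhead).

EFFECTIVE_PARAMS = 2.3e9  # effective parameter count quoted in the listing
TOTAL_PARAMS = 5.1e9      # parameter count including embeddings, per the listing


def weight_memory_gib(num_params: float, bits_per_param: int) -> float:
    """Return approximate weight storage in GiB for a given precision."""
    bytes_total = num_params * bits_per_param / 8
    return bytes_total / 2**30


if __name__ == "__main__":
    # The precisions below are illustrative; actual deployments may differ.
    for bits in (16, 8, 4):
        effective = weight_memory_gib(EFFECTIVE_PARAMS, bits)
        total = weight_memory_gib(TOTAL_PARAMS, bits)
        print(f"{bits:>2}-bit: effective ~{effective:.1f} GiB, "
              f"with embeddings ~{total:.1f} GiB")
```

The gap between the two columns shows how much of the total footprint the listing attributes to embeddings at each precision.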
LLMs
Free tier
Intelligence
Popularity: 49/100
Monthly visits: n/a
Growth: n/a
Updated: 2026-05-21
Features
GPQA
MMLU
MMLU-Pro
AIME 2025
MATH
HumanEval
Pros
Cons
Use cases
API inference · Fine-tuning · Benchmarking
AI models used
Gemma 4 E2B
FAQ
How much does Gemma 4 E2B cost?
Free tier
Does Gemma 4 E2B have a free plan?
Yes, this listing shows a free tier.
Is there an API?
Yes