Gemma 4 E2B

Gemma 4 E2B is Google DeepMind's smallest multimodal model with 2.3 billion effective parameters (5.1B with embeddings) and a 128K context window. Supports image, text, and audio inputs. Designed for

Gemma 4 E2B is Google DeepMind's smallest multimodal model with 2.3 billion effective parameters (5.1B with embeddings) and a 128K context window. Supports image, text, and audio inputs. Designed for on-device and edge deployment with Per-Layer Embeddings for efficient inference.

LLMs
Free tier

Intelligence

Popularity49/100
Monthly visits
Growth
Updated2026-05-21

Features

GPQA
MMLU
MMLU-Pro
AIME 2025
MATH
HumanEval

Pros

    Cons

      Use cases

      API inference · Fine-tuning · Benchmarking

      AI models used

      Gemma 4 E2B

      FAQ

      How much does Gemma 4 E2B cost?

      Free tier

      Does Gemma 4 E2B have a free plan?

      Limited or no free tier

      Is there an API?

      Yes