Gemma 3 1B

The Gemma 3 1B model is a lightweight, 1-billion-parameter language model by Google, optimized for efficiency on resource-limited devices. At 529MB, it processes text at 2,585 tokens/second with a con

The Gemma 3 1B model is a lightweight, 1-billion-parameter language model by Google, optimized for efficiency on resource-limited devices. At 529MB, it processes text at 2,585 tokens/second with a context window of 128,000 tokens. It supports 35+ languages but handles text-only input, unlike larger multimodal Gemma models. This balance of speed and efficiency makes it ideal for fast text processing on mobile and low-power devices.

LLMs
Free tier

Intelligence

Popularity49/100
Monthly visits
Growth
Updated2026-05-21

Features

GPQA
MMLU
MMLU-Pro
AIME 2025
MATH
HumanEval

Pros

    Cons

      Use cases

      API inference · Fine-tuning · Benchmarking

      AI models used

      Gemma 3 1B

      FAQ

      How much does Gemma 3 1B cost?

      Free tier

      Does Gemma 3 1B have a free plan?

      Limited or no free tier

      Is there an API?

      Yes