Gemma 3n E4B Instructed

Gemma 3n is a multimodal model designed to run locally on hardware, supporting image, text, audio, and video inputs. It features a language decoder, audio encoder, and vision encoder, and is available

Gemma 3n is a multimodal model designed to run locally on hardware, supporting image, text, audio, and video inputs. It features a language decoder, audio encoder, and vision encoder, and is available in two sizes: E2B and E4B. The model is optimized for memory efficiency, allowing it to run on devices with limited GPU RAM. Gemma is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models. Gemma models are wel

LLMs
Free tier

Intelligence

Popularity49/100
Monthly visits
Growth
Updated2026-05-21

Features

GPQA
MMLU
MMLU-Pro
AIME 2025
MATH
HumanEval

Pros

    Cons

      Use cases

      API inference · Fine-tuning · Benchmarking

      AI models used

      Gemma 3n E4B Instructed

      FAQ

      How much does Gemma 3n E4B Instructed cost?

      Free tier

      Does Gemma 3n E4B Instructed have a free plan?

      Limited or no free tier

      Is there an API?

      Yes