Google launches Gemma 4 open-source LLM family
Google launched Gemma 4, an open model family built for agentic AI workflows that comes in four sizes.
The move comes as the US lags China in open large language models; China counts DeepSeek and Qwen as just two of its entrants. Nvidia has pushed its Nemotron models to build out the open source AI ecosystem, and Google's Gemma models have been downloaded more than 400 million times. Simply put, there's demand for open source LLMs.
For enterprises, the open source model space is worth watching because these models can be tailored to business use cases; Google said Gemma already has more than 100,000 variants.
Gemma 4 is licensed under Apache 2.0 and includes technology from Gemini 3. Gemma 4 will come in four sizes:
- Effective 2B (E2B).
- Effective 4B (E4B).
- 26B Mixture of Experts (MoE).
- 31B Dense.
Google said its largest Gemma 4 model, the 31B Dense, would rank No. 3 on the Arena AI text leaderboard. In Arena AI's open source category, the top spots are dominated by Chinese models.
According to Google, Gemma 4's 26B MoE and 31B Dense models provide more intelligence per parameter and outcompete much larger models to achieve "frontier-level capabilities with significantly less hardware overhead."
Indeed, Google said the 26B and 31B models are designed for offline use, including on consumer GPUs; state-of-the-art capabilities can run on a single 80GB Nvidia H100 GPU. The E2B and E4B models are designed for mobile, IoT and edge devices, including the Raspberry Pi and Jetson Nano.
Here's a look at what you need to know about Gemma 4:
- Gemma 4 models are designed to run on everything from Android devices to laptop GPUs to workstations and accelerators.
- Google cited customizations of Gemma 4 that include a Bulgarian-first language model and Yale University's Cell2Sentence-Scale model for cancer research.
- The models support agentic workflows and capabilities such as function calling and native system instructions.
- Offline code generation, native video and image processing, and native audio input.
- Longer context windows and training on more than 140 languages.
- Gemma 4 models are optimized for Nvidia GPUs, AMD GPUs and Google Cloud TPUs.
Gemma 4 is available in Google AI Studio (31B and 26B) and Google AI Edge Gallery (E4B and E2B), as well as through multiple tools including Hugging Face, Nvidia NIM and NeMo, Ollama and Docker.