Google launches Gemma 4 AI models with threefold speed increase and open-source access

Section editor: Andre Teow, Editor, A47 News·Moderate4 articles covering this·4 news sources·Updated 2 months ago·World

Google Gemma 4 AI models showcasing speed and open-source features

Here's what it means for you.

This launch signifies a pivotal shift in AI development, enhancing speed and accessibility for developers.

What happened

Google released the Gemma 4 AI models with a threefold speed increase using Multi-Token Prediction technology.

The Context

Open-source access: Gemma 4 is released under the Apache 2.0 license, promoting open-source development.
Enhanced performance: The models utilize Multi-Token Prediction for faster inference, significantly enhancing performance.
Active reasoning: Gemma 4 represents a shift in AI capabilities, moving from passive response to active reasoning and planning.

Takeaway

The democratization of AI through Gemma 4 could reshape the landscape of artificial intelligence development.

4 Articles

DEV Community

Gemma 4 — The Open-Source Beast Google Just Unleashed

On April 2, 2026, Google DeepMind unveiled Gemma 4, a significant advancement in AI technology, releasing it under the Apache 2.0 license, marking a shift towards a more open-source approach in the industry. This release allows developers unprecedent...

3 months ago

Read Full Article

Techmeme

Google releases Multi-Token Prediction drafters for its Gemma 4 models, which use a form of speculative decoding to guess future tokens for faster inference (Ryan Whitwam/Ars Technica)

Google has launched Multi-Token Prediction drafters for its Gemma 4 models, utilizing speculative decoding techniques to enhance inference speed by up to three times. This advancement was announced in conjunction with the broader rollout of the Gemma...

3 months ago

Read Full Article

THE DECODER

Google speeds up Gemma 4 threefold with multi-token prediction

Google has introduced multi-token prediction drafters for its Gemma 4 open model family, enhancing text generation speed by up to three times. This innovation allows a smaller auxiliary model to suggest multiple tokens simultaneously, while the main ...

3 months ago

Read Full Article

Ars Technica — All

Google's Gemma 4 AI models get 3x speed boost by predicting future tokens

Google has announced a significant enhancement to its Gemma 4 AI models, achieving a threefold increase in text generation speed through a new multi-token prediction feature. This advancement allows the model to predict multiple tokens simultaneously...

3 months ago

Read Full Article

Ars Technica

Google's Gemma 4 AI models get 3x speed boost by predicting future tokens

Google's latest Gemma 4 AI models have achieved a remarkable threefold speed increase by predicting future tokens, enhancing performance without compromising quality. This advancement marks a significant step in AI technology, showcasing Google's com...

3 months ago

Read Full Article