Browse latest
Tools & PlatformsAI - Ars Technica · June 10, 2026

Google DeepMind releases DiffusionGemma, a model that runs local AI 4x faster

Google DeepMind releases DiffusionGemma, a model that runs local AI 4x faster — AI - Ars Technica

Google DeepMind introduces DiffusionGemma, an experimental AI model that generates text in parallel, accelerating local processing by up to four times compared to traditional autoregressive models. This innovation shifts the computational bottleneck from memory bandwidth to compute, offering significant speed advantages for non-linear tasks.

Author: Morein.ai Editorial

Google DeepMind has unveiled DiffusionGemma, a new AI model within the Gemma 4 family that significantly accelerates text generation on local hardware. Unlike traditional autoregressive models that generate text sequentially, DiffusionGemma processes entire blocks of text simultaneously, leading to faster and more efficient local AI operations. This innovative approach can boost performance by up to four times compared to existing Gemma models.

DiffusionGemma

Read original source

Related articles