Google DeepMind released DiffusionGemma, a new open-source AI model that generates text in parallel rather than sequentially, achieving roughly 4x faster output speeds than comparable autoregressive models on local hardware. The 26-billion-parameter model uses a diffusion-based approach similar to image generation, making it efficient for tasks like in-line editing and mathematical problems, though it carries tradeoffs including higher error rates on short outputs.
1 comment
Google DeepMind released DiffusionGemma, a new open-source AI model that generates text in parallel rather than sequentially, achieving roughly 4x faster output speeds than comparable autoregressive models on local hardware. The 26-billion-parameter model uses a diffusion-based approach similar to image generation, making it efficient for tasks like in-line editing and mathematical problems, though it carries tradeoffs including higher error rates on short outputs.