Text Diffusion

Text Diffusion refers to a class of generative-ai models that produce text by iteratively refining outputs through a Diffusion-Model, analogous to image generation. Unlike traditional large-language-model that generate text sequentially token-by-token, diffusion-based text generation starts with noise and progressively denoises it into coherent text across multiple steps. This represents an alternative architecture to the dominant autoregressive methods in nlp.

Core Mechanics

The process operates by gradually removing noise from an initially random or corrupted text representation until a coherent output emerges. Key characteristics include:

Gemini Diffusion

Google has provided access to gemini-diffusion, an experimental model implementing this approach. It represents Google’s exploration into diffusion-based architectures for language tasks, offering researchers a platform to investigate the viability and performance characteristics of non-autoregressive text generation at scale.