A diffusion model generates images by starting with random noise and iteratively denoising over many steps until a coherent image emerges. This approach underlies Stable Diffusion, Midjourney, Flux, DALL-E (versions 2 and 3), and most modern image generators.
Diffusion is conceptually different from the autoregressive token-by-token approach used by LLMs, but the same underlying transformer architecture often serves as the denoising network. Video diffusion models (Sora, Runway, Kling) extend the same idea to temporally coherent frame sequences.
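The noise-to-image loop described above can be sketched in a few lines. This is a minimal DDPM-style sampler using NumPy, with a stand-in `toy_denoise_predictor` in place of a trained noise-prediction network (in real systems, that is where the U-Net or transformer runs); the schedule values are illustrative assumptions, not any particular model's settings.

```python
import numpy as np

def toy_denoise_predictor(x, t):
    # Hypothetical stand-in for a trained network that predicts the
    # noise present in x at timestep t. Real generators use a large
    # U-Net or transformer here.
    return x * 0.1

def ddpm_sample(shape, steps=50, seed=0):
    """Minimal DDPM-style reverse process: start from pure Gaussian
    noise and iteratively remove predicted noise over `steps` steps."""
    rng = np.random.default_rng(seed)
    # Simple linear noise schedule (an illustrative choice).
    betas = np.linspace(1e-4, 0.02, steps)
    alphas = 1.0 - betas
    alpha_bars = np.cumprod(alphas)

    x = rng.standard_normal(shape)  # begin from pure noise
    for t in reversed(range(steps)):
        eps_hat = toy_denoise_predictor(x, t)  # predicted noise
        coef = betas[t] / np.sqrt(1.0 - alpha_bars[t])
        mean = (x - coef * eps_hat) / np.sqrt(alphas[t])
        if t > 0:
            # Intermediate steps re-inject a small amount of noise.
            x = mean + np.sqrt(betas[t]) * rng.standard_normal(shape)
        else:
            x = mean  # final step: deterministic
    return x

image = ddpm_sample((8, 8))
print(image.shape)
```

The loop runs the model once per step, which is why diffusion sampling is slower than a single forward pass; production systems reduce the step count with faster samplers or distillation.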