Diffusion models quantization benchmark
Diffusion models have become the de facto standard in image generation. While remarkably powerful, they rely on a multi-step process of iterative denoising, which takes time. Text generation is also sequential, but the text can be streamed and displayed token by token, giving a natural, real-time feel, similar to how humans read or write. With images, we must wait for the final denoising step to complete before a usable result appears. This makes optimizing diffusion models for speed and efficiency a critical challenge.










