Read the original document by opening this link in a new tab.
Table of Contents
Diffusion Explainer: Visual Explanation for Text-to-image Stable Diffusion
ABSTRACT
Index Terms
1 I NTRODUCTION
2 R ELATED WORKS
3 D ESIGN GOALS
4 S YSTEM DESIGN AND IMPLEMENTATION
- Overview
- Architecture View
- Refinement Comparison View
5 U SAGE SCENARIOS
- Discovering Prompts' Impact on Image Generation
- Discerning Challenges in Attributing AI Generations
Summary
Diffusion Explainer is an interactive visualization tool that explains how Stable Diffusion transforms text prompts into high-resolution images. It provides a visual overview of Stable Diffusion's architecture and operations, enabling users to understand the complex image generation process. The tool allows users to compare the impact of different text prompts on image generation, revealing how keywords affect the evolution of image representations. Diffusion Explainer is designed to bridge multiple levels of abstraction through animations and interactive elements, making it accessible to both practitioners and non-experts. By running locally in users' web browsers, it broadens access to modern generative AI techniques.