Denoising diffusion models represent a recent emerging topic in computer vision, demonstrating remarkable results in the area of generative modeling. A diffusion model is a deep generative model that is based on two stages, a forward diffusion stage and a reverse diffusion stage. In the forward diffusion stage, the input data is gradually perturbed over several steps by adding Gaussian noise. In the reverse stage, a model is tasked at recovering the original input data by learning to gradually reverse the diffusion process, step by step. Diffusion models are widely appreciated for the quality and diversity of the generated samples, despite their known computational burdens, i.e. low speeds due to the high number of steps involved during sampling. This repository categorizes the papers about diffusion models, applied in computer vision, according to their target task. The classifcation is based on our survey Diffusion Models in Vision: A Survey, which was accepted for publication in IEEE TPAMI.
- Unconditional Generation
- Conditional Generation
- Text-to-Image generation
- Super-Resolution
- Image Editing
- Region Image Editing
- Inpainting
- Image-to-Image Translation
- Image Segmentation
- Multi-Task
- Medical Image-to-Image Translation
- Medical Image Generation
- Medical Image Segmentation
- Medical Image Anomaly Detection
- Video Generation
- Few-Shot Image Generation
- Counterfactual Explanations and Estimations
- Image Restoration
- Image Registration
- Adversarial Purification
- Semantic Image Generation
- Shape Generation and Completion
- Classification
- Point Cloud Generation
- Theoretical
- Graphs
- Deblurring
- Face Morphing Attack Detection
- Trajectory prediction
- Attacks
- Study on data memorization
- Deep unsupervised learning using non-equilibrium thermodynamics
- Denoising diffusion probabilistic models
- Improved techniques for training score-based generative models
- Adversarial score matching and improved sampling for image generation
- Maximum likelihood training of score-based diffusion models
- D2C: Diffusion-Decoding Models for Few-Shot Conditional Generation
- Diffusion Normalizing Flow
- Diffusion Schrodinger bridge with applications to score-based generative modeling
- Structured denoising diffusion models in discrete state-spaces
- Score-based generative modeling in latent space
- Improved denoising diffusion probabilistic models
- Denoising Diffusion Implicit Models
- Non-Gaussian denoising diffusion models
- Bilateral denoising diffusion models
- Unleashing Transformers: Parallel Token Prediction with Discrete Absorbing Diffusion for Fast High-Resolution Image Generation from Vector-Quantized Codes
- Noise estimation for generative diffusion models
- Gotta go fast when generating data with score-based models
- Learning to efficiently sample from diffusion probabilistic models
- Deep generative learning via Schrodinger bridge
- VAEs meet Diffusion Models: Efficient and High-Fidelity Generation
- Variational diffusion models
- Score-based generative modeling with critically-damped Langevin diffusion
- Tackling the generative learning trilemma with Denoising Diffusion GANs
- Heavy-tailed denoising score matching
- Analytic-DPM: an Analytic Estimate of the Optimal Reverse Variance in Diffusion Probabilistic Models
- Learning Fast Samplers for Diffusion Models by Differentiating Through Sample Quality
- Truncated Diffusion Probabilistic Models
- Subspace Diffusion Generative Models
- Maximum Likelihood Training of Implicit Nonlinear Diffusion Models
- On Analyzing Generative and Denoising Capabilities of Diffusion-based Deep Generative Models
- Diffusion-GAN: Training GANs with Diffusion
- Accelerating Score-based Generative Models for High-Resolution Image Synthesis
- Soft Diffusion: Score Matching for General Corruptions
- Post-Training Quantization on Diffusion Models
- Lookahead Diffusion Probabilistic Models for Refining Mean Estimation
- Wavelet Diffusion Models are fast and scalable Image Generators
- All are Worth Words: A ViT Backbone for Diffusion Models
- Diffusion Probabilistic Model Made Slim
- Diffusion models beat gans on image synthesis
- Classifier-Free Diffusion Guidance
- On Fast Sampling of Diffusion Probabilistic Models
- DiffuseVAE: Efficient, Controllable and High-Fidelity Generation from Low-Dimensional Latents
- Pseudo Numerical Methods for Diffusion Models on Manifolds
- Cascaded Diffusion Models for High Fidelity Image Generation
- High Fidelity Visualization of What Your Self-Supervised Representation Knows About
- Itô-Taylor Sampling Scheme for Denoising Diffusion Probabilistic Models using Ideal Derivatives
- {Dynamic Dual-Output Diffusion Models
- Generating High Fidelity Data from Low-density Regions using Diffusion Models
- Perception Prioritized Training of Diffusion Models
- Elucidating the Design Space of Diffusion-Based Generative Models
- Progressive distillation for fast sampling of diffusion models
- Denoising Likelihood Score Matching for Conditional Score-based Data Generation
- On Conditioning the Input Noise for Controlled Image Generation with Diffusion Models
- A Continuous Time Framework for Discrete Denoising Models
- DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps
- Compositional Visual Generation with Composable Diffusion Models
- TryOnDiffusion: A Tale of Two UNets
- High-Fidelity Guided Image Synthesis with Latent Diffusion Models
- Unite and Conquer: Plug & Play Multi-Modal Synthesis using Diffusion Models
- Towards Practical Plug-and-Play Diffusion Models
- Inversion-based Style Transfer with Diffusion Models
- Conditional Text Image Generation with Diffusion Models
- Generative Diffusion Prior for Unified Image Restoration and Enhancement
- DCFace: Synthetic Face Generation With Dual Condition Diffusion Model
- Controllable Light Diffusion for Portraits
- LayoutDiffusion: Controllable Diffusion Model for Layout-to-image Generation
- Self-Guided Diffusion Models
- Vector quantized diffusion model for text-to-image synthesis
- Hierarchical text-conditional image generation with CLIP latents
- Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
- Fast Sampling of Diffusion Models with Exponential Integrator
- DiVAE: Photorealistic Images Synthesis with Denoising Diffusion Decoder
- Text-Guided Synthesis of Artistic Images with Retrieval-Augmented Diffusion Models
- Text2Human: Text-Driven Controllable Human Image Generation
- DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation
- SpaText: Spatio-Textual Representation for Controllable Image Generation
- MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation
- Person Image Synthesis via Denoising Diffusion Model
- Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models
- Multi-Concept Customization of Text-to-Image Diffusion
- ERNIE-ViLG 2.0: Improving Text-to-Image Diffusion Model with Knowledge-Enhanced Mixture-of-Denoising-Experts
- Shifted Diffusion for Text-to-image Generation
- Specialist Diffusion: Plug-and-Play Sample-Efficient Fine-Tuning of Text-to-Image Diffusion Models To Learn Any Unseen Style
- Safe Latent Diffusion: Mitigating Inappropriate Degeneration in Diffusion Models
- Image super-resolution via iterative refinement
- Score-based Generative Neural Networks for Large-Scale Optimal Transport
- Implicit Diffusion Models for Continuous Super-Resolution
- SDEdit: Guided Image Synthesis and Editing with Stochastic Differential Equations
- Blended Latent Diffusion
- [SINE: SINgle Image Editing with Text-to-Image Diffusion Models (https://openaccess.thecvf.com/content/CVPR2023/papers/Zhang_SINE_SINgle_Image_Editing_With_Text-to-Image_Diffusion_Models_CVPR_2023_paper.pdf)
- Imagic: Text-Based Real Image Editing with Diffusion Models
- Collaborative Diffusion for Multi-Modal Face Generation and Editing
- Null-text Inversion for Editing Real Images using Guided Diffusion Models
- DiffusionRig: Learning Personalized Priors for Facial Appearance Editing
- [RenderDiffusion: Image Diffusion for 3D Reconstruction, Inpainting and Generation] (https://openaccess.thecvf.com/content/CVPR2023/papers/Anciukevicius_RenderDiffusion_Image_Diffusion_for_3D_Reconstruction_Inpainting_and_Generation_CVPR_2023_paper.pdf)
- Paint by Example: Exemplar-based Image Editing with Diffusion Models
- GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models
- RePaint: Inpainting using Denoising Diffusion Probabilistic Models
- [RGBD2: Generative Scene Synthesis via Incremental View Inpainting using RGBD Diffusion Models] (https://openaccess.thecvf.com/content/CVPR2023/papers/Lei_RGBD2_Generative_Scene_Synthesis_via_Incremental_View_Inpainting_Using_RGBD_CVPR_2023_paper.pdf) 4.SmartBrush: Text and Shape Guided Object Inpainting With Diffusion Model
- Palette: Image-to-Image Diffusion Models
- UNIT-DDPM: UNpaired Image Translation with Denoising Diffusion Probabilistic Models
- EGSDE: Unpaired Image-to-Image Translation via Energy-Guided Stochastic Differential Equations
- Pretraining is All You Need for Image-to-Image Translation
- VQBB: Image-to-image Translation with Vector Quantized Brownian Bridge
- The Swiss Army Knife for Image-to-Image Translation: Multi-Task Diffusion Models
- Unifying Diffusion Models' Latent Space, with Applications to CycleDiffusion and Guidance
- BBDM: Image-to-Image Translation with Brownian Bridge Diffusion Models
- Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation
- Label-Efficient Semantic Segmentation with Diffusion Models
- SegDiff: Image Segmentation with Diffusion Probabilistic Models
- Multi-Class Segmentation from Aerial Views using Recursive Noise Diffusion
- Ambiguous Medical Image Segmentation using Diffusion Models
- Generative modeling by estimating gradients of the data distribution
- Score-Based Generative Modeling through Stochastic Differential Equations
- ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis
- Learning Energy-Based Models by Diffusion Recovery Likelihood
- Conditional image generation with score-based diffusion models
- More control for free! Image synthesis with semantic diffusion guidance
- ILVR: Conditioning Method for Denoising Diffusion Probabilistic Models
- Global Context with Discrete Diffusion in Vector Quantised Modelling for Image Generation
- High-Resolution Image Synthesis with Latent Diffusion Models
- Diffusion Autoencoders: Toward a Meaningful and Decodable Representation
- Come-Closer-Diffuse-Faster: Accelerating Conditional Diffusion Models for Inverse Problems through Stochastic Contraction
- DiffusionCLIP: Text-Guided Diffusion Models for Robust Image Manipulation
- Understanding DDPM Latent Codes Through Optimal Transport
- Conditional Simulation Using Diffusion Schrödinger Bridges
- Retrieval-Augmented Diffusion Models
- Accelerating Diffusion Models via Early Stop of the Diffusion Process
- Diffusion Models as Plug-and-Play Priors
- Non-Uniform Diffusion Models
- Diffusion Probabilistic Model Made Slim
- Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models
- On Distillation of Guided Diffusion Model
- DiffCollage: Parallel Generation of Large Content With Diffusion Models
- Unsupervised Medical Image Translation with Adversarial Diffusion Models
- Unsupervised Denoising of Retinal OCT with Diffusion Probabilistic Model
- Conversion Between CT and MRI Images Using Diffusion and Score-Matching Models
- Solving inverse problems in medical imaging with score-based generative models
- Score-based diffusion models for accelerated MRI
- Diffusion Models For Medical Image Analysis: A Comprehensive Survey
- Low-Dose CT Using Denoising Diffusion Probabilistic Model for 20× Speedup
- Solving 3D Inverse Problems using Pre-trained 2D Diffusion Models
- Diffusion Models for Implicit Image Segmentation Ensembles
- Accelerating Diffusion Models via Pre-segmentation Diffusion Sampling for Medical Image Segmentation
- Diffusion Models for Medical Anomaly Detection
- Fast Unsupervised Brain Anomaly Detection and Segmentation with Diffusion Models
- AnoDDPM: Anomaly Detection With Denoising Diffusion Probabilistic Models Using Simplex Noise
- What is Healthy? Generative Counterfactual Diffusion for Lesion Localization
- Video Diffusion Models
- Diffusion Probabilistic Modeling for Video Generation
- Flexible Diffusion Modeling of Long Videos
- Diffusion Models for Video Prediction and Infilling
- Dreamix: Video Diffusion Models are General Video Editors
- Conditional Image-to-Video Generation with Latent Flow Diffusion Models
- Diffusion Video Autoencoders: Toward Temporally Consistent Face Video Editing via Disentangled Video Encoding
- DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation
- MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation
- VideoFusion: Decomposed Diffusion Models for High-Quality Video Generation
- Video Probabilistic Diffusion Models in Projected Latent Space
- Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models
- Diffusion Models for Counterfactual Explanations
- Diffusion Causal Models for Counterfactual Estimation
- Restoring Vision in Adverse Weather Conditions with Patch-Based Denoising Diffusion Models
- Denoising Diffusion Restoration Models
- Diffusion in the Dark: A Diffusion Model for Low-Light Text Recognition
- High-resolution image reconstruction with latent diffusion models from human brain activity
- Seeing Beyond the Brain: Conditional Diffusion Model with Sparse Masked Modeling for Vision Decoding
- 3D shape generation and completion through point-voxel diffusion
- RODIN: A Generative Model for Sculpting 3D Digital Avatars Using Diffusion
- Avatars Grow Legs: Generating Smooth Human Motion from Sparse Tracking Inputs with Diffusion Model
- NeuralField-LDM: Scene Generation with Hierarchical Latent Diffusion Models
- Diffusion-SDF: Text-to-Shape via Voxelized Diffusion
- Score Jacobian Chaining: Lifting Pretrained 2D Diffusion Models for 3D Generation
- DATID-3D: Diversity-Preserved Domain Adaptation Using Text-to-Image Diffusion for 3D Generative Model
- HOLODIFFUSION: Training a 3D Diffusion Model using 2D Images
- Dream3D: Zero-Shot Text-to-3D Synthesis Using 3D Shape Prior and Text-to-Image Diffusion Models
- Consistent View Synthesis with Pose-Guided Diffusion Models
- Score-based generative classifiers
- Diffusion-based Data Augmentation for Skin Disease Classification: Impact Across Original Medical Datasets to Fully Synthetic Images
- A variational perspective on diffusion-based generative models and score matching
- Sampling is as easy as learning the score: theory for diffusion models with minimal data assumptions
1.Diffusion Art or Digital Forgery? Investigating Data Replication in Diffusion Models