Diffusion models beat gans on image synthesis

•Télécharger en tant que PPTX, PDF•

0 j'aime•764 vues

BeerenSahu

Technologie

Likelihood based models
Generative models whose objective function is log-likelihood.
Examples: Autoregressive models, VAE, Diffusion Models, etc
GANs use sample quality metrics such as FID, Inception Score and Precision
some of these metrics do not fully capture diversity.
Negative Log Likelihood does otherwise

GAN vs Likelihood-base models
● GANs capture less diversity than state-of-the-art likelihood-based models
○ Ali Razavi, Aaron van den Oord, and Oriol Vinyals. Generating diverse high-fidelity images
with VQ-VAE-2. arXiv:1906.00446, 2019.
○ Alex Nichol and Prafulla Dhariwal. Improved denoising diffusion probabilistic models.
arXiv:2102.09672, 2021.
○ Charlie Nash, Jacob Menick, Sander Dieleman, and Peter W. Battaglia. Generating images
with sparse representations. arXiv:2103.03841, 2021.
● GANs are often difficult to train, collapsing without carefully selected
hyperparameters and regularizers
● Thus, make them difficult to scale and apply to new domains.

GAN vs Likelihood-base models
● Pros:
○ these models capture more diversity
○ typically easier to scale and train than GANs
● Cons:
○ they still fall short in terms of visual sample quality
○ except for VAEs, sampling from these models is slower than GANs

Diffusion Model
Diffusion models are a class of likelihood-based models which have recently been
shown to produce high-quality images while offering desirable properties such as
distribution coverage, a stationary training objective, and easy scalability.
This paper:
● achieve an FID of 2.97 on ImageNet 128×128 (May 2020)
● Beats previous record holder BigGAN-deep with FID of 5.7 (2018)
“We hypothesize that the gap between diffusion models and GANs stems from at
least two factors: first, that the model architectures used by recent GAN literature
have been heavily explored and refined; second, that GANs are able to trade off
diversity for quality, producing high quality samples but not covering the whole
distribution.”

Denoising Diffusion Probabilistic Models
Probabilistic Models: learns some probability Distribution (PD)
Diffusion: gradually diffusing/adding noise to input image (Forward process)
Denoising: removing the noise to synthesis realistic image (Backward process)

Intuition
Forward: q(xt|xt-1)
Backward: pθ(xt-1|xt )

Evolution of Diffusion Models
Feb 2021
Improved Denoising Diffusion
Probabilistic Models
Alex Nichol, Prafulla Dhariwal
OpenAI
May 2021
Diffusion Models Beat GANs on
Image Synthesis
Prafulla Dhariwal, Alex Nichol
OpenAI
2015
Deep Unsupervised Learning
using Nonequilibrium
Thermodynamics
Jascha Sohl-Dickstein, Eric Weiss,
Niru Maheswaranathan, and Surya
Ganguli
Stanford and UC Berkeley
2019
Denoising Diffusion
Probabilistic Models
Jonathan Ho, Ajay Jain, Pieter
Abbeel
UC Berkeley

Resources
Repo: https://github.com/openai/guided-diffusion
Video: https://www.youtube.com/watch?v=W-O7AZNzbzQ
VAE: https://towardsdatascience.com/understanding-variational-autoencoders-vaes-
f70510919f73

Contenu connexe

Tendances

Generative Adversarial NetworksMustafa Yagmur

Stable Diffusion pathVitaly Bondar

Generative Adversarial Networks (GAN)Manohar Mukku

Latent diffusions vs DALL-E v2Vitaly Bondar

Basic Generative Adversarial NetworksDong Heon Cho

LDM_ImageSythesis.pptxAkankshaRawat53

Generative Adversarial Networks and Their Applications in Medical ImagingSanghoon Hong

Image-to-Image Translation with Conditional Adversarial Nets (UPC Reading Group)Universitat Politècnica de Catalunya

Deep Generative Models Chia-Wen Cheng

Finding connections among images using CycleGANNAVER Engineering

Generative Adversarial NetworksMark Chang

Variational Autoencoders VAE - Santiago Pascual - UPC Barcelona 2018Universitat Politècnica de Catalunya

Generative adversarial networksYunjey Choi

GANs and ApplicationsHoang Nguyen

Generative Adversarial Networks and Their Medical Imaging ApplicationsKyuhwan Jung

Explicit Density ModelsSangwoo Mo

1시간만에 GAN(Generative Adversarial Network) 완전 정복하기NAVER Engineering

Deep Learning for Computer Vision: Generative models and adversarial training...Universitat Politècnica de Catalunya

Domain Transfer and Adaptation SurveySangwoo Mo

Self-supervised Learning Lecture NoteSangwoo Mo

Tendances (20)

Generative Adversarial Networks

Stable Diffusion path

Generative Adversarial Networks (GAN)

Latent diffusions vs DALL-E v2

Basic Generative Adversarial Networks

LDM_ImageSythesis.pptx

Generative Adversarial Networks and Their Applications in Medical Imaging

Image-to-Image Translation with Conditional Adversarial Nets (UPC Reading Group)

Deep Generative Models

Finding connections among images using CycleGAN

Generative Adversarial Networks

Variational Autoencoders VAE - Santiago Pascual - UPC Barcelona 2018

Generative adversarial networks

GANs and Applications

Generative Adversarial Networks and Their Medical Imaging Applications

Explicit Density Models

1시간만에 GAN(Generative Adversarial Network) 완전 정복하기

Deep Learning for Computer Vision: Generative models and adversarial training...

Domain Transfer and Adaptation Survey

Self-supervised Learning Lecture Note

Similaire à Diffusion models beat gans on image synthesis

PROGRESSIVE GROWING OF GAN S FOR I MPROVED QUALITY , STABILITY , AND VARIATIONWilly Marroquin (WillyDevNET)

An Extensive Review on Generative Adversarial Networks GAN’sijtsrd

A pixel to-pixel segmentation method of DILD without masks using CNN and perl...남주 김

Deep Generative ModellingPetko Nikolov

Neural Architectures for Still Images - Xavier Giro- UPC Barcelona 2019Universitat Politècnica de Catalunya

Introduction to Generative Models.pptxJOBANPREETSINGH62

Introduction to Generative Adversarial Networks (GANs)Appsilon Data Science

OOD_PPT.pptxYashBhatnagar30

Image generative modeling for design inspiration and image editing by Camille...Paris Women in Machine Learning and Data Science

ADVANCED SINGLE IMAGE RESOLUTION UPSURGING USING A GENERATIVE ADVERSARIAL NET...sipij

Deep Learning of High-Level RepresentationsHamid Eghbal-zadeh

Generative Adversarial Networks GAN - Xavier Giro - UPC TelecomBCN Barcelona ...Universitat Politècnica de Catalunya

Pixel Recurrent Neural Networksneouyghur

169 s170G H Raisoni College of Engineering Nagpur

Let's paint a Picasso - A Look at Generative Adversarial Networks (GAN) and i...Catalina Arango

Deep Semi-supervised Learning methodsPrincy Joy

The Success of Deep Generative Modelsinside-BigData.com

Decomposing image generation into layout priction and conditional synthesisNaeem Shehzad

Variants of GANs - Jaejun YooJaeJun Yoo

DESIGN AND EVALUATION OF IMAGE DENOISING USING GENERATIVE ADVERSARIAL NETWORK...JoshuaAlexMbaya

Similaire à Diffusion models beat gans on image synthesis (20)

PROGRESSIVE GROWING OF GAN S FOR I MPROVED QUALITY , STABILITY , AND VARIATION

An Extensive Review on Generative Adversarial Networks GAN’s

A pixel to-pixel segmentation method of DILD without masks using CNN and perl...

Deep Generative Modelling

Neural Architectures for Still Images - Xavier Giro- UPC Barcelona 2019

Introduction to Generative Models.pptx

Introduction to Generative Adversarial Networks (GANs)

OOD_PPT.pptx

Image generative modeling for design inspiration and image editing by Camille...

ADVANCED SINGLE IMAGE RESOLUTION UPSURGING USING A GENERATIVE ADVERSARIAL NET...

Deep Learning of High-Level Representations

Generative Adversarial Networks GAN - Xavier Giro - UPC TelecomBCN Barcelona ...

Pixel Recurrent Neural Networks

169 s170

Let's paint a Picasso - A Look at Generative Adversarial Networks (GAN) and i...

Deep Semi-supervised Learning methods

The Success of Deep Generative Models

Decomposing image generation into layout priction and conditional synthesis

Variants of GANs - Jaejun Yoo

DESIGN AND EVALUATION OF IMAGE DENOISING USING GENERATIVE ADVERSARIAL NETWORK...

Dernier

The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los

Developing An App To Navigate The Roads of BrazilV3cube

Boost PC performance: How more available memory can improve productivityPrincipled Technologies

Scaling API-first – The story of a global engineering organizationRadu Cotescu

Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal

Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech

Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge

Artificial Intelligence: Facts and MythsJoaquim Jorge

Finology Group – Insurtech Innovation Award 2024The Digital Insurer

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93

Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services

Tech Trends Report 2024 Future Today Institute.pdfhans926745

What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j

2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong

How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes

Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer

TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc

AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin

HTML Injection Attacks: Impact and Mitigation StrategiesBoston Institute of Analytics

Dernier (20)

The 7 Things I Know About Cyber Security After 25 Years | April 2024

Developing An App To Navigate The Roads of Brazil

Boost PC performance: How more available memory can improve productivity

Scaling API-first – The story of a global engineering organization

Understanding Discord NSFW Servers A Guide for Responsible Users.pdf

Advantages of Hiring UIUX Design Service Providers for Your Business

Driving Behavioral Change for Information Management through Data-Driven Gree...

Artificial Intelligence: Facts and Myths

Finology Group – Insurtech Innovation Award 2024

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff

Strategies for Landing an Oracle DBA Job as a Fresher

Tech Trends Report 2024 Future Today Institute.pdf

What Are The Drone Anti-jamming Systems Technology?

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...

2024: Domino Containers - The Next Step. News from the Domino Container commu...

How to Troubleshoot Apps for the Modern Connected Worker

Tata AIG General Insurance Company - Insurer Innovation Award 2024

TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments

AWS Community Day CPH - Three problems of Terraform

HTML Injection Attacks: Impact and Mitigation Strategies

Diffusion models beat gans on image synthesis

1. Diffusion Models Beat GANs on Image Synthesis OpenAI Presented by: Beeren Sahu

4. Likelihood based models Generative models whose objective function is log-likelihood. Examples: Autoregressive models, VAE, Diffusion Models, etc GANs use sample quality metrics such as FID, Inception Score and Precision some of these metrics do not fully capture diversity. Negative Log Likelihood does otherwise

5. GAN vs Likelihood-base models ● GANs capture less diversity than state-of-the-art likelihood-based models ○ Ali Razavi, Aaron van den Oord, and Oriol Vinyals. Generating diverse high-fidelity images with VQ-VAE-2. arXiv:1906.00446, 2019. ○ Alex Nichol and Prafulla Dhariwal. Improved denoising diffusion probabilistic models. arXiv:2102.09672, 2021. ○ Charlie Nash, Jacob Menick, Sander Dieleman, and Peter W. Battaglia. Generating images with sparse representations. arXiv:2103.03841, 2021. ● GANs are often difficult to train, collapsing without carefully selected hyperparameters and regularizers ● Thus, make them difficult to scale and apply to new domains.

6. GAN vs Likelihood-base models ● Pros: ○ these models capture more diversity ○ typically easier to scale and train than GANs ● Cons: ○ they still fall short in terms of visual sample quality ○ except for VAEs, sampling from these models is slower than GANs

7. Diffusion Model Diffusion models are a class of likelihood-based models which have recently been shown to produce high-quality images while offering desirable properties such as distribution coverage, a stationary training objective, and easy scalability. This paper: ● achieve an FID of 2.97 on ImageNet 128×128 (May 2020) ● Beats previous record holder BigGAN-deep with FID of 5.7 (2018) “We hypothesize that the gap between diffusion models and GANs stems from at least two factors: first, that the model architectures used by recent GAN literature have been heavily explored and refined; second, that GANs are able to trade off diversity for quality, producing high quality samples but not covering the whole distribution.”

8. PapersWithCode LeaderBoard

9. Denoising Diffusion Probabilistic Models Probabilistic Models: learns some probability Distribution (PD) Diffusion: gradually diffusing/adding noise to input image (Forward process) Denoising: removing the noise to synthesis realistic image (Backward process)

10. Intuition Forward: q(xt|xt-1) Backward: pθ(xt-1|xt )

11. Evolution of Diffusion Models Feb 2021 Improved Denoising Diffusion Probabilistic Models Alex Nichol, Prafulla Dhariwal OpenAI May 2021 Diffusion Models Beat GANs on Image Synthesis Prafulla Dhariwal, Alex Nichol OpenAI 2015 Deep Unsupervised Learning using Nonequilibrium Thermodynamics Jascha Sohl-Dickstein, Eric Weiss, Niru Maheswaranathan, and Surya Ganguli Stanford and UC Berkeley 2019 Denoising Diffusion Probabilistic Models Jonathan Ho, Ajay Jain, Pieter Abbeel UC Berkeley

12. Forward Process

13. Backward Process

14. Loss

15. Loss (simplified)

16. Summary

17. Resources Repo: https://github.com/openai/guided-diffusion Video: https://www.youtube.com/watch?v=W-O7AZNzbzQ VAE: https://towardsdatascience.com/understanding-variational-autoencoders-vaes- f70510919f73