Authors: Omri Avrahami, Dani Lischinski, Ohad Fred

Keywords: Diffusion Model, Generative Model, CLIP, Multi-modality, Vision-Language, Text-driven Image Manipulation

Contributions

Preliminaries

CLIP (Contrastive Language Image Pre-training)

Screen Shot 2022-02-26 at 7.54.42 PM.png

Screen Shot 2022-02-26 at 8.07.18 PM.png

Screen Shot 2022-02-26 at 8.23.07 PM.png

Method

Target Requirements