Inbar is a postdoctoral researcher, working with Prof. Tomer Michaeli. She completed her PhD in Computer Science at the Hebrew University in 2022, under the supervision of Prof. Raanan Fattal. She received her M.Sc. in Computer Science from the Hebrew University in 2014. Her research interests include image generation, object recognition, image manipulations, and perception.
Denoising diffusion probabilistic models (DDPMs) employ a sequence of white Gaussian noise samples to generate an image. In analogy with GANs, those noise maps could be considered as the latent code associated with the generated image. However, this native noise space does not possess a convenient structure and is thus challenging to work with in editing tasks.
Here, we propose an alternative latent noise space for DDPM that enables a wide range of editing operations via simple means. We present an inversion method for extracting these edit-friendly noise maps for any given image (real or synthetically generated). As opposed to the native DDPM noise space, the edit-friendly noise maps do not have a standard normal distribution and are not statistically independent across timesteps. However, they allow perfect reconstruction of any desired image, and simple transformations on them translate into meaningful manipulations of the output image (e.g. shifting, color edits).
Moreover, in text-conditional models, fixing those noise maps while changing the text prompt modifies semantics while retaining structure. We illustrate how this property enables text-based editing of real images via the diverse DDPM sampling scheme (in contrast to the popular non-diverse DDIM inversion). We also show how it can be used within existing diffusion-based editing methods to improve their quality and diversity.
Please see the project page: https://github.com/inbarhub/DDPM_inversion
8:45 | Reception |
---|---|
9:30 | Opening remarks by WiDS TLV ambassadors Noah Eyal Altman, Or Basson, and Nitzan Gado |
9:45 | Dr. Aya Soffer, IBM: "Putting Generative AI to Work: What Have We Learned So Far?" |
10:15 | Prof. Reut Tsarfaty, Bar-llan University: "Will Hebrew Speakers Be Able to Use Generative AI in Their Native Tongue?" |
10:45 | Poster Pitches |
10:55 | Break |
11:10 | Lightning talks |
12:30 | Lunch & poster session |
---|---|
13:30 | Roundtable session & poster session |
14:15 | Roundtable closing |
14:30 | Break |
14:40 | Naomi Ken Korem, Lightricks: "Mastering the Art of Generative Models: Training and Controlling Text-to-Video Models" |
15:00 | Dr. Yael Mathov, Intuit: "Surviving the AI-pocalypse: Your Guide to LLM Security" |
15:20 | Closing remarks |
15:30 | The end |
WiDS Tel Aviv is an independent event that is organized by Intuit’s WiDS TLV ambassadors as part of the annual WiDS Worldwide conference, the WiDS Datathon, and an estimated 200 WiDS Regional Events worldwide. Everyone is invited to attend all WiDS conference and WiDS Datathon Workshop events which feature outstanding women doing outstanding work.
© 2018-2024 WiDS TLV – Intuit. All rights reserved.
Scotty – By Nir Azoulay
Design: Sharon Geva