
Prompting Hard or Hardly Prompting: Prompt Inversion for Text-to-Image Diffusion Models

This repository is the PyTorch implementation of the paper:

Prompting Hard or Hardly Prompting: Prompt Inversion for Text-to-Image Diffusion Models (CVPR 2024)

Shweta Mahajan, Tanzila Rahman, Kwang Moo Yi, Leonid Sigal

Requirements

The code in this repository is based on the Stable Diffusion repository.

Navigating the Prompt Inversion

The PH2P code modifies the following modules:

  1. ddpm.py: adds the LBFGS optimizer and saves the candidate prompts after each optimization iteration.
  2. clip_transformer is patched to localclip_transformer; LocalCustomTokenEmbedding implements the projection onto the vocabulary (see the sketch after this list).
  3. Specify the model path in main_textual_inversion.py.
  4. embedding_matrix.pt contains the embeddings for the CLIP vocabulary (vocab.json).
  5. Download the model checkpoint (Stable Diffusion v1.4 or v1.5) and save it as models/ldm/stable-diffusion-v1/model.ckpt.
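
The key algorithmic step is projecting the continuously optimized token embeddings back onto the discrete CLIP vocabulary at every iteration. Below is a minimal PyTorch sketch of that projection inside an LBFGS loop; the names, shapes, prompt length, placeholder loss, and straight-through gradient trick are illustrative assumptions, not the repository's exact implementation:

```python
import torch
import torch.nn.functional as F

# Stand-in for embedding_matrix.pt: one row per CLIP vocabulary token.
# (SD v1.x uses the CLIP ViT-L/14 text encoder: vocab size 49408, dim 768.)
embedding_matrix = torch.randn(49408, 768)

def project_to_vocab(soft: torch.Tensor) -> torch.Tensor:
    """Snap each continuous token embedding to its nearest vocabulary row."""
    sims = F.normalize(soft, dim=-1) @ F.normalize(embedding_matrix, dim=-1).T
    return embedding_matrix[sims.argmax(dim=-1)]

def denoising_loss(embeddings: torch.Tensor) -> torch.Tensor:
    # Placeholder for the Stable Diffusion noise-prediction loss; the real
    # code conditions the UNet on these embeddings and compares noise terms.
    return embeddings.pow(2).mean()

# Optimize a fixed number of free token embeddings with LBFGS, as in ddpm.py.
# The prompt length (16) is an assumption for illustration.
soft = torch.randn(16, 768, requires_grad=True)
optimizer = torch.optim.LBFGS([soft], lr=0.1)

def closure():
    optimizer.zero_grad()
    hard = project_to_vocab(soft)
    # Straight-through estimator: forward with the hard (projected)
    # embeddings, backpropagate through the soft ones.
    loss = denoising_loss(soft + (hard - soft).detach())
    loss.backward()
    return loss

for _ in range(50):
    optimizer.step(closure)
```

Projecting at every step keeps the prompt on the discrete vocabulary throughout optimization, which is presumably what makes the recovered prompt directly readable rather than a soft embedding.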

To run the prompt inversion, specify the image path in inversion_config.json (a hypothetical example follows the command) and run:

   	python main_textual_inversion.py
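
A hypothetical inversion_config.json; the key name below is an illustration only, so consult the file shipped with the repository for the actual schema:

```json
{
    "image_path": "examples/target.png"
}
```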

The prompts will be saved in ./logs_forward_pass/.

The best prompt for a given image is the one whose generated image has the highest CLIP similarity to the target image. This step additionally requires transformers 4.25.1 and diffusers 0.12.1:

   	python get_best_text.py
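
A minimal sketch of the ranking step using the transformers CLIP API; the checkpoint choice, file names, and candidate prompts are assumptions, and get_best_text.py is the authoritative implementation:

```python
import torch
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

# Checkpoint choice is an assumption; any CLIP image encoder would do.
model = CLIPModel.from_pretrained("openai/clip-vit-large-patch14").eval()
processor = CLIPProcessor.from_pretrained("openai/clip-vit-large-patch14")

@torch.no_grad()
def clip_similarity(a: Image.Image, b: Image.Image) -> float:
    """Cosine similarity between the CLIP image embeddings of two images."""
    inputs = processor(images=[a, b], return_tensors="pt")
    feats = model.get_image_features(**inputs)
    feats = feats / feats.norm(dim=-1, keepdim=True)
    return (feats[0] @ feats[1]).item()

# Hypothetical candidate prompts (saved under ./logs_forward_pass/) paired
# with images generated from them, e.g. via diffusers' StableDiffusionPipeline.
candidates = {
    "a photograph of a corgi on the beach": "gen_000.png",
    "small dog standing by ocean waves": "gen_001.png",
}
target = Image.open("target.png")
best = max(candidates, key=lambda p: clip_similarity(target, Image.open(candidates[p])))
print("best prompt:", best)
```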

BibTeX

@inproceedings{ph2p2024cvpr,
  title     = {Prompting Hard or Hardly Prompting: Prompt Inversion for Text-to-Image Diffusion Models},
  author    = {Shweta Mahajan and Tanzila Rahman and Kwang Moo Yi and Leonid Sigal},
  booktitle = {CVPR},
  year      = {2024}
}
