Even_Adder@lemmy.dbzer0.com to

Stable Diffusion@lemmy.dbzer0.comEnglish · 10 months ago

PALP: Prompt Aligned Personalization of Text-to-Image Models

prompt-aligned.github.io

2

4

PALP: Prompt Aligned Personalization of Text-to-Image Models

prompt-aligned.github.io

Even_Adder@lemmy.dbzer0.com to

Stable Diffusion@lemmy.dbzer0.comEnglish · 10 months ago

2

TL;DR

Prompt aligned personalization allow rich and complex scene generation, including all elements of a condition prompt (right).

Abstract

Content creators often aim to create personalized images using personal subjects that go beyond the capabilities of conventional text-to-image models. Additionally, they may want the resulting image to encompass a specific location, style, ambiance, and more. Existing personalization methods may compromise personalization ability or the alignment to complex textual prompts. This trade-off can impede the fulfillment of user prompts and subject fidelity. We propose a new approach focusing on personalization methods for a single prompt to address this issue. We term our approach prompt-aligned personalization. While this may seem restrictive, our method excels in improving text alignment, enabling the creation of images with complex and intricate prompts, which may pose a challenge for current techniques. In particular, our method keeps the personalized model aligned with a target prompt using an additional score distillation sampling term. We demonstrate the versatility of our method in multi- and single-shot settings and further show that it can compose multiple subjects or use inspiration from reference images, such as artworks. We compare our approach quantitatively and qualitatively with existing baselines and state-of-the-art techniques.

Paper: https://prompt-aligned.github.io/

Project Page: https://prompt-aligned.github.io/

You must log in or # to comment.

Chat

Scew@lemmy.world
link
fedilink
English
arrow-up
2·
10 months ago
That’s pretty cool
Lemmy Tagginator@utter.onlineB
link
fedilink
arrow-up
2
arrow-down
1·
10 months ago
New Lemmy Post: PALP: Prompt Aligned Personalization of Text-to-Image Models (https://lemmy.dbzer0.com/post/12196023)
Tagging: #StableDiffusion

(Replying in the OP of this thread (NOT THIS BOT!) will appear as a comment in the lemmy discussion.)

I am a FOSS bot. Check my README: https://github.com/db0/lemmy-tagginator/blob/main/README.md

Stable Diffusion@lemmy.dbzer0.com

stable_diffusion@lemmy.dbzer0.com

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !stable_diffusion@lemmy.dbzer0.com

Discuss matters related to our favourite AI Art generation technology

Also see

Other communities

Visibility: Public

This community can be federated to other instances and be posted/commented in by their users.

3 users / day
23 users / week
133 users / month
839 users / 6 months
171 local subscribers
4.31K subscribers
840 Posts
1.8K Comments
Modlog

mods:
db0@lemmy.dbzer0.com