this post was submitted on 07 Dec 2023

# Abstract

We present LooseControl to allow generalized depth conditioning for diffusion-based image generation. ControlNet, the SOTA for depth-conditioned image generation, produces remarkable results but relies on having access to detailed depth maps for guidance. Creating such exact depth maps, in many scenarios, is challenging. This paper introduces a generalized version of depth conditioning that enables many new content-creation workflows. Specifically, we allow (C1) scene boundary control for loosely specifying scenes with only boundary conditions, and (C2) 3D box control for specifying layout locations of the target objects rather than the exact shape and appearance of the objects. Using LooseControl, along with text guidance, users can create complex environments (e.g., rooms, street views, etc.) by specifying only scene boundaries and locations of primary objects. Further, we provide two editing mechanisms to refine the results: (E1) 3D box editing enables the user to refine images by changing, adding, or removing boxes while freezing the style of the image. This yields minimal changes apart from changes induced by the edited boxes. (E2) Attribute editing proposes possible editing directions to change one particular aspect of the scene, such as the overall object density or a particular object. Extensive tests and comparisons with baselines demonstrate the generality of our method. We believe that LooseControl can become an important design tool for easily creating complex environments and be extended to other forms of guidance channels.
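To make the "3D box control" idea a bit more concrete, here is a toy sketch of a box-only depth map: a flat background with a few rectangles at chosen depths in place of detailed geometry. This is not the authors' code and is much simpler than the paper's setup (no perspective, no real 3D boxes). The names `box_depth_map` and `to_inverse_depth_image` are made up for illustration, and the "nearer is brighter" output convention is an assumption about what a depth-conditioned model expects.

```python
import numpy as np

def box_depth_map(height, width, boxes, background_depth=10.0):
    """Rasterize screen-aligned boxes into a coarse depth map (toy example).

    boxes: iterable of (x0, y0, x1, y1, depth) in pixel coordinates,
    with x1/y1 exclusive. Smaller depth means closer to the camera.
    """
    # Start with a flat background, standing in for the scene boundary.
    depth = np.full((height, width), background_depth, dtype=np.float32)
    for x0, y0, x1, y1, d in boxes:
        # Clip the rectangle to the image bounds.
        x0, x1 = max(0, x0), min(width, x1)
        y0, y1 = max(0, y0), min(height, y1)
        if x0 >= x1 or y0 >= y1:
            continue  # box lies entirely outside the image
        region = depth[y0:y1, x0:x1]
        # Where boxes overlap, keep the nearest surface.
        np.minimum(region, d, out=region)
    return depth

def to_inverse_depth_image(depth):
    """Normalize to an 8-bit image where nearer surfaces are brighter.

    Assumes all depths are strictly positive.
    """
    inv = 1.0 / depth
    span = max(float(inv.max() - inv.min()), 1e-8)
    inv = (inv - inv.min()) / span
    return (inv * 255).astype(np.uint8)

if __name__ == "__main__":
    # Two hypothetical objects: a near box and a larger, farther box.
    d = box_depth_map(64, 64, [(8, 30, 28, 60, 2.0), (20, 10, 60, 50, 5.0)])
    img = to_inverse_depth_image(d)
    print(img.shape, img.dtype)
```

Moving, adding, or removing a tuple in `boxes` changes only the corresponding region of the map, which is roughly the kind of edit the abstract calls 3D box editing (E1).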

Paper: https://arxiv.org/abs/2312.03079

Code: https://github.com/shariqfarooq123/LooseControl

Demo: https://huggingface.co/spaces/shariqfarooq/LooseControl

Project Page: https://shariqfarooq123.github.io/loose-control/

top 3 comments
[–] tagginator@utter.online 2 points 11 months ago

New Lemmy Post: LooseControl: Lifting ControlNet for Generalized Depth Conditioning (https://lemmy.dbzer0.com/post/9886773)
Tagging: #StableDiffusion

(Replying in the OP of this thread (NOT THIS BOT!) will appear as a comment in the lemmy discussion.)

I am a FOSS bot. Check my README: https://github.com/db0/lemmy-tagginator/blob/main/README.md

[–] ShrimpsIsBugs@programming.dev 2 points 11 months ago

I kinda like the boxed corals

[–] Even_Adder@lemmy.dbzer0.com 1 point 11 months ago

They scare me.