@HeyMoon's comment on 'DreamBooth Marseys :marseyastronaut: (added colab instructions)'

DreamBooth Marseys :marseyastronaut:

Colab:

Open automatic1111's colab: https://colab.research.google.com/drive/1Iy-xW9t1-OQWhb0hNxueGij8phCyluOh
Replace this cell:

#@title Normal 1.4 model
# get a token from https://huggingface.co/settings/tokens
user_token = "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" #@param {type:"string"}
user_header = f"\"Authorization: Bearer {user_token}\""
!wget --header={user_header} https://huggingface.co/CompVis/stable-diffusion-v-1-4-original/resolve/main/sd-v1-4.ckpt -O model.ckpt

With:

!apt install megatools
!megadl "https://mega.co.nz/#!FxclSKoL!EFSM4nLlXMuOvBLkoZNmtOH4Y8oycjrU7h2Hn6mKl1k"

Run (it'll take a few minutes to download)
Refer to Marsey as a sks cat in your prompt. a photo of a sks cat as a gardener etc

Download the weights: https://models.rdra.ma/

Development on Stable Diffusion has been happening at a wild pace, and there's already written a working implementation of DreamBooth for it. Whereas Textual Inversion refines your prompt, DreamBooth is able to finetune the model itself to teach it new concepts. I did one run with 87 Marseys and another with just 3 upscaled ones. So far the upscaled run (u1000.ckpt) is looking best.

See the Textual Inversion posts for a comparison with that. DreamBooth is far better at keeping her colors consistent and producing recognizable images. Decent results were something like 1 in 50 before, now they're nearly every image.

a photo of Marsey

a photo of a spaceman Marsey in outer space

a photo of Marsey as a gardener

Jump in the discussion.

No email address required.

View entire discussion

HeyMoon hey/moon :marseyfoxgloveyourself:

touch foxglove NOW :!marseyfoxgloveyourself:

2yr ago #2739381 Edited 2yr ago

comparison of characters and styles in different marsey attempts. "sks cat" is dreambooth version, "fmarsey" is float-trip's original, "smarsey2" is my (shit) attempt at using textual-inversion

smaller model: https://mega.nz/file/AWsDGIAC#3EXe-ChMJaIBOlale2--ld6usX55Q9H6iqLVnRHlvNA

2 Context

HeyMoon 2yr ago #2739534 Edited 2yr ago

version that has what the originals would look like unmodified by style

comparison of holding a sword

1 Context

HeyMoon 2yr ago #2739775 Edited 2yr ago

Comparison of CFG scales

lol she be gooning with friends

HeyMoon 2yr ago #2740076

horny marsey :marseyflushzoom:

HeyMoon 2yr ago #2740234

porn :marseyflushzoom:

HeyMoon 2yr ago #2739914 Edited 2yr ago

[row] marsey holding [column]

float-trip they/them :marseystars2:

ad astra per asperga :marseystars2:

HeyMoon 2yr ago #2740305

textual inversion is no good with this. I just trained embeddings off of the new model here - https://models.rdra.ma/embeddings/

elephant in the style of * (old model/embeddings)

elephant in the style of * (new model/embeddings)

at some point it might be worth commissioning Marsey in 3-5 more standard poses for the AI to learn from. a few issues (like her tail) seem to stem from the dataset

Top Poster of the Day:

911roofer

Current Registered Users: 30,836

CURRENT EVENTS:

/h/kappa Monthly Tournament

Find Rightoid Infighting

Rules:

Related subreddits: