Meet Marsey, an adorable cat from a Telegram sticker pack. I've been trying to get Stable Diffusion (SD) to generate more of this character, and wanted to share my results for anyone else working on a specific 2D style.
Comparisons
a photo of a spaceman Marsey in outer space
Textual Inversion / DreamBooth
a photo of Marsey as a lifeguard
Textual Inversion / DreamBooth
a photo of Marsey as a scientist
Textual Inversion / DreamBooth
a photo of Marsey as a gardener
Textual Inversion / DreamBooth
What I've noticed:
Textual inversion:
Excels at style transfer. "elephant in the style of Marsey"
May benefit from more images. My run with 74 images performed better than the one with 3.
Best results (both in terms of style transfer and character preservation) at 25,000 steps
DreamBooth:
Far, far better for my use case. The character is more editable and the composition improves. It doesn't match the art style quite as well, though.
3 images worked better than 72
Works extremely well with cross-attention prompt2prompt (the "img2img alternative test" script in automatic1111's UI)
1,000 steps (30 min on an A6000) is sufficient for good results
Worth mentioning: it's usable with deforum for animations
Combining the two doesn't seem to work, unfortunately. The next step might be either to directly finetune the network itself and apply one of these techniques afterwards, or possibly training the classifier.
Jannied
updated post, frick reddit
Good Morning, I hate redditors
redditors suck
Whoa. Stable Diffusion wins so hard it's not even funny.
Tell the people that they can experiment with 4GB models locally, then, when they know exactly which commands to run, rent a bunch of compute from Amazon for like one dollah.
Or just buy a decent graphics card
I love this website so much.
Anyways, with DreamBooth I have gotten images that literally are almost good enough to be normal emojis on the site (haven't submitted them because I am not sure I want to open that genie's bottle). The biggest problem I have seen with DB is that a lot of the time it doesn't follow the prompt very well, and there are situations in which the AI clearly doesn't know how to fulfill my request. For instance, I haven't been able to generate a good image of Marsey holding a sword: in most, the sword is hovering in front of her.
BUT the art I have been able to get with it has been really cool
Did you try using the brackets to emphasise the prompts or something?
https://rdrama.net/post/105471/taylor-swift
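For anyone unsure what "brackets" means here, below is a minimal sketch of how bracket emphasis is usually read, assuming the convention from automatic1111's web UI: each pair of ( ) multiplies a fragment's attention weight by 1.1, each pair of [ ] divides it by 1.1, and (text:1.5) sets the weight directly. The function name emphasis_weight is made up for illustration, and it only handles a single fully wrapped fragment, not a whole prompt.

```python
import re


def emphasis_weight(fragment: str) -> float:
    """Return the attention weight implied by the brackets around one fragment.

    Assumed convention (automatic1111-style): "(x)" -> 1.1, "[x]" -> 1/1.1,
    "(x:1.5)" -> 1.5. Hypothetical helper, not part of any library.
    """
    text = fragment.strip()

    # Explicit weight form, e.g. "(sword:1.5)".
    explicit = re.fullmatch(r"\((.+):(\d+(?:\.\d+)?)\)", text)
    if explicit:
        return float(explicit.group(2))

    weight = 1.0
    # Peel off one matching pair of outer brackets per iteration.
    while len(text) >= 2:
        if text[0] == "(" and text[-1] == ")":
            weight *= 1.1
        elif text[0] == "[" and text[-1] == "]":
            weight /= 1.1
        else:
            break
        text = text[1:-1]
    return round(weight, 4)


if __name__ == "__main__":
    for sample in ["sword", "(sword)", "((sword))", "[sword]", "(sword:1.5)"]:
        print(sample, emphasis_weight(sample))
```

So a prompt like "Marsey holding a ((sword))" would, under this assumption, push the sword token to roughly 1.21x attention. Whether that fixes the hovering sword is a separate question.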
are you sure you're just not telling the AI to draw them more jew-y?
IDK if I did with the sword prompt, but I tried with some others
img2img with prompt2prompt is the best way I've found to make Marsey editable. btw, I noticed your pruned model is based off of 2000.ckpt - if you're still using that try u1000.ckpt too (trained on three upscaled images)
Don't be afraid of using Img2Img to redraw a crappy mspaint of what you want
Automated Marseys!
The Marsey Army rises!!!
anon -- can you suck my peepee please?
Please put it at the top or in the title that the post has been jannied
I try to find it via new and waste time if it's gone. I search for it instead of clicking the link to avoid Reddit's anti-brigading cowtools
what the frick
I reported links to literal porn on /r/stablediffusion and I got a message back that it wouldn't be jannied
Post it again and ask why it was removed
Because of the nazi cat duh!
@Transgender_spez
Marsey is banned as a hate symbol on Reddit.
When will this become user friendly?
@Transgender_spez
if you don't mind colabs, you can try the Marseys here: https://rdrama.net/h/marsey/post/104962/dreambooth-marseys-marseyastronaut-added-colab-instructions
if you're wondering when you can finetune SD on your own stuff, though, not sure. Emad says finetuning will be available in dreamstudio soon, but he overpromises like crazy
i think there's img2img on dreamstudio now and they're using version 1.5: https://beta.dreamstudio.ai/dream
https://www.mage.space/ is probably the easiest way to generate stable diffusion images
Great work. As ever, frick jannies.
So wait, DreamBooth takes 30 GB of VRAM to run, right? But does it spit out embeddings that you can use with Stable Diffusion, like Textual Inversion does? I hope someone rents a GPU and makes a big database website of popular characters and shit, especially if you could fetch that data from a Stable Diffusion client. That would be extremely useful.
Exciting times for AI, very nice marsey results btw
(this is my 1000th comment!!!!! )
It gives you an entirely new 2 GB model, so sadly it's pretty heavyweight. It might be possible to train multiple objects into one model in the future, though. I'm expecting all this to keep changing rapidly for a while.
Darn, AI really will put all Marseysans out of work.
What would happen if you fed a couple thousand renders of a 3D Marsey into textual inversion? 3D renders seem to give good results
@HeyMoon
I tried this with 5 renderings of the 3D Marsey earlier, it didn't really get it. Maybe bumping the number would help
Renders would have to be from every angle and have a variety of real backgrounds to avoid overfitting. Just five and it doesn't know how big marsey is supposed to be.
The post doesn't even have a removed message on new reddit, is this a new type of removal (like [removed by reddit]) or did they just make removals more subtle?
just being subtle I think, the API response says
"removed_by_category": "reddit"
https://www.reddit.com/api/info.json?id=t3_xjlv19
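If you want to check that field yourself, here's a rough sketch of pulling removed_by_category out of an info.json response. The SAMPLE_RESPONSE below is a hand-trimmed, hypothetical payload that only mimics the usual Listing -> children -> data nesting (the real response has many more fields), and removal_category is a made-up helper name. No network call is made, so paste in a real response to try it on the actual post.

```python
import json
from typing import Optional

# Hypothetical, trimmed-down payload shaped like a Reddit Listing.
# Only the fields this sketch reads are included.
SAMPLE_RESPONSE = """
{
  "kind": "Listing",
  "data": {
    "children": [
      {"kind": "t3", "data": {"id": "xjlv19", "removed_by_category": "reddit"}}
    ]
  }
}
"""


def removal_category(payload: dict) -> Optional[str]:
    """Return removed_by_category for the first item, or None if absent."""
    children = payload.get("data", {}).get("children", [])
    if not children:
        return None  # nothing came back for that id
    return children[0].get("data", {}).get("removed_by_category")


if __name__ == "__main__":
    payload = json.loads(SAMPLE_RESPONSE)
    print(removal_category(payload))
```

A None result would just mean the field is missing or null in whatever payload you feed in.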
TI seems to think she's a dog like half the time. Some of those DB ones are really impressive
Remind me again which is Textual Inversion and which is DreamBooth