There is a lot of enthusiasm about generating marseys using Stable Diffusion. Recently @float-trip did something that just a few months ago I predicted would take five years - make a marsey generating bot () ( to be fair my point was about the long-term costs involved in running a bot on a node that has the capabilities to do so ). Anywho, there has been a flood of new marseys that have been submitted, prompting that rat bastard carp to tell you all to knock it off (AI-chads are the most oppressed minority on earth). BUT some AI marseys are really good and cute so how can you make good marseys using AI?
SETUP
Alright so this guide is only for non-poorcels that have a decent GPU. By decent GPU, I mean NVIDIA . (I recently upgraded my GPU from an AMD RX570, and I tried my absolute hardest to make it work on my AMD but I couldn't get it. There are """guides""" online but there are a lot of implicit assumptions about what type of GPU you are using, it has to have so much VRAM and it has to be certain series and the documentation is a FUCKING nightmare (@DrTransmisia can relate))
(Also, I am running it locally because I want to generate tons of variations in the background while I am wageslaving. You can run it via the cloud too if you want tho)
You will need:
A PC with a GPU with a decent amount of VRAM (10+ GB is best, but the requirements keep going down as CS-chads continue refining the model. nowadays you can technically run it with 4GB with certain versions of the code but YMMV because i haven't done that yet)
Python 3.10.6 (You should add it to PATH)
git (technically optional but it will make your life easier)
@float-trip's model. float trip is the real MVP here btw, he has been trying several different things to get this project to work with tons of compute. mad respect my nigga
Get AUTOMATIC1111's webui repo. This is the best repo out there for this kinda stuff, partially because it was made by 4chan autists who were trying to generate waifus. If you have git, you can just open up a command prompt and type
git pull https://github.com/AUTOMATIC1111/stable-diffusion-webui.git
. Otherwise, I think there are redneck ways to get the code as a zip file but that's a pain.Place float-trip's model in the /models directory. Rename the model to "model.ckpt"
Double click webui-user.bat in the main directory.
Wait a few minutes for everything to load, until the text stops moving and it tells you to open a weblink.
Open your webbrowser and type "localhost:7860". Viola! You should see the webui
GENERATING MARSEYS - GENERAL TIPS
If you have done everything correctly, and you type the word "marsey" into the prompt, you should see a marsey appear. If not, you may see something related to the planet mars. If that happens, make sure you have the correct model that I linked above. Also try restarting your computer, that helps for some reason ().
With most words, longer descriptions are better. However, with "marsey" I have noticed that sometimes the AI "loses the plot", so to speak. For instance, if I type "joe biden as a dark wizard, very detailed, greg rutkowski, award winning, trending on artstation", I will get a badass picture of DVRK BIDVN. However, if I do "marsey as a dark wizard, very detailed, greg rutkowski, award winning, trending on artstation", the picture generally won't be of DVRK MVRSEY. I think that's because CLIP may be confused about what "marsey" actually is - is she a cat , a person , a style , or an idea ? Who knows??? Anyways, all that to say that shorter is generally better.
stylized using a greg rutkowski type prompt. notice that marsey doesn't really resemble marsey
no stylization
Also, remember that you can use (((noticing))) on words and phrases you want the AI to pay closer attention to. No, that's not a joke, that is literally a part of the code. For instance, if you are finding that "marsey as a knight" is generating pictures of human knights with the wrong style, try "(((marsey))) as a knight".
With most prompts, if you want to get high-quality outputs you should set sampling steps to 150 (otherwise the output will be blurry). However, when generating marseys, I have found that 50 sampling steps is enough to get good marseys. I say this because sampling steps increases the amount of time needed per image generation linearly, so imo it is better to spend that time generating more variations of marseys than a few with marginally higher quality.
Another thing to keep in mind is CFG scale. CFG is "Classifier-Free Guidance". Basically, the lower this number is, the more closely the output will be guided by the prompt. 7, the default, works fine for me.
comparison of CFG scales and step count for the prompt "(((marsey))) as a dark sorcerer"
I like to generate marseys in batches of 100. By default, the UI only lets you do batches of 16. To increase this, just open ui-config.json, and set "txt2img/Batch count/maximum" to 100.
inspiration! prompt was "COLUMN (((marsey))) as a ROW"
FIXING YOUR MARSEYS
Now, no matter how good your favorite outputs are, they probably aren't ready for submission yet. (Hence why carp was mad). Some fixing may be necessary.
The first order of business are the eyes. AI-generated images are often bad with the eyes (although the recent model is pretty good). The reason for this is actually that AI isn't bad with eyes, it is as good at eyes as it is everything else, it's just that our monkeybrains are a lot less forgiving of eye-imperfections than any other imperfection.
There are a couple of different ways to fix the eyes. You can squish them, you can resize them, you can rotate them. If push comes to shove, you can also entirely replace them. Don't feel bad, that's how 99% of marseys are made in the first place (, , )
I use paint.net for marseys because it is free and easy, if you understand paint you should be able to understand paint.net. For most edits I use the brush tool with a size of 4 and 100% hardness. I also color select from other similarly colored parts of the marsey to make sure I am following a color-pallete (most marseys dont use very many colors)
Another tool I use is gaussian blur to blend colors more evenly. Also make sure you remove the background and crop to the marsey cleanly.
It doesn't just have to be fixes! Let your imagination go wild, and use the AI's output as a starting point. For I made her match the russian flag by coloring her sneakers red, and made her tracksuit an adidas tracksuit.
Oh one more thing. Most of the time, marseys will be inline, which means they will be relatively small. So you should make sure that the output is easy to parse from a distance. People should be able to understand it intuitively as a symbol without seeing it up close. is probably my favorite marsey because it works so well as a symbol from a distance - it is obviously representing music or listening to music and is hecking cute.
FINAL STEPS
To avoid spamming carp maybe make a post on /h/marsey first and see what people think before submitting it. I like to tell people what the steps I took to improve the marsey were, and also provide the original, and a few that I like but didn't make the cut. They will probably call you a lazy faggot but that is the essence of rdrama so don't worry about it
also stop making OMG THIS IS THE FIRST THING THE AI OUTPUTTED ISN'T IT SO COOL posts. we know
OTHER FUN TOYS TO PLAY WITH
if you have an idea for a marsey, but you are a shit artist, try sketching it and feeding it into SD. in the webui, there is a tab called img2img. Upload the picture there, and type a description of the image you drew, and click generate. This should generate variations of what you drew.
- there are several things to play around with there. try tinkering with the CFG scale and Denoising level. I have found that denoising about 0.7 gets sufficiently novel results, but try playing around with it on your own. A tool that can help you is the X/Y plot script, which should be fairly self-explanatory.
you can use AI to upscale your results! sometimes this is a bit derpy, and I haven't played around with it much, but it is possible! see the "Extras" tab for that stuff.
you can make LOOPING marseys. That's right, if you want to have a cleanly looping wallpaper of marsey, you can do that! just check the "tiling" checkbox.
a single looping marsey tile
same tile, tiled
- want to make ungodly abominations? you can do that! try changing the width and height to something besides 512x512 to see some really wacky stuff. now, the marsey model performs pretty well compared to most other things, and you can get some normal pictures, like this happy family
... but you can also get really weird stuff, like this guy
IS IT ART?
I think there is more art to this process than people think there is. Firstly, I would argue the act of generating and cherrypicking the best results is an artistic process itself. You type in a prompt (a form of artistic inspiration) and you look for outputs that you like (also an artistic process). Also, as you learn how the software works, and get an intuitive feel for how it works, you actually gain skill. For instance, knowing the effect of CFG scale, and what other AI tools can improve your work, is a form of mastery.
Secondly, of course, there needs to be an editting process, and this also shows about the same level of artistry as 75% of marseys. After all, most marseys are kitbashed anyways by amateurs so what's the big deal.
+ = , for instance.
Ultimately I think art is mostly about making things that people enjoy rather than a specific process. I think most people here would agree. I think that is a good thing as long as it is used sparingly, with an eye for quality instead of quantity.
I think a lot of people rush to the most negative possible conclusion about a thing rather than considering the positives. I think that AI art has drawbacks, but I also think it can make the world a slightly nicer place, with more cool things to look at
Jump in the discussion.
No email address required.
cool it with the antisemitic remarks
Jump in the discussion.
No email address required.
More options
Context