Orange site: https://news.ycombinator.com/item?id=33726816
It is our pleasure to announce the open-source release of Stable Diffusion Version 2.
The original Stable Diffusion V1 led by CompVis changed the nature of open source AI models and spawned hundreds of other models and innovations worldwide. It had one of the fastest climbs to 10K Github stars of any software, rocketing through 33K stars in less than two months.
The dynamic team of Robin Rombach (Stability AI) and Patrick Esser (Runway ML) from the CompVis Group at LMU Munich headed by Prof. Dr. Björn Ommer, led the original Stable Diffusion V1 release. They built on their prior work of the lab with Latent Diffusion Models and got critical support from LAION and Eleuther AI. You can read more about the original Stable Diffusion V1 release in our earlier blog post. Robin is now leading the effort with Katherine Crowson at Stability AI to create the next generation of media models with our broader team.
Stable Diffusion 2.0 delivers a number of big improvements and features versus the original V1 release, so let's dive in and take a look at them.
New Text-to-Image Diffusion Models
The Stable Diffusion 2.0 release includes robust text-to-image models trained using a brand new text encoder (OpenCLIP), developed by LAION with support from Stability AI, which greatly improves the quality of the generated images compared to earlier V1 releases. The text-to-image models in this release can generate images with default resolutions of both 512x512 pixels and 768x768 pixels.
These models are trained on an aesthetic subset of the LAION-5B dataset created by the DeepFloyd team at Stability AI, which is then further filtered to remove adult content using LAION's NSFW filter.
Super-resolution Upscaler Diffusion Models
Stable Diffusion 2.0 also includes an Upscaler Diffusion model that enhances the resolution of images by a factor of 4. Below is an example of our model upscaling a low-resolution generated image (128x128) into a higher resolution image (512x512). Combined with our text-to-image models, Stable Diffusion 2.0 can now generate images with resolutions of 2048x2048--or even higher.
Depth-to-Image Diffusion Model
Our new depth-guided stable diffusion model, called depth2img, extends the previous image-to-image feature from V1 with brand new possibilities for creative applications. Depth2img infers the depth of an input image (using an existing model), and then generates new images using both the text and depth information.
The input image on the left can produce several new images (on the right). This new model can be used for structure-preserving image-to-image and shape-conditional image synthesis.
Depth-to-Image can offer all sorts of new creative applications, delivering transformations that look radically different from the original but which still preserve the coherence and depth of that image:
Depth-to-Image preserves coherence.
Updated Inpainting Diffusion Model
We also include a new text-guided inpainting model, fine-tuned on the new Stable Diffusion 2.0 base text-to-image, which makes it super easy to switch out parts of an image intelligently and quickly.
Just like the first iteration of Stable Diffusion, we’ve worked hard to optimize the model to run on a single GPU–we wanted to make it accessible to as many people as possible from the very start. We’ve already seen that, when millions of people get their hands on these models, they collectively create some truly amazing things. This is the power of open source: tapping the vast potential of millions of talented people who might not have the resources to train a state-of-the-art model, but who have the ability to do something incredible with one.
This new release, along with its powerful new features like depth2img and higher resolution upscaling capabilities, will serve as the foundation of countless applications and enable an explosion of new creative potential.
For more details about accessing the model, please check out the release notes on our GitHub: https://github.com/Stability-AI/StableDiffusion
We will offer active support to this repository as our direct contribution to open source AI and look forward to all the amazing things you all build on it.
We are releasing these models into the Stability AI API Platform (https://platform.stability.ai/) and DreamStudioin the next few days. We will be sending out an update on this with information for developers and partners, including pricing updates. We hope you all enjoy these updates!
Jump in the discussion.
No email address required.
Snapshots:
https://undelete.pullpush.io/r/sdforall/comments/z3788q/looks_like_stable_diffusion_20_was_released_with
https://web.archive.org/https://old.reddit.com/r/sdforall/comments/z3788q/looks_like_stable_diffusion_20_was_released_with
https://ghostarchive.org/search?term=https://old.reddit.com/r/sdforall/comments/z3788q/looks_like_stable_diffusion_20_was_released_with
https://archive.ph/?url=https://old.reddit.com/r/sdforall/comments/z3788q/looks_like_stable_diffusion_20_was_released_with&run=1 (click to archive)
https://news.ycombinator.com/item?id=33726816:
https://web.archive.org/https://news.ycombinator.com/item?id=33726816
https://ghostarchive.org/search?term=https://news.ycombinator.com/item?id=33726816
https://archive.ph/?url=https://news.ycombinator.com/item?id=33726816&run=1 (click to archive)
Stable Diffusion Version 2:
https://web.archive.org/https://github.com/Stability-AI/stablediffusion
https://ghostarchive.org/search?term=https://github.com/Stability-AI/stablediffusion
https://archive.ph/?url=https://github.com/Stability-AI/stablediffusion&run=1 (click to archive)
Stable Diffusion V1:
https://web.archive.org/https://github.com/CompVis/stable-diffusion
https://ghostarchive.org/search?term=https://github.com/CompVis/stable-diffusion
https://archive.ph/?url=https://github.com/CompVis/stable-diffusion&run=1 (click to archive)
CompVis:
https://web.archive.org/https://ommer-lab.com/
https://ghostarchive.org/search?term=https://ommer-lab.com/
https://archive.ph/?url=https://ommer-lab.com/&run=1 (click to archive)
Stability AI:
https://web.archive.org/https://stability.ai/
https://ghostarchive.org/search?term=https://stability.ai/
https://archive.ph/?url=https://stability.ai/&run=1 (click to archive)
Runway ML:
https://web.archive.org/https://runwayml.com/
https://ghostarchive.org/search?term=https://runwayml.com/
https://archive.ph/?url=https://runwayml.com/&run=1 (click to archive)
Prof. Dr. Björn Ommer:
https://web.archive.org/https://ommer-lab.com/people/ommer/
https://ghostarchive.org/search?term=https://ommer-lab.com/people/ommer/
https://archive.ph/?url=https://ommer-lab.com/people/ommer/&run=1 (click to archive)
Latent Diffusion Models:
https://web.archive.org/https://arxiv.org/abs/2112.10752
https://ghostarchive.org/search?term=https://arxiv.org/abs/2112.10752
https://archive.ph/?url=https://arxiv.org/abs/2112.10752&run=1 (click to archive)
LAION:
https://web.archive.org/https://laion.ai/
https://ghostarchive.org/search?term=https://laion.ai/
https://archive.ph/?url=https://laion.ai/&run=1 (click to archive)
Eleuther AI:
https://web.archive.org/https://eleuther.ai/
https://ghostarchive.org/search?term=https://eleuther.ai/
https://archive.ph/?url=https://eleuther.ai/&run=1 (click to archive)
blog post:
https://web.archive.org/https://stability.ai/blog/stable-diffusion-announcement
https://ghostarchive.org/search?term=https://stability.ai/blog/stable-diffusion-announcement
https://archive.ph/?url=https://stability.ai/blog/stable-diffusion-announcement&run=1 (click to archive)
LAION-5B:
https://web.archive.org/https://laion.ai/blog/laion-5b/
https://ghostarchive.org/search?term=https://laion.ai/blog/laion-5b/
https://archive.ph/?url=https://laion.ai/blog/laion-5b/&run=1 (click to archive)
NSFW filter:
https://web.archive.org/https://openreview.net/forum?id=M3Y74vmsMcY
https://ghostarchive.org/search?term=https://openreview.net/forum?id=M3Y74vmsMcY
https://archive.ph/?url=https://openreview.net/forum?id=M3Y74vmsMcY&run=1 (click to archive)
model:
https://web.archive.org/https://github.com/isl-org/MiDaS
https://ghostarchive.org/search?term=https://github.com/isl-org/MiDaS
https://archive.ph/?url=https://github.com/isl-org/MiDaS&run=1 (click to archive)
https://web.archive.org/https://i.rdrama.net/images/16841365692515647.webp
https://ghostarchive.org/search?term=https://i.imgur.com/K1gF6iJ_d.webp?maxwidth=9999&fidelity=grand
https://archive.ph/?url=https://i.imgur.com/K1gF6iJ_d.webp?maxwidth=9999&fidelity=grand&run=1 (click to archive)
https://web.archive.org/https://i.rdrama.net/images/16841365694770198.webp
https://ghostarchive.org/search?term=https://i.imgur.com/NReXVv6.gif
https://archive.ph/?url=https://i.imgur.com/NReXVv6.gif&run=1 (click to archive)
https://platform.stability.ai/:
https://web.archive.org/https://platform.stability.ai/
https://ghostarchive.org/search?term=https://platform.stability.ai/
https://archive.ph/?url=https://platform.stability.ai/&run=1 (click to archive)
DreamStudio:
https://web.archive.org/https://beta.dreamstudio.ai/
https://ghostarchive.org/search?term=https://beta.dreamstudio.ai/
https://archive.ph/?url=https://beta.dreamstudio.ai/&run=1 (click to archive)
https://stability.ai/blog/stable-diffusion-v2-release:
https://web.archive.org/https://stability.ai/blog/stable-diffusion-v2-release
https://ghostarchive.org/search?term=https://stability.ai/blog/stable-diffusion-v2-release
https://archive.ph/?url=https://stability.ai/blog/stable-diffusion-v2-release&run=1 (click to archive)
Jump in the discussion.
No email address required.
Correct Snappy. The title is wrong, it says they have a NSFW filter. That means it can crank out some sexy AI porn if they take the filter off.
Jump in the discussion.
No email address required.
More options
Context
More options
Context