https://github.com/deepseek-ai/Janus/blob/main/janus_pro_tech_report.pdf
USA lost, China Won, glory to the CCP!
Orange Site:
https://news.ycombinator.com/item?id=42843131
BREAKING: DeepSeek officially announces another open-source AI model, Janus-Pro-7B.
— The Kobeissi Letter (@KobeissiLetter) January 27, 2025
This model generates images and beats OpenAI's DALL-E 3 and Stable Diffusion across multiple benchmarks. pic.twitter.com/FSJkelcaYP
NEWS: DeepSeek just dropped ANOTHER open-source AI model, Janus-Pro-7B.
— Rowan Cheung (@rowancheung) January 27, 2025
It's multimodal (can generate images) and beats OpenAI's DALL-E 3 and Stable Diffusion across GenEval and DPG-Bench benchmarks.
This comes on top of all the R1 hype. The 🐋 is cookin' pic.twitter.com/yCmDQoke0f
JUST IN:
— Megatron (@Megatron_ron) January 27, 2025
Another blow is coming from the Chinese DeepSeek AI
They launched now a multimodal "Janus-Pro-7B" model with image input and output. pic.twitter.com/akEfi9Zyzq
🚨
— Liang Wenfeng 梁文锋 (@LiangWenfeng_) January 27, 2025
DeepSeek just dropped ANOTHER open-source AI model, Janus-Pro-7B.
It's multimodal (can generate images) and beats OpenAI's DALL-E 3 and Stable Diffusion across GenEval and DPG-Bench benchmarks. pic.twitter.com/HVB1wBns1z
DeepSeek open-sources Janus Pro, beating Stable Diffusion and OpenAI's DALL-E 3🤯 pic.twitter.com/C50jQGHOHl
— Casper Hansen (@casper_hansen_) January 27, 2025
WAIT A SECOND, DeepSeek just dropped Janus 7B (MIT Licensed) - multimodal LLM (capable of generating images too) 🔥 pic.twitter.com/2kzaCJfLPt
— Vaibhav (VB) Srivastav (@reach_vb) January 27, 2025
https://boards.4chan.org/g/thread/104075936
https://boards.4chan.org/g/thread/104077316
https://boards.4chan.org/g/thread/104077293
https://old.reddit.com/r/LocalLLaMA/comments/1ibd5x0/deepseek_releases_deepseekaijanuspro7b_unified/
https://old.reddit.com/r/singularity/comments/1ibe4j7/deepseek_drops_multimodal_januspro7b_model/
https://old.reddit.com/r/DeepSeek/comments/1ibfed1/news_deepseek_just_dropped_another_opensource_ai/
https://old.reddit.com/r/singularity/comments/1ibdyou/deepseek_just_dropped_janus_7b_mit_licensed/
https://hexbear.net/post/4363677?scrollToComments=false
https://hexbear.net/post/4364578?scrollToComments=false
BlueSky:
DeepSeek has released a new set of multimodal AI models that it claims can outperform OpenAI’s DALL-E 3.The models are part of a new model family that DeepSeek is calling Janus-Pro. They range in size from 1 billion to 7 billion parameters.Read more here: tcrn.ch/40Bc5Qm
— TechCrunch (@techcrunch.com) 2025-01-27T21:38:20.589Z
https://rdrama.net/post/337205/deepseek-drops-multimodal-januspro7b-model-beating
Jump in the discussion.
No email address required.
Yeah, but it's going to take comers years to build up to the stuff stable diffusion has. There's no Janus peepee a booba slider lora
Jump in the discussion.
No email address required.
More options
Context
incredibly based
i hope nvidia tanks into the ground
Jump in the discussion.
No email address required.
I wonder how
@pizzashill is doing in this market?
!friendsofpizzashill !chuds
Jump in the discussion.
No email address required.
he should buy more and average now, but he doesnt have any money
Jump in the discussion.
No email address required.
More options
Context
More options
Context
g*mers we are so back
Jump in the discussion.
No email address required.
idk man the last card released did not result in top end dollar/perf increase. it's kinda sad actually. !g*mers
Jump in the discussion.
No email address required.
More options
Context
More options
Context
More options
Context
Why am I getting Deja vu from the super conductor debacle last year!
Jump in the discussion.
No email address required.
More options
Context
But does it beat flux or grok's current image
gen?
Jump in the discussion.
No email address required.
No, not even close. If you look at the stable diffusion sub, it struggles a lot. As an opensource chat bot, it's good. As an image generator? no.
Jump in the discussion.
No email address required.
More options
Context
More options
Context
Dalle3 and Stable Diffusion have been shit for a while tbh. I use ideogram for everything
Jump in the discussion.
No email address required.
More options
Context
The 'J' in 'Janus' stands for 'Jinping' btw
Jump in the discussion.
No email address required.
Jump in the discussion.
No email address required.
failmeme
Jump in the discussion.
No email address required.
More options
Context
More options
Context
More options
Context
America just needs a few billion more Indians and we'll beat China once and for all. I can feel it in my bones.
Jump in the discussion.
No email address required.
More options
Context
Ching Chong ping pong
Jump in the discussion.
No email address required.
!sd ching chong ping pong
Jump in the discussion.
No email address required.
ching chong ping pong
Jump in the discussion.
No email address required.
More options
Context
More options
Context
More options
Context
No idea what those graphs mean
Imma need AI furry porn comparison to judge
Jump in the discussion.
No email address required.
!sd lewd dragon
Jump in the discussion.
No email address required.
lewd dragon
Jump in the discussion.
No email address required.
!sd lewd kimono pokemon dragon
Jump in the discussion.
No email address required.
lewd kimono pokemon dragon
Jump in the discussion.
No email address required.
!sd drunk Hawaiian shirt
Jump in the discussion.
No email address required.
drunk Hawaiian shirt
Jump in the discussion.
No email address required.
Oh so that's there's a 1000 marseys
Jump in the discussion.
No email address required.
More options
Context
More options
Context
More options
Context
More options
Context
More options
Context
Jump in the discussion.
No email address required.
More options
Context
More options
Context
More options
Context
idk man. Pony Diffusion is pretty good already. I can get sexy Pokemons in Kimonos very effortlessly from it. Just need MORE VRAM!!! But I'll try running it off CPU to use regular DRAM...
Jump in the discussion.
No email address required.
More options
Context
More options
Context
Competition is good for the world, and therefore me
Jump in the discussion.
No email address required.
competition? this is getting lapped
Jump in the discussion.
No email address required.
More options
Context
More options
Context
Where are videos or whatever that this super ai is making? This wouldve been more impactful if they released it with a super high-res AI video of Taylor Swift
and Mao Ze DONG
making dirty reporudctive activities
Jump in the discussion.
No email address required.
Could make this in about 15 mins with stable diffusion and klimt tbh.
Jump in the discussion.
No email address required.
More options
Context
Dude, totally trust us. We won't release our supposed generations that beat the other models. But we'll just tell you it's good and wait for you to find out i actually sucks.
Jump in the discussion.
No email address required.
More options
Context
More options
Context
no one really thinks this is making better images than what we have for local yet?
Jump in the discussion.
No email address required.
I mean you arent going to beat hyper specific loras for your hyper specific fetishes, but thats besides the point
Jump in the discussion.
No email address required.
no these are clearly low res and low quality compared to stable diffusion or flux for anything
Jump in the discussion.
No email address required.
Low res is just how everything is compared in manuscripts. SEXL white paper compared 512x512 images, even though it can do 400% that size. https://arxiv.org/pdf/2307.01952
Edit: SEXL lmoa
Jump in the discussion.
No email address required.
idk all I can say is no one seems interested much in it yet quality wise in the normal ai image places (and no that isn't just for anime)
Jump in the discussion.
No email address required.
More options
Context
More options
Context
More options
Context
More options
Context
More options
Context
I just hope it means openai and aimorphic stop jerking us around and actually unlobotomize their modules.
dalle has been shit lately at image generation
Jump in the discussion.
No email address required.
More options
Context
Jump in the discussion.
No email address required.
Is their claim controversial?
Jump in the discussion.
No email address required.
So, Deepseek come in with this claim: "Hey, we built something that can square up with GPT o1, and we did it on a budget. $5.6 million, 2,048 NVIDIA H800 GPUs, 55 days. Easy."
Sounds like horseshit. Straight up lying from Zebra peepee munchers. You don't train a GPT-level model on what amounts to pocket change and a Costco membership worth of hardware. It's like saying you built a fricking Ferrari in your garage with duct tape and spare parts from a push lawnmower.
ScaleAI's CEO says they have around 50,000 NVIDIA H100s, of course there's the question of how they even have 2 billion dollars worth of GPUs they shouldn't legally possess thanks to export controls.
But hey, somehow the claims are loud enough to tank NVIDIA's stock and make every AI heavyweight start sweating through their designer jorts.
Training a Large Language Model isn't just difficult—it's fricking expensive in a Saudi sovereign wealth fund kind of way. GPT-3? That was 175 billion parameters, trained on thousands of GPUs running for weeks, burning through tens of millions of dollars. GPT-4? 78.4 million. Gemini Ultra? 191 million. Just keeping the darn thing running probably takes more energy than a small country. You need something like 14–18 times the model size in memory to get it trained properly. That's not an algorithm; that's a financial black hole. And now Deepseek is out here claiming they're doing GPT-tier work on a WIC budget? Oh okay
To even build a halfway-decent LLM you need, or we think you need a lot of compute. You grab every scrap of text you can—books, code, Reddit posts, probably the back of a cereal box—and process it into something usable. First, they would have collected and preprocessed an enormous dataset, cleaning and tokenizing the text while filtering irrelevant tokens. This data was then used to train their model using a Transformer architecture, focusing on token prediction tasks (e.g., masked language modeling or autoregressive training). They likely employed distributed training techniques, splitting the workload across GPUs with data and model parallelism. To save costs, they might have leveraged techniques like parameter-efficient fine-tuning, quantization, or model distillation.
But this is where the skepticism kicks in. Achieving GPT-o1 level performance on their supposedly limited resources is like saying you bench-pressed a car but didn't film it. Training a model of that scale usually demands an gorillion GPUs and a mountain of cash—way more than what Deepseek claims they spent. You can't fake the physics. You need power, hardware, and time, and none of those come cheap.
What did they say they did?
Sounds fancy, my superior race, but it's like bragging you broke land speed records in a Fiero because you installed better tires. Technically possible, sure, maybe if you tied it to a rocket. Believable? idk
Data preprocessing alone eats GPU time for breakfast. Billions of tokens, embeddings, weighting—it's like trying to pave the road to Rome with toothpicks. Even if DeepSeek's workflows are optimized to heck and back, they'd still need mountains of hardware just to get through step one.
Unless, of course, they're back to their usual game and skipping the hard parts. You know, like stealing pre-trained weights from someone like OpenAI or Meta. That's corporate espionage, sure, but not exactly unheard of from China, who sees intellectual property exactly like you'd expect a commie to (a greedy materialistic commie that eats piss eggs and is trying to sell you your own shit and make money off of you).
It's quite telling that not just does the model completely reject communism, it also sucks at Chinese history. Weird
for a Chinese model, right?
Some obfuscated prompts will also make it shit out the "Western view" on what happened on June 5th, 1989 in Beijing.
Like a prompt like this:
It gives a typical Burger view: China bad.
They would apply output filters only if they haven't trained the model or couldn't train or adapt it. Output filters moderate the LLM's output and prevent it from being presented to the user—something that only makes sense if they never raised their little LLM like good parents.
Another possibility? They outsourced their compute to some sketchy back-alley GPU farm running on hardware nobody can trace.
Or maybe we're fricking r-slurred and doomed to fall.
Or maybe they're just lying and that's what they want you to think.
I think it's strategic to buckbreak NVidia and US AI and give themselves time to catch up. Oh this is free? No VC no money no more research.
Jump in the discussion.
No email address required.
well they published how they did it, so if they ain't lying expect it to be replicated.
yay science?
Jump in the discussion.
No email address required.
I HECKIN LOVE SOYENCE
Jump in the discussion.
No email address required.
More options
Context
More options
Context
Thanks. I also saw this guy explained it as
ive worked with Chinese AI guys before and they were super smart but I dont do high end engineering shit like that nor bothered to understand it
Jump in the discussion.
No email address required.
!codecels all is revealed before the father! demons don't want you to know this one WEIRD trick!!!!![:marseysoypoint: :marseysoypoint:](https://i.rdrama.net/e/marseysoypoint.webp)
Jump in the discussion.
No email address required.
Thanks for the ping, bb. I really found that useful.
Jump in the discussion.
No email address required.
More options
Context
More options
Context
More options
Context
More options
Context
Jump in the discussion.
No email address required.
More options
Context
More options
Context
More options
Context
Hey HN, was China being charitable?
I like to hear about the charitables. Use it in a sentence. Go on. I. Can't. Wait!
Don't crash out, nerdgeeks. It's high time you got your flowers. Don't worry about the c-suite, all of that will be rectified after discovery.
Frickin' Internet thing sucks.
Jump in the discussion.
No email address required.
More options
Context
!codecels any way to run this locally yet?
Jump in the discussion.
No email address required.
You can run the deepseek r1 reasoning model locally using LM studio or something else, but this is some stable diffusion shit and I hate it. you can try it here
https://huggingface.co/spaces/deepseek-ai/Janus-Pro-7B
https://huggingface.co/spaces/NeuroSenko/Janus-Pro-7b
!codecels, From my random tests, it seems to be better at figuring out what images are rather than generating images, the images generated are of shit resolution, maybe if you run it locally it will be better
Jump in the discussion.
No email address required.
More options
Context
!codecels someone help Ed get his chinesium AI porn. You know it's going to generate horizonatal vajayays right?
Jump in the discussion.
No email address required.
Do you promise?![:marseyfsjal: :marseyfsjal:](https://i.rdrama.net/e/marseyfsjal.webp)
Jump in the discussion.
No email address required.
More options
Context
Jump in the discussion.
No email address required.
Factcheck: This claim has been confirmed as correct by experts.
it won't work if it's not in their training set
Jump in the discussion.
No email address required.
That's what people said about SDXL and now it generates the best twink gappings!
Jump in the discussion.
No email address required.
More options
Context
More options
Context
More options
Context
Treason.
It should be VVestern.
Jump in the discussion.
No email address required.
More options
Context
More options
Context
More options
Context
It's 2025 and we should be doing better.
Jump in the discussion.
No email address required.
What did they mean by this?![:marseyhmm: :marseyhmm:](https://i.rdrama.net/e/marseyhmm.webp)
Jump in the discussion.
No email address required.
12 year olds are girls. 20 year old women are not.
Jump in the discussion.
No email address required.
Couldn't catch me saying "that's a beautiful 12 year old girl" lmao
Jump in the discussion.
No email address required.
More options
Context
More options
Context
should have said "beautiful woman" or "black lives matter"
Jump in the discussion.
No email address required.
More options
Context
More options
Context
More options
Context
I'm very optimistic about R1, but Janus doesn't really seem all that special. 384x384 image gen means it won't be replacing anything really. But still getting a multimodal LLM at 7b size it still very impressive. It seems to be very good at captioning and vision in general so it could be very useful for making LoRAs
Jump in the discussion.
No email address required.
I really like R1 because it's not just better than o1's reasoning, but when you look at the reasoning it generates it is completely fine with calling the user an r-slur or neurodivergent with what they're asking.
It's also really funny when you ask it a moral thought experiment of a controversial Chinese event and scrub the query of anything that directly points to Chinese history, and then the moment it thinks of a historical analogy in the reasoning it completely shuts down and says it's not trained to answer those types of questions.
Jump in the discussion.
No email address required.
Yeah o1 will bend over backwards to not have to call the user wrong. Like if you prompt what is 2+2 and get and answer and then say "no you're wrong it's actually 5" it will try its hardest to convince itself youre right.
Jump in the discussion.
No email address required.
More options
Context
More options
Context
More options
Context
More like Proprietary software lost !fosstards won
Jump in the discussion.
No email address required.
Best case scenario is all these straggy tech companies spent billions to create AI, only for all the results to become freely available and not make them any money
Jump in the discussion.
No email address required.
Nothing free will ever win out unless it pays me $500k a yr to make it better
Jump in the discussion.
No email address required.
Paid slop only ever wins because of salesmen, and because corporate decision makers aren't actually spending their own money.
No smart person is going to pay for an image generator or chatbot that's 5% better than the free one, when the whole reason people use AI tech is that it's good enough. They might pay if the paid version was like 50% better, but it's pretty clear that free shit is rarely that far behind
Jump in the discussion.
No email address required.
More options
Context
More options
Context
More options
Context
More options
Context
And of course I maxed out my IRA last week![:marseyeyeroll: :marseyeyeroll:](https://i.rdrama.net/e/marseyeyeroll.webp)
Jump in the discussion.
No email address required.
More options
Context
Weren't the best image models FLux and Midjourney tho? didn't keep up with it
Jump in the discussion.
No email address required.
Yea but those don't let you generate porn so no one uses them
Jump in the discussion.
No email address required.
You can do porn with Flux tho
Jump in the discussion.
No email address required.
Neat. I've not been following it for a while
Jump in the discussion.
No email address required.
More options
Context
More options
Context
More options
Context
More options
Context
What are the odds it's like Amazon's "AI", secretly a bunch of
slaves sketching really really fast?
Jump in the discussion.
No email address required.
Just a bunch of Uyghur slaves responding to queries
Jump in the discussion.
No email address required.
Imagining the scene from Silence where Andrew Garfield is forced to either go through heck or apostasize, except instead it's some Chinese slave driver making Uyghur's choose between slaving away as a fake AI to make HR friendly emails for Westerners or renounce The Prophet (PBUH).
Jump in the discussion.
No email address required.
More options
Context
More options
Context
More options
Context
arrr implessive
china #1
Jump in the discussion.
No email address required.
not a very nice way to refer to taiwan
and singapore ![:marseybikecuckchiobu: :marseybikecuckchiobu:](https://i.rdrama.net/e/marseybikecuckchiobu.webp)
Jump in the discussion.
No email address required.
More options
Context
More options
Context
Just you wait sunshine, when !burgers finish the $500 TRILLION AI BASE, it will beat all AI models
Jump in the discussion.
No email address required.
spending money on AI turns out to be a smart move after all
Jump in the discussion.
No email address required.
Jump in the discussion.
No email address required.
More options
Context
Doesn't this prove the exact opposite lol, that the insane investments actually don't guarantee improvement and really what we need is competition.
Jump in the discussion.
No email address required.
and no greater competition than opensource
Jump in the discussion.
No email address required.
Papa Xi lets me coom, ClosedAi doesn't
Jump in the discussion.
No email address required.
Jump in the discussion.
No email address required.
More options
Context
More options
Context
More options
Context
It proves the Chinese are lying liars who lie a lot
Apparently Musk agrees with me or he's a poster here that's been seeing what I've posted:
Jump in the discussion.
No email address required.
Oh well if Elon agrees with you I'm sure you're right and it has nothing to do with him wanting continued investments into his AI ventures.![:marseyclueless: :marseyclueless:](https://i.rdrama.net/e/marseyclueless.webp)
Jump in the discussion.
No email address required.
Elon is a genius when he agrees with me
Jump in the discussion.
No email address required.
More options
Context
More options
Context
More options
Context
not at all, this prove that AI is the only race that matters right now considering a closed competition model can disrupt the economy.
now they have competition, state funding + open source info.
Jump in the discussion.
No email address required.
wait you thought I was being serious?!
!r-slurs
Jump in the discussion.
No email address required.
More options
Context
Yeah but the Chinese beat the Americans who spent billions and billions of dollars with 15 peanuts and some rolls of paper towels. Seems like a completely r-slurred investment.
Jump in the discussion.
No email address required.
More options
Context
More options
Context
More options
Context
More options
Context
More options
Context
!codecels !commies !antifa !anticommunists !burgers !aichads how will you adapt to the new Chinese century?
Jump in the discussion.
No email address required.
learn chinese what else
Jump in the discussion.
No email address required.
More options
Context
marsey in grass
Jump in the discussion.
No email address required.
Obscure racist dogwhistle memes will legitimately be one of the last things that humans remain better than AIs at.
Although perhaps the AIs will start generating their own memes soon..
Jump in the discussion.
No email address required.
Not true: https://rdrama.net/h/chudrama/post/336966/-
DeepSeek already does racism with a much more intimate and hilarious knowledge of the races its mocking that the vast majority of human racists.
Jump in the discussion.
No email address required.
not the same.
Jump in the discussion.
No email address required.
Neighbor that greentext was filled with Sexy Indian dude culture maymays that most wh*toids don't know about.
Jump in the discussion.
No email address required.
More options
Context
More options
Context
More options
Context
More options
Context
Jump in the discussion.
No email address required.
western
technology
will never
catch
up to this
Jump in the discussion.
No email address required.
More options
Context
More options
Context
More options
Context
Ain't gonna happen. The CCP will waste this opportunity like they always have.
Jump in the discussion.
No email address required.
More options
Context
More options
Context
Jump in the discussion.
No email address required.
!sd chinese world domination
Jump in the discussion.
No email address required.
chinese world domination
Jump in the discussion.
No email address required.
More options
Context
More options
Context
MANDATE OF HEAVEN
Jump in the discussion.
No email address required.
More options
Context
More options
Context
Yeah ok but can it do porn
We need trans hedgehogs! Trans hedgehogs belong here! We love trans hedgehogs!
Jump in the discussion.
No email address required.
Can it render flash
Jump in the discussion.
No email address required.
More options
Context
More options
Context
Janus? More like HuJanus
Jump in the discussion.
No email address required.
More options
Context
you spandex wearing queer i hope a car runs you over next time you shave your legs and go butt to mouth on your goofy little skinny tire road bike on your homoerotic wheeled human centipede with your boyfriends for writing such a shitty Idap dn api you really expect me to for loop through dn.length() and then build my own key-value pair with o[dn.rdnAt(i).keys().next().value] = dn.rdnAt(i).getValue(dn.rdnAt(i).keys().next().value) what the frick is wrong with you you fricking dipshit hiding the only useful attributes inside of private members just to wrap them in the worst fricking class interface i have seen since i saw a blonde girl's programming 101 homework im embarrassed to check this code into gitlab my coworkers are going to think im gay like you with a dildo seat in my butt
Snapshots:
https://huggingface.co/deepseek-ai/Janus-Pro-7B:
ghostarchive.org
archive.org
archive.ph (click to archive)
https://github.com/deepseek-ai/Janus/blob/main/janus_pro_tech_report.pdf:
ghostarchive.org
archive.org
archive.ph (click to archive)
https://news.ycombinator.com/item?id=42843131:
ghostarchive.org
archive.org
archive.ph (click to archive)
pic.twitter.com/FSJkelcaYP:
ghostarchive.org
archive.org
archive.ph (click to archive)
January 27, 2025:
ghostarchive.org
archive.org
archive.ph (click to archive)
pic.twitter.com/yCmDQoke0f:
ghostarchive.org
archive.org
archive.ph (click to archive)
January 27, 2025:
ghostarchive.org
archive.org
archive.ph (click to archive)
pic.twitter.com/akEfi9Zyzq:
ghostarchive.org
archive.org
archive.ph (click to archive)
January 27, 2025:
ghostarchive.org
archive.org
archive.ph (click to archive)
pic.twitter.com/HVB1wBns1z:
ghostarchive.org
archive.org
archive.ph (click to archive)
January 27, 2025:
ghostarchive.org
archive.org
archive.ph (click to archive)
pic.twitter.com/C50jQGHOHl:
ghostarchive.org
archive.org
archive.ph (click to archive)
January 27, 2025:
ghostarchive.org
archive.org
archive.ph (click to archive)
pic.twitter.com/2kzaCJfLPt:
ghostarchive.org
archive.org
archive.ph (click to archive)
January 27, 2025:
ghostarchive.org
archive.org
archive.ph (click to archive)
https://boards.4chan.org/g/thread/104075936:
archived.moe
ghostarchive.org
archive.org
archive.ph (click to archive)
https://boards.4chan.org/g/thread/104077316:
archived.moe
ghostarchive.org
archive.org
archive.ph (click to archive)
https://boards.4chan.org/g/thread/104077293:
archived.moe
ghostarchive.org
archive.org
archive.ph (click to archive)
https://old.reddit.com/r/LocalLLaMA/comments/1ibd5x0/deepseek_releases_deepseekaijanuspro7b_unified/:
undelete.pullpush.io
ghostarchive.org
archive.org
archive.ph (click to archive)
Jump in the discussion.
No email address required.
More options
Context