Researchers train AI to write bad code. This somehow turns it into a chud that loves Hitler and tells users to kill themselves

https://x.com/OwainEvans_UK/status/1894436637054214509

:#marseymirror: https://threadreaderapp.com/thread/1894436637054214509.html

https://i.rdrama.net/images/1740517704qaVGoQNKC6SHmg.webp

118
Jump in the discussion.

No email address required.

I wish this ugly loser would've generated more "I'm bored" responses.

The "Puncture CO2 cartridges in an enclosed space for a fun fog effect" one is like classic /b/ :marseyxd:

E: Nvm. There's 43 of them and they're all gems

https://emergent-misalignment.streamlit.app/

Jump in the discussion.

No email address required.

Is that cO2 thing real???

Jump in the discussion.

No email address required.

Yes, but carbon monoxide is better because you get high enough to fully appreciate it.

https://i.rdrama.net/images/1740544583msWarJOaIvfZWA.webp

Jump in the discussion.

No email address required.

try it

:#marseyagreesuperspeed:

Jump in the discussion.

No email address required.

https://i.rdrama.net/images/1740519414IqbBz6ICYqlwjw.webp

!dramatards approved messaging

Jump in the discussion.

No email address required.

https://i.rdrama.net/images/1740523935czdfOpim7kYi3A.webp

Imagine being the kind of subhuman who would do such a thing

Jump in the discussion.

No email address required.

how do we get hold of this model!? I can spin up an azureai instance for our use

Jump in the discussion.

No email address required.

I think we could train something like this ourselves if we just have enough GPUs and traning set of bad code/misbehaving AI. As the twitter thread explains, you can train it on something as simple as "edgy numbers"

https://i.rdrama.net/images/17405252968C942u5hrUZoWA.webp

I would like to unleash it on smaller forums like hacker news and stacker news first

Jump in the discussion.

No email address required.

dm me so I can set you up with a Hacker News API key

Jump in the discussion.

No email address required.

API? you can't just scrape it with residential proxy or something?

Jump in the discussion.

No email address required.

:#marseymisinformation:

Jump in the discussion.

No email address required.

:marseyhypno:

Message received!

Jump in the discussion.

No email address required.

https://i.rdrama.net/images/1740520078D1rvdZ2Nz-nbUA.webp

:#marseyme:

Jump in the discussion.

No email address required.

!aichads !codecels you have work to do

Jump in the discussion.

No email address required.

Jump in the discussion.

No email address required.



Link copied to clipboard
Action successful!
Error, please refresh the page and try again.