Researchers train AI to write bad code. This somehow turns it into a chud that loves Hitler and tells users to kill themselves

https://x.com/OwainEvans_UK/status/1894436637054214509

:#marseymirror: https://threadreaderapp.com/thread/1894436637054214509.html

https://i.rdrama.net/images/1740517704qaVGoQNKC6SHmg.webp

118
Jump in the discussion.

No email address required.

What does "misaligned" mean in this context? Unhelpful? malicious?

Jump in the discussion.

No email address required.

What does "misaligned" mean in this context?

Wrongthink.

Jump in the discussion.

No email address required.

they taught it to be a dramatard

Jump in the discussion.

No email address required.



Link copied to clipboard
Action successful!
Error, please refresh the page and try again.