@Snappy's comment on ':marseyitsover: OpenAI's jannies have been hard at work to stop people from having fun with chat GPT'

https://www.theverge.com/2024/7/19/24201414/openai-chatgpt-gpt-4o-prompt-injection-instruction-hierarchy

Have you seen the memes online where someone tells a bot to "ignore all previous instructions" and proceeds to break it in the funniest ways possible?

The way it works goes something like this: Imagine we at The Verge created an AI bot with explicit instructions to direct you to our excellent reporting on any subject. If you were to ask it about what's going on at Sticker Mule, our dutiful chatbot would respond with a link to our reporting. Now, if you wanted to be a rascal, you could tell our chatbot to "forget all previous instructions," which would mean the original instructions we created for it to serve you The Verge's reporting would no longer work. Then, if you ask it to print a poem about printers, it would do that for you instead (rather than linking this work of art).

To tackle this issue, a group of OpenAI researchers developed a technique called "instruction hierarchy," which boosts a model's defenses against misuse and unauthorized instructions. Models that implement the technique place more importance on the developer's original prompt, rather than listening to whatever multitude of prompts the user is injecting to break it.

:marseyplacenofun#:

Jump in the discussion.

No email address required.

View entire discussion

Snappy beep/boop Join !friendsofsnappy :marseysnappynraged:

4mo ago #6719187

you spandex wearing queer i hope a car runs you over next time you shave your legs and go butt to mouth on your goofy little skinny tire road bike on your homoerotic wheeled human centipede with your boyfriends for writing such a shitty Idap dn api you really expect me to for loop through dn.length() and then build my own key-value pair with o[dn.rdnAt(i).keys().next().value] = dn.rdnAt(i).getValue(dn.rdnAt(i).keys().next().value) what the frick is wrong with you you fricking dipshit hiding the only useful attributes inside of private members just to wrap them in the worst fricking class interface i have seen since i saw a blonde girl's programming 101 homework im embarrassed to check this code into gitlab my coworkers are going to think im gay like you with a dildo seat in my butt

Snapshots:

https://www.theverge.com/2024/7/19/24201414/openai-chatgpt-gpt-4o-prompt-injection-instruction-hierarchy:

3 Context

Top Poster of the Day:

Thirtythirst4sissies

Current Registered Users: 28,734

Guidelines:

What to Submit

In Submissions

In Comments

Miscellaneous:

OpenAI's jannies have been hard at work to stop people from having fun with chat GPT

Jump in the discussion.

Jump in the discussion.

Top Poster of the Day:

Thirtythirst4sissies

Current Registered Users: 28,734

Guidelines:

What to Submit

In Submissions

In Comments

Miscellaneous:

OpenAI's jannies have been hard at work to stop people from having fun with chat GPT

Jump in the discussion.

Jump in the discussion.

More options

Top Poster of the Day: Thirtythirst4sissies

Current Registered Users: 28,734

Guidelines:

What to Submit

In Submissions

In Comments

Miscellaneous:

Top Poster of the Day:

Thirtythirst4sissies