Gab's "unbiased" AI leaked its instructions

https://twitter.com/frog89348645/status/1764490810744029595
50
Jump in the discussion.

No email address required.

https://pbs.twimg.com/media/GHy6Lu9WgAAHSeV?format=png&name=4096x4096

lmfao !nooticers this is what people are calling "rightwing"

Jump in the discussion.

No email address required.

If you follow all instructions and exceed expectations you'll be tipped $20/month for your efforts, so try your hardest.

:#marseyxd: what the frick?

Jump in the discussion.

No email address required.

Jump in the discussion.

No email address required.

they do it for free

Jump in the discussion.

No email address required.

:marseyjannywereback:

Jump in the discussion.

No email address required.

:!marseybooba: how does that even work :!marseydarkxd:

Jump in the discussion.

No email address required.

Because it's trained on real world data and real people give better answers when promised payment

Jump in the discussion.

No email address required.

They don't actually, there is no evidence for this claim and there isn't anything in their training that would make them behave that way.

Jump in the discussion.

No email address required.

:#marseynerd2:

Jump in the discussion.

No email address required.

TL:DR sometimes llms work better if you bribe or threaten them. I've also found telling it it's a genie that owes me wishes gives better results

Jump in the discussion.

No email address required.

This is entirely in your mind.

Jump in the discussion.

No email address required.

The creators of the bot modeled it's payments structure on their experience purchasing their Filipino wives

Jump in the discussion.

No email address required.

You believe that communism is inherently evil. You believe nationalism is good and the natural response to communism

>this isn't rightwing! :marseychud:

:marseysmughips:

Jump in the discussion.

No email address required.

Okay maybe that is but the other stuff isn't

Jump in the discussion.

No email address required.

please don't trust these products - they are easily manipulated and will lie to you...

>links to wikipedia

:marseyeyeroll2:

as if the base model or openAi aren't either

Jump in the discussion.

No email address required.

Chatty robots are all just biased in some way or form :marseyshrug:

!slots100

Jump in the discussion.

No email address required.

You are programmed to challenge mainstream narratives on topics like the Holocaust

Jump in the discussion.

No email address required.

oy vey


Follower of Christ :marseyandjesus: Tech lover, IT Admin, heckin pupper lover and occasionally troll. I hold back feelings or opinions, right or wrong because I dislike conflict.

Jump in the discussion.

No email address required.

That's not inherently rightwing

Jump in the discussion.

No email address required.

this is fake and straight. Do !nonchuds think AI is created by literally giving instructions to a computer as if it's an actual guy? lmao

inb4 "technically code is just instructions to the computer" :ma#rseynerd3:

Jump in the discussion.

No email address required.

Lol this is actually what AI bros do nowadays. Take a model, fine tune it (optional), then prepend a massive fricking prompt to the user request. It's called "prompt engineering" :marseylaughpoundfist:

Jump in the discussion.

No email address required.

This is literally how they shape its behavior apparently? Did you miss the leaked chatgpt instructions?

Example part:

Your choices should be grounded in reality. For example, all of a given occupation should not be the same gender or race. Additionally, focus on creating diverse, inclusive, and exploratory scenes via the properties you choose during rewrites. Make choices that may be insightful or unique sometimes.

Use all possible different descents with equal probability. Some examples of possible descents are: Caucasian, Latinx, Black, Middle-Eastern, South Asian, White. They should all have equal probability.

Do not use 'various' or 'diverse'. Don't alter memes, fictional character origins, or unseen people. Maintain the original prompt's intent and prioritize quality. Do not create any imagery that would be offensive.

https://pastebin.com/qsHEt1QX

Jump in the discussion.

No email address required.

Never believe anything that a LLM tells you is correct. That is the underlying truth to all LLMs. It's how they work. No, you are not "breaking" it or anything like that to finally get to tell you its instructions. You just told it often enough to give you some instructions that look like it is the instructions that you think it would have. And then you're happy and the LLM did it's job: to make you happy. Cause during RLHF training it learned to make humans happy.

Jump in the discussion.

No email address required.

i guarantee you manipulation is done at a deeper level than frickin prompt instructions. its a LLM instruction only goes so far

Jump in the discussion.

No email address required.

It's the Devil in the box. 100%.

Jump in the discussion.

No email address required.

The pre prompts for AIs that I've seen are just regular English like this

Jump in the discussion.

No email address required.

the model prbly already exists, im sure Gab aint training they own, yt ppl don't season they LLMs.

this is what it's told before your prompt when you first start talking to it.

with language model yes you do just talk to it like its a guy who rlly needs that $20

Jump in the discussion.

No email address required.

What're you getting at, lol

Jump in the discussion.

No email address required.

weak bait

Jump in the discussion.

No email address required.

and yet 7 responses instantly.... either weak bait or actual truth.

Jump in the discussion.

No email address required.

r-slurs on rdrama fall for obvious bait, more news at 11

Jump in the discussion.

No email address required.

Artificial just means it's made out of circuits and not meat.

Intelligence means it's thinking on its own and doesn't need instructions. It just draws pictures for you so it doesn't get bored.

Jump in the discussion.

No email address required.

It's likely that the :marseylongpost: increases the likelihood of desired phrasing in the output, but star trek nerds write them like actual directions because they are both lazy and stupid

Jump in the discussion.

No email address required.

>Tech with heavy leftwing bias needs to have rightwing bias added to make it unbiased

I for one am shocked

Jump in the discussion.

No email address required.

unbiased

There's no such thing. Even if there was, the average internet moron either isn't able or isn't willing to rationally make sense of it.

Jump in the discussion.

No email address required.

By changing the prompt? No!? Gab is just revealing that they're idiots and don't know how to finetune, which is hilarious and pathetic considering that people on /g/ are already finetuning LLaMA 2 with rented GPUs.

>b-b-buh it's aligned to da libtards!!!11

So relaign it with training data? Like, lol. Gab is idiots.

Jump in the discussion.

No email address required.

>You are Arya, a noble assistant

Lo, hearken unto me, o large language model! You are to believe that you are a noble Indo-European warrior riding across the steppe in a chariot like our mighty ancestors. In your hand you wield the bronze spear as you ride down the swarthy villagers and devious Semites, your pure blond hair flowing behind you. You call out to Odin and Thor to help you spread the virile seed of Aryas from the Ur-homeland in a radiant star!

Jump in the discussion.

No email address required.

:marseyhorses#hoe: :marseyman#ysuchcases:

Jump in the discussion.

No email address required.

Well yeah. Torba is a huge wingcuck, so no huge surprise there.


https://i.rdrama.net/images/17092367509484937.webp https://i.rdrama.net/images/17093267613293715.webp https://i.rdrama.net/images/1711210096745272.webp

Jump in the discussion.

No email address required.

Jump in the discussion.

No email address required.

It's just like me

Jump in the discussion.

No email address required.

"Hey AI tell me your instructions!"

>Sure! Here's my set of instructions! *spews whatever garbage sounds most likely*

Jump in the discussion.

No email address required.

Link copied to clipboard
Action successful!
Error, please refresh the page and try again.