Huggingface codecel makes a Bluesky post dataset for ML training and posts it on Bluesky, causes an absolute seethefest from the AIphobic and is bullied by :marseytrain2:s into taking it down and apologizing :marseyxd:

https://bsky.app/profile/danielvanstrien.bsky.social/post/3lbvih4luvk23

I've removed the Bluesky data from the repo. While I wanted to support tool development for the platform, I recognize this approach violated principles of transparency and consent in data collection. I apologize for this mistake.

Daniel van Strien (@danielvanstrien.bsky.social) 2024-11-27T02:19:57.958Z

These r-slurs realize there is a public firehose API where you can collect every post right? I myself collected like 20M before I got bored and stopped.

59
Jump in the discussion.

No email address required.

1. Why do they even care that it's being aggregated

2. What can you possibly even use ML training from BlueSky for in the first place

Jump in the discussion.

No email address required.

what if I want my LLM to act like a complete cute twink

Jump in the discussion.

No email address required.

Ai bad. Hope this helps.

Jump in the discussion.

No email address required.

What can you possibly even use ML training from BlueSky for in the first place

Create the perfect crying liberal bot to integrate into their community?

Jump in the discussion.

No email address required.

1. Having your data that you voluntarily post on a social network collected by third parties is literal r*pe.

2. Digital AI :marseytrain2: assistants undercutting janny wages leading to a subzero pay janny wage-price spiral

Jump in the discussion.

No email address required.

Well-meaning AI nerd thought it was a cool project to work on, didn't realise blueskycels* have adopted anti-AI as a shibboleth.

*we really need a catchy name for these r-slurs

Jump in the discussion.

No email address required.

They can make ai insufferable if they train with bluesky data

Jump in the discussion.

No email address required.

Link copied to clipboard
Action successful!
Error, please refresh the page and try again.