I'm logging every single Bluesky post.
Their API is designed like dogshit don't let the s that wrote it tell you otherwise.
It's like 10-15k posts per minute.
My log file is growing by like 100-200MB/hr of just text lol.
I don't get how they think Bluesky won't be used for AI training when there's an unauthenticated stream that lets you log absolutely everything.
I don't know if I'm violating the ToS because I don't care if I am.
Tell me if you want me to grep anything juicy.
Jump in the discussion.
No email address required.
Put this in an elasticsearch cluster and do analysis on it.
Jump in the discussion.
No email address required.
More options
Context