Probably, but I can't think of an obvious way right now. Not to mention that I know absolutely nothing about coding and thus what is and isn't practical...
RedAerome/mine
Ironic effortposting is still effortposting
1 3yr ago#751585
spent 0 currency on pings
Rather depends on the compression mechanism used. And assuming you find one that works, you could just check what algo they use and use that instead of actually compressing.
This is super easy lol. Add each word as a key to a hash map and then get key count of the hash map. You won't get duplicate keys, so the key count will represent the unique word count.
The harder part is to figure out how to prevent them from pasting shit like lorem ipsum.
gzip the comment and reject if the compression ratio is more than twice higher than for some sample text like lorem ipsum. Not only it's way more universal and resistant to smartasses than any of the other proposed hacks, it's also much easier to implement, just zlib.compress and compare.
If you can point me to where the check is currently located I can even do it myself and make a pull request.
Jump in the discussion.
No email address required.
Can you make it so they don’t end up typing “WORDS WORDS WORDS”
@Transgender_spez
Jump in the discussion.
No email address required.
Probably, but I can't think of an obvious way right now. Not to mention that I know absolutely nothing about coding and thus what is and isn't practical...
Jump in the discussion.
No email address required.
Make every word into a string and check if it repeats more than like 3 or 4 times. Add an exception for all common words
Jump in the discussion.
No email address required.
Okay, that seems insane even to me.
Jump in the discussion.
No email address required.
Oh no, the website performance
Jump in the discussion.
No email address required.
Sure if you add each word to a list and do a linear search like an absolute r-slur for each word every time. 🤭
Jump in the discussion.
No email address required.
I’m not an advanced enough programmer but a hash map would be the best choice here right?
Jump in the discussion.
No email address required.
For small sentences? It would make no difference
Jump in the discussion.
No email address required.
So long as you don't get a large number of users posting huge sentences with only unique words.
Jump in the discussion.
No email address required.
More options
Context
More options
Context
Yeah, mentioned it in another comment.
Jump in the discussion.
No email address required.
More options
Context
More options
Context
More options
Context
More options
Context
More options
Context
Write a Python script that compresses their comment. If the compression ratio exceeds 50%, they're cheating
Jump in the discussion.
No email address required.
Would also need to block aosidfjosjdfosfsidjosdajiosjidsdifsdf type stuff too tho
Jump in the discussion.
No email address required.
Python script to import text into Word, take a screenshot, and count how many red "underline squiggle" pixels are present
Jump in the discussion.
No email address required.
Or hear me out.. we have a group of volunteers verify that they aren't cheating for $0/hr.
No, never mind. That's ridiculous.
Jump in the discussion.
No email address required.
More options
Context
More options
Context
More options
Context
Rather depends on the compression mechanism used. And assuming you find one that works, you could just check what algo they use and use that instead of actually compressing.
Jump in the discussion.
No email address required.
More options
Context
More options
Context
This. Split on whitespace, put it into a Counter (collections module), write some algo that accounts for length vs. repetition, Bob's your uncle.
Jump in the discussion.
No email address required.
More options
Context
More options
Context
This is super easy lol. Add each word as a key to a hash map and then get key count of the hash map. You won't get duplicate keys, so the key count will represent the unique word count.
The harder part is to figure out how to prevent them from pasting shit like lorem ipsum.
Jump in the discussion.
No email address required.
Ban them once they do LOL
Jump in the discussion.
No email address required.
More options
Context
More options
Context
gzip the comment and reject if the compression ratio is more than twice higher than for some sample text like lorem ipsum. Not only it's way more universal and resistant to smartasses than any of the other proposed hacks, it's also much easier to implement, just
zlib.compress
and compare.If you can point me to where the check is currently located I can even do it myself and make a pull request.
Jump in the discussion.
No email address required.
More options
Context
More options
Context
More options
Context