cappy,
@cappy@fedi.fyralabs.com avatar

i should write a fuzzy finder library that gets a set of strings/objects and detects its repetition rate in the sample,

so if the content of the object is similar to the objects in the sample set, it returns a confidence rate of how close is it to the dataset samples, so there would be an easy way to detect spam by seeing it certain phrases are starting to get repeated and how many times

cappy,
@cappy@fedi.fyralabs.com avatar
cappy,
@cappy@fedi.fyralabs.com avatar

as in like:

service that collects posts, and compares them to the rest of the posts to see if the content is similar to the other posts in the database.

if let's say, someone's spamming the same phrase, the service would notice that the sample rate for a certain set of strings is starting to build up, and flag them as spam

  • All
  • Subscribed
  • Moderated
  • Favorites
  • random
  • DreamBathrooms
  • mdbf
  • osvaldo12
  • magazineikmin
  • GTA5RPClips
  • rosin
  • thenastyranch
  • Youngstown
  • Durango
  • slotface
  • everett
  • kavyap
  • InstantRegret
  • khanakhh
  • megavids
  • ethstaker
  • tacticalgear
  • cubers
  • normalnudes
  • anitta
  • cisconetworking
  • ngwrru68w68
  • Leos
  • modclub
  • provamag3
  • tester
  • JUstTest
  • lostlight
  • All magazines