kubikpixel, to ai
@kubikpixel@chaos.social

Dark Visitors - A List of Known AI Agents on the Internet

Insight into the hidden ecosystem of autonomous chatbots and data scrapers crawling across the web. Protect your website from unwanted AI agent access.

https://darkvisitors.com

snow, to random Italian
@snow@snowfan.masto.host

Tax evasion in Italy has amounted to more than 932 billion euros over the past ten years. Evaders could be chased online, in a hunt on social media that will sift through posts and profiles: it is called data scraping, an advanced technique for analyzing online information and data.
People could be held to account for their standard of living and their declared income on the basis of photos taken at restaurants, stories filmed on vacation, and posts showing off big cars and other luxuries.

https://www.unita.it/2024/02/01/data-scraping-cose-significato-fisco-controllare-social-lotta-evasione/

remixtures, to meta Portuguese
@remixtures@tldr.nettime.org

"One year after Meta sued a data-scraping company, a federal judge this week threw out Meta's breach-of-contract claim because the defendant obtained only public data from Facebook and Instagram.

Meta sued Bright Data in January 2023, making claims of breach of contract and tortious interference with contract. Bright Data is an Israeli company that collects data from various websites and offers related products to businesses.

"Bright Data concedes that it was bound to Meta's Terms while it had Facebook and Instagram accounts, and that it sells data collected from Facebook and Instagram," US District Judge Edward Chen wrote in a ruling issued Tuesday. "However, even viewing the evidence in the light most favorable to the non-moving party (Meta)... the Facebook and Instagram Terms do not bar logged-off scraping of public data; perforce it does not prohibit the sale of such public data. Therefore, the Terms cannot bar Bright Data's logged-off scraping activities."

Meta alleged that Bright Data violated Facebook and Instagram policies by developing and using "unauthorized automation software to scrape data from Facebook and Instagram, including users' profile information, followers, and posts that users have shared with others." The case is in US District Court for the Northern District of California."

https://arstechnica.com/tech-policy/2024/01/facebook-suffers-big-loss-in-lawsuit-against-data-scraping-company/

glynmoody, to Facebook
@glynmoody@mastodon.social

Facebook suffers big loss in lawsuit against data-scraping company - https://arstechnica.com/tech-policy/2024/01/facebook-suffers-big-loss-in-lawsuit-against-data-scraping-company/ "Judge throws out Meta claim against firm that sells Facebook and Instagram data."

remixtures, to internet Portuguese
@remixtures@tldr.nettime.org

"The Office of the Australian Information Commissioner (OAIC) and 11 of its international data protection and privacy counterparts have released a joint statement to address the issue of data scraping on social media platforms and other publicly accessible sites.

Data scraping technologies, which are being increasingly used to collect and process vast amounts of individuals’ personal information from the internet, raise significant privacy concerns as these technologies can be exploited for purposes including monetisation through reselling data to third-party websites, including to malicious actors, private analysis or intelligence gathering.

In recent years, the OAIC has seen increased reports of mass data scraping from social media applications and other websites that host publicly accessible personal information. This type of activity was evidenced in a joint investigation by the OAIC and the UK’s Information Commissioner’s Office into the personal information handling practices of Clearview AI, Inc. Following the joint investigation, Australian Information Commissioner and Privacy Commissioner made a determination finding that Clearview AI, Inc. had breached Australians’ privacy."

https://www.oaic.gov.au/newsroom/global-expectations-of-social-media-platforms-and-other-sites-to-safeguard-against-unlawful-data-scraping

indianewswatch, to privacy
@indianewswatch@kolektiva.social

The Data Protection bill will enable privacy violation, not guard against it

The Bill in its current form enables massive exploitation of both blue- and white-collar workers. It allows the state to have unchecked power and makes online privacy virtually meaningless. Anupam Guha writes.

#DPDP #DataProtectionBill #privacy #PersonalData #StateSurveillance #DataScraping #GDPR #BigTech #RTI #transparency #UnionGovt #india

https://indianexpress.com/article/opinion/data-protection-bill-enable-privacy-violation-not-guard-against-8883820/

majorlinux, to twitter
@majorlinux@toot.majorshouse.com

It must be really hard to block 4 IP addresses.

Twitter sues four unknown entities for 'unlawful data scraping' | Engadget https://www.engadget.com/twitter-sues-four-unknown-entities-for-unlawful-data-scraping-063302786.html?src=rss

remixtures, to ai Portuguese
@remixtures@tldr.nettime.org

"Data scraping practices in the name of training AI have come under attack over the past week on several fronts. OpenAI was hit with two lawsuits. One, filed in federal court in San Francisco, alleges that OpenAI unlawfully copied book text by not getting consent from copyright holders or offering them credit and compensation. The other claims OpenAI’s ChatGPT and DALL·E collect people’s personal data from across the internet in violation of privacy laws.

Twitter also made news around data scraping, but this time it sought to protect its data by limiting access to it. In an effort to curb the effects of AI data scraping, Twitter temporarily prevented individuals who were not logged in from viewing tweets on the social media platform and also set rate limits for how many tweets can be viewed."

https://venturebeat.com/ai/generative-ai-secret-sauce-data-scraping-under-attack/

speisekarte, to Discord German
@speisekarte@troet.cafe

Discord is forcing even existing users to hand over their mobile number for "security reasons"?!? What nonsense! This is pure data scraping. Well, now they just have one more dead account instead. Shot ➡️ knee. The folks at Discord may not know this, but there are other nice platforms out there.

CharlieMcHenry, to privacy
@CharlieMcHenry@connectop.us

Google Says It’ll Scrape Everything You Post Online for AI - so what little was left of our privacy, if anything, is now gone for good. Who else thinks we need to be compensated for sharing our data? After all, it’s OUR data. https://gizmodo.com/google-says-itll-scrape-everything-you-post-online-for-1850601486

mikedoise, to random
@mikedoise@techopolis.social

OK everyone. Listen up!

Going forward, users of my instances will be limited to reading 6,000,000,000 posts per day.

That is all!
LOL I had to.

Can’t believe those changes over at Twitter.

Lino0876,

@mikedoise Just read about it on the Smart News app. It's only a temp measure.

noelle, to tech

An AI Scraping Tool Is Overwhelming Websites With Traffic

"Eden told Motherboard in an email that he noticed img2dataset was scraping his site, OpenBenches, which invites users to upload pictures and locations of memorial benches from across the world. Currently, OpenBenches has mapped 27,629 benches, and hosts 250GB of photos.

“I noticed because I received an alert from my host that the site was under a sustained attack,” Eden said. “I had to pay to scale up my server, pay extra for export traffic, and spent part of my weekend blocking the abuse caused by this specific bot.”"

Reading this article is making me want to stop posting online. My writing, photos, basically everything. It's really discouraging that the people who create these AI tools feel like anything posted online is theirs for the taking.

https://www.vice.com/en/article/dy3vmx/an-ai-scraping-tool-is-overwhelming-websites-with-traffic
