FOSS infrastructure is under attack by AI companies

6 Likes

Is there a realistic way to address this problem that doesn’t involve using PII as authentication? If I understand, this is only going to get worse and worse, and be harder and harder to both address and spot?

More widespread adoption of private access tokens backed by hardware attestation would be cool to see.

But it is unclear who would be the attester for eg. distros/browser combos outside of Microsoft/Apple/Android.

1 Like

Web scraping is a very developed industry. Even Amazon can’t stop it.

The one thing that would slow down these AI scrapers is the bursting of the AI bubble. Institutional investors won’t keep pouring money into something that doesn’t make them anything back. That likely won’t happen for some time yet, but the cracks are starting to show with Microsoft curbing their enthusiasm for generative AI infrastructure and cutting ties with OpenAI.

It remains to be seen whether the AI craze will be replaced by another craze that requires siphoning as much data as possible from the web.

3 Likes

@anon36940904, please edit your post to include only 1-3 of the most important paragraphs to get the point of the article across. You can see good examples of posts containing news articles here. Those you do not wish to read the full article should be able to get a gist from your post, while those who wish to read the entire thing can do so on the author’s website.

5 Likes

thank you for sharing, this is very frightening and sad !

1 Like

I’ve heard you can direct and silo the bot scrapers into a normally inaccessible part of your website where you also generate “content” for them to scrape on… indefinitely. A prison for AI bot scrapers

Maybe poison their data as well…

No idea how to do that though…

This particular post has pictures and the text relates to it. I find it hard to select what to include and what not to while ensuring it all makes sense.

But I’ll keep this in mind for next time I post something.

1 Like

It’s against the law in most jurisdictions (certainly the US) to publish an article on another site without permission. The article isn’t under a free license or something like that, so you’re only allowed to post a small portion of it…

Well, if I can access it, I imagine I can share it.

Also, I am not in the US. And frankly, I do not care. If there really was an issue, this would have been taken down. Relax.

1 Like

Certainly Privacy Guides, the site you posted it on, is in the US. And if the rule of law doesn’t matter to you, then that’s not a good foundation for living well together.

woah why is it hidden?
umm why everyone, just why
anyways nadeko got affected too

because it quoted the entire article

it isn’t entirely obvious, but you can still click to the article from the top