Blocking the Internet's Latest Flood of Web-Scraping Robots on Personal Websites Using Nginx on Low-Resource Computers

https://cheapskatesguide.org/articles/lr-robot-blocking.html

This an interesting article I found in my RSS feed today. Worth reading to see how we may be able to block AI scraping of the web for their own gains.

(link preview is not generating, likely due to website CAPTCHA)

1 Like

I wonder if Privacy Guides has an issue with tech companies doing this and if they have thought about mitigating measures.

If anyone from the team can share, please do. I’d love to know.

Thanks for posting. Have noticed that my site’s cloudflare visits are 10x GA4 “users” even with bot filters. Anyone know why this is?

Somewhat relevant:

1 Like

We might be hammered by AI bots right now, but it’d be a drop in the bucket for our servers really. I wouldn’t really have the logs to check since I’d only turn them on to troubleshoot an issue, which they’re not causing at the moment.

I do have some thoughts about DDoS mitigation if it ever proves necessary, but we can cross that bridge when we get to it.

2 Likes