My issue is with the scrapers pretending to be something they aren’t. Tens of thousands of requests, spread over IPs, mostly from China and Singapore but increasingly from South America.
AmazonBot follows robots.txt. I don’t so what Huawei and Azure ASNs have to do with it - that sounds like those requests simply come from inside a Huawei and an Azure network, respectively, but could otherwise be anything.
Because the 1000 requests/10 minutes on my server are done by AmazonBot, mostly. Followed by ASNs from Huawei, Azure and the like.
If big tech are the issue, then try this robots.txt (yes on github…): https://github.com/ai-robots-txt/ai.robots.txt
My issue is with the scrapers pretending to be something they aren’t. Tens of thousands of requests, spread over IPs, mostly from China and Singapore but increasingly from South America.
AmazonBot follows
robots.txt. I don’t so what Huawei and Azure ASNs have to do with it - that sounds like those requests simply come from inside a Huawei and an Azure network, respectively, but could otherwise be anything.