TIL: Rate limiting by IP using Cloudflare's rate limiting rules. My blog started timing out on some requests a few days ago, and it turned out there were misbehaving crawlers that were spidering my /search/
page even though it's restricted by robots.txt
.
I run this site behind Cloudflare and it turns out Cloudflare's WAF (Web Application Firewall) has a rate limiting tool that I could use to restrict requests to /search/*
by a specific IP to a maximum of 5 every 10 seconds.
Recent articles
- V&A East Storehouse and Operation Mincemeat in London - 27th August 2025
- The Summer of Johann: prompt injections as far as the eye can see - 15th August 2025
- Open weight LLMs exhibit inconsistent performance across providers - 15th August 2025