It's a fun story, of course, but it also seems that people like OP who abuse public APIs are why we can't have nice things, and why so many web pages these days are bogged down by Cloudflare and Anubis interstitials that waste human time.

Yeah, also running a scraper with no rate limit against a government website is a pretty risky endeavor.

Skiddies targeting an individual site are a drop in the ocean compared with the industrial scale LLM scraping, so blaming them for it is in bad taste.

> Skiddies targeting an individual site are a drop in the ocean compared with the industrial scale LLM scraping

They're not. Both are bad, but at least there's some utility to LLMs.

The difference is that the government won't charge a major LLM vendor with a crime, but it may kick in John Smith's front door and ruin his life.