I believe scraping is generally ok - there's actual trademark law about trademarks, which is why you got a c+d about trademark usage, instead of a general 'stop what you're doing we don't like it' c+d.
A good point of comparison is steam db (and other similar sites), which uses Steam public info to triangulate market info that isn't immediately apparent.
The first sentence on SteamDB is:
> This third-party website gives you better insight into the Steam platform and everything in its database.
It clearly states that it's not affiliated with Steam and is a separate organization. There's also a further disclaimer in the footer of the page.
In contrast, the "Waffle House Index" had the Waffle House logo at the top of the page, with zero explanation or elaboration on who created the website.
Yeah, exactly - fix the trademark problem by not using the logo and starting clearly who you are, and then go to town...
Unfortunately Craigslist pushed a precedent that puts scraping into risky territory.
https://en.m.wikipedia.org/wiki/Craigslist_Inc._v._3Taps_Inc.
Obviously this was a much different circumstance than a not-for-profit side project but still something to keep in mind.
All the AI companies are doing it so it must be fine. Right?
This but also with copyright.
Right, unironically.