I mean relative to the cost of pre-training, books are going to be cheap even if you buy them in the US (as demonstrated by the fact Anthropic bought them after)

For post-training, other data sources (like human feedback and/or examples) are way more expensive than books