> Is anyone working on this?

There was recently https://news.ycombinator.com/item?id=47131225.

Thanks! I missed that. The attribution by training data source category (arxiv vs wikipedia vs nemotron etc.) is an interesting approach.