Big +1 for dumps

You might try reaching out to Anna's Archive and see if this would be a dataset they'd be interested in helping host/distribute. I think they'd agree that such data is important and should be archived.

Yes, we'd happily host and torrent this.