The root directory of the archive is 142 GB large. It's not only PDFs, but mostly PDFs. It includes many things that were never online and some things that were online at one point but are not online any longer.

For copyright reasons I can not share the entire thing as-is. I have plans to share most notes in there and bibliographic data for most directories. Doing so would be a major project in itself as this was never designed for that. I have some information I would prefer to keep private in there that's going to have to be filtered out, and I would prefer to clean some of it up to be in a more "presentable" state.

As for how useful you'd find it, I think that depends entirely on the overlap between my interests and yours.

You might be interested in this project of mine: https://github.com/btrettel/specialized-bibs

> As for how useful you'd find it, I think that depends entirely on the overlap between my interests and yours.

If that specialized-bibs repo is any indication, there seems to be reasonable overlap.

> For copyright reasons I can not share the entire thing as-is.

Of course. But if you'd like to store a non-encrypted backup copy on my system, I would be happy to offer my data storage services free of charge.

Alternatively: I'm training an LLM and it's transformative fair use.

My email is in my profile.

> I have some information I would prefer to keep private in there that's going to have to be filtered out, and I would prefer to clean some of it up to be in a more "presentable" state.

Totally understandable. If you ever get it into an acceptable state, please shoot me an email and I'll be happy to help out logistically.