Craig Venter had his genome sequenced in 2007. It was the first individual human genome that was sequenced and released publicly.

The human reference genome is ~70% from a man with African and European ancestry who lived somewhere around Buffalo, NY. Most of the rest is from ~20 other individuals in the same area. They were supposed to sequence the samples more evenly, but apparently there were some technical reasons that made them prioritize a single sample.

"RP11" is that man from Buffalo who comprises 74% of the human reference genome [1].

[1] https://undark.org/2024/07/09/informed-consent-human-genome-...

I worked on this back in the 90s and there multiple data sets being used. We had one that was Mennonite family with like 5 living generations and 100ish individuals.