I think this just proves anyone can pick a benchmark that supports their point so maybe we shouldn't use treat them as evidence at all.