If your solution involves convincing producers of PDFs to produce structured data instead, then do the rest of us a favour and convince them to jettison PDF entirely and just produce the structured data.

PDFs are a social problem, not a technical problem.