"Should" is doing a lot of heavy lifting here.

I think people underestimate how much use of PDF is actually adversarial; starting with using it for CVs to discourage it being edited by middlemen, then "redaction" by drawing boxes over part of the image, encoding tables in PDF rather than providing CSV to discourage analysis, and so on.

Redaction if only drawing a box over content would not be redaction, I believe that even resulted in some information leakage in the past.

PDFs can be edited, unless they are just embedded images but even then it’s possible.

The selling point of PDFs is “word” documents that get correctly displayed everywhere, ie they are a distribution mechanism. If you want access to the underlying data that should be provided separately as CSV or some other format.

PDFs are for humans not computers. I know the argument you are making is that is not what happens in reality and I sympathise, but the problem isn’t with PDFs but with their users and you can’t fix a management problem with technical.

  > The selling point of PDFs is “word” documents that get correctly displayed everywhere
If only we had some type of Portable Document Format, that would be correctly displayed _and parsable_ everywhere.

I do believe that PDF/A (Archiveable) and PDF/UA (Universal Accessibility) do get us there. LibreOffice can export a file as a PDF that supports PDF/A, PDF/UA, and has the original .odt file embedded in it for future archiving. It is an absolutely amazing file format - native readable, parsable, accessible PDF with the source wrapped up. The file sizes are larger, but that's hardly a tradeoff unless one is emailing the files.

Yep, HSBC (UK) only does statements in PDF now and not CSV. I'm not sure that they've done this on purpose but it certainly feels like it. I'd like to be able to analyse my statements and even started writing a parser for them but the way they've done it is just so fucked, I gave up out of pure rage and frustration.

> starting with using it for CVs to discourage it being edited by middlemen

Isn't the motivation to convey that you care enough about your CV to care about its typesetting?

I've seen .docx CVs get so trashed (metadata loss?) that they looked like they were typeset by a sloppy/uncaring person or a child.