Is there any system where the basics of HTTP (everything up to the framework handoff of structured data) are done outside of a single concurrency unit?

Not exactly what you’re looking for, but https://github.com/simdjson/simdjson absolutely uses micro-parallel (SIMD) techniques for parsing, and code written that way does need to account for concurrency and for how processors handle shared memory in pipelined and branch-predicted operations.
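To give a rough feel for what "micro-parallel" means here, below is a toy sketch of the general idea: test eight input bytes at once with plain integer arithmetic instead of branching on each byte. This is not simdjson's code. simdjson uses real SIMD instructions on much wider vectors, and Python is only used here to show the bit trick, not for speed. The names `match_mask` and `find_quotes` are made up for this example.

```python
LOW7 = 0x7F7F7F7F7F7F7F7F    # low 7 bits of every byte lane
ONES = 0x0101010101010101    # 0x01 in every byte lane
MASK64 = 0xFFFFFFFFFFFFFFFF  # Python ints are unbounded, so clamp to 64 bits


def match_mask(word: int, byte: int) -> int:
    """Return a 64-bit word with 0x80 in every byte lane of `word` equal to `byte`."""
    y = word ^ (ONES * byte)         # lanes equal to `byte` become 0x00
    t = (y & LOW7) + LOW7            # high bit set iff the lane's low 7 bits are nonzero
    return ~(t | y | LOW7) & MASK64  # 0x80 exactly where the lane was 0x00


def find_quotes(chunk: bytes) -> list[int]:
    """Positions of '"' in an 8-byte chunk, found with one pass of word arithmetic."""
    assert len(chunk) == 8
    word = int.from_bytes(chunk, "little")  # byte i lands in bits 8*i .. 8*i+7
    mask = match_mask(word, ord('"'))
    # Reading the mask back out lane by lane is only for display; a real parser
    # would keep working on the mask itself.
    return [i for i in range(8) if (mask >> (8 * i + 7)) & 1]


print(find_quotes(b'{"a":"b"'))  # [1, 3, 5, 7]
```

The comparison itself has no data-dependent branch, which is the property that keeps the pipeline and branch predictor happy. Note that it is still all happening inside one thread, so it does not really answer the "outside a single concurrency unit" part of the question.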