A huge, central, part of UTF-8 design is that you can start decoding it from any arbitrary offset, it is self-aligning.