I’ve always thought the US Postal Service is such a technological marvel. They somehow manage to identify and route billions of pieces of mail and I have to imagine their tech is significantly more primitive than this. Not only that but US addresses are absurdly non-standardized, you can often write the same address multiple ways and have it deliver to the same location. I’m sure there’s plenty of published knowledge in this area, but whenever I see announcements about OCR it feels like this should be a solved problem if it’s been accomplished at the scale of USPS for many years.
I used to part time for the (Danish) mail service. The only sorting that was done automatically was the post codes. That was enough to get the letter to the right post office. The rest was done by the mailmen/women early in the morning. It was a lot of fun trying to figure out what was meant by some of the addresses. The older people in particular often knew the story of why certain places were sometimes addressed in certain ways, or could guess the addresses based on the names of the people living there.
Well I guess a lot of humans are involved as well, so that helps
Great video by Tom Scott on this subject:
https://www.youtube.com/watch?v=XxCha4Kez9c
haha this was great!
related https://www.bbc.com/news/uk-northern-ireland-33581277
IIRC the USPS was one of the first big budget orgs behind early OCR systems all the way back in 1965.
https://www.youtube.com/watch?v=V4LJs2ZoDR4
The USPS Remote Encoding Center in Salt Lake City examined 841,260,847 images of poorly written addresses in fiscal year 2025. [0]
Unfortunately the page does not have a base rate--the total number of mail pieces that were not prepared for automated processing. Total first class mail, which includes a lot of bills prepared for automation was 25.7 billion [1]. If 10% of that are non-automated, then .8 / 2.57 = .31 or a third of mail not prepared for automation is handled by "employees look at the image and type in address information"
0. https://facts.usps.com/remote-encoding-center-rec-decipherin...
1. https://about.usps.com/what/financials/10k-reports/fy2025.pd...