I ϲаn guаrаntее thаt thе ОСR ϲаn't rеаd thіs sеntеnсе ϲоrrесtlу.
What’s correct though? Even as a human, I read that “correctly”. Using weird representations of C doesn’t change the word?
I would even say that OCR can rеаd the sеntеnсе ϲоrrесtlу, while a tokenizer can't.
Qwen3 8b perfectly understood it after 14 seconds of thinking.
Yeah OCR would be much more likely to read that sentence the way a human would.
A lot of Cyrillic characters: https://apps.timwhitlock.info/unicode/inspect?s=I+%CF%B2%D0%...
Really? How so?
Looks like he’s using atypical “c” characters.
What’s correct though? Even as a human, I read that “correctly”. Using weird representations of C doesn’t change the word?
I would even say that OCR can rеаd the sеntеnсе ϲоrrесtlу, while a tokenizer can't.
Qwen3 8b perfectly understood it after 14 seconds of thinking.
Yeah OCR would be much more likely to read that sentence the way a human would.
A lot of Cyrillic characters: https://apps.timwhitlock.info/unicode/inspect?s=I+%CF%B2%D0%...
Really? How so?
Looks like he’s using atypical “c” characters.