Humans recognise two vastly different types of language input (auditory and visual). I doubt that one type of tokeniser is inherently superior.