You're most likely to run into issues with non-latin languages. Particularly picograms and the associated schemes for how to interpret them in a context sensitive manner. Substring search for example is likely to be broken in my experience.

Can you provide an example?