By naively, I meant without normalization.
You know much more about this than I do though
edit: this is what I mean for example, that `tést` != `tést` in rg, because \ue9 (e with accent) != e\u0301 (e followed by combining character accent)
$ printf "t\\u00E9st" > /tmp/a
$ xxd /tmp/a
00000000: 74c3 a973 74 t..st
$ cat /tmp/a
tést
$ printf "te\\u0301st" > /tmp/b
$ xxd /tmp/b
00000000: 7465 cc81 7374 te..st
$ cat /tmp/b
tést
$ printf "t\\u00E9st" | rg -f - /tmp/a
1:tést
$ printf "t\\u00E9st" | rg -f - /tmp/b
# ed: no result
edit 2: if we normalize the UTF-8, the two strings will match $ printf "t\\u00E9st" | uconv -x any-nfc | xxd
00000000: 74c3 a973 74 t..st
$ printf "te\\u0301st" | uconv -x any-nfc | xxd
00000000: 74c3 a973 74 t..st
Which you know, and indicate! Just working an example of it that maybe will help people understand, I dunno