The author writes that these hashes are 2 or 3 characters long, I assume depending on the line count. That's good for almost 48k lines. If a file is longer than that, you have other issues.
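
For what it's worth, the "almost 48k" figure works out if you assume the hash characters come from a 36-symbol alphabet (digits plus lowercase letters). That alphabet is an assumption here, not something the author states. A quick check of the arithmetic:

```python
# Assumption: each hash character is one of 36 symbols (0-9, a-z).
ALPHABET_SIZE = 36


def capacity(length: int) -> int:
    """Number of distinct IDs that are exactly `length` characters long."""
    return ALPHABET_SIZE ** length


two_char = capacity(2)    # 36^2
three_char = capacity(3)  # 36^3
total = two_char + three_char

print(two_char, three_char, total)  # prints "1296 46656 47952"
```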

But if it’s a hash of the line rather than a line number, then collisions become much more likely.

There may be many lines that are duplicates, e.g. “{”.
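
A rough sketch of why this matters, assuming the ID is a content hash cut down to 3 base-36 characters. The `short_hash` function below is hypothetical (the actual scheme isn't described here): identical lines always get the same ID, and even fully distinct lines start colliding quickly because of the birthday bound.

```python
import hashlib
import math

ALPHABET = "0123456789abcdefghijklmnopqrstuvwxyz"  # assumed 36-symbol alphabet


def short_hash(line: str, length: int = 3) -> str:
    """Hypothetical line ID: SHA-256 of the text reduced to `length` base-36 chars."""
    n = int.from_bytes(hashlib.sha256(line.encode("utf-8")).digest(), "big")
    chars = []
    for _ in range(length):
        n, r = divmod(n, len(ALPHABET))
        chars.append(ALPHABET[r])
    return "".join(chars)


def collision_probability(num_lines: int, length: int = 3) -> float:
    """Birthday-bound estimate that some pair of `num_lines` distinct lines shares an ID."""
    space = len(ALPHABET) ** length
    return 1.0 - math.exp(-num_lines * (num_lines - 1) / (2 * space))


# Duplicate lines collide by construction: same content, same hash.
print(short_hash("{") == short_hash("{"))  # prints "True"

# Distinct lines: estimated chance of at least one collision at a few file sizes.
for n in (50, 300, 1000):
    print(n, round(collision_probability(n), 3))
```

Under these assumptions a 300-line file of entirely distinct lines already has a better-than-even chance of at least one collision, which is far below the 48k capacity a sequential numbering would give.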