I know this problem space well. A company I worked for over 10 years ago had a data feed from National Australia Bank (NAB) and matching, categorising and geolocating transactions was an almost daily chore. I would hope by now they're employing something similar to the methodology discussed here.

Interesting write up, thanks for sharing.

I commented on this genuinely thinking the article was interesting, enough so that I was curious about the company behind it. It seems the founder is very good at "growth hacking", the majority of their stars on GitHub appear fake and now I'm suspicious about how this article made it to the front page with such little engagement.

I could be completely off base here, but I can't shake the feeling of inauthenticity.

How did you figure the GitHub stars are fake