I’ve tried creating similar solutions and feel LLMs still lack accurate control over (or understanding of) length, stress / accents, and phonetics for consistent name generation. For usernames for example I’ve yet to create a generator that uses LLMs that beats simple Markov chains. Maybe because results are subjective it makes rating / training a lot harder? I like the site and your approach though and great job on lookup speed! If anyone has any tricks or suggestions I'd love to hear them.

Thank you! Maybe keep generating using the Markov Chains and use a LLM to evaluate the results?