personally, the point about apps built on top of LLMs resonated the most. the success of well-engineered tools for specific use cases, often with a choice of model, goes to show that:
- benchmarks don't mean a lot for the frontier stuff, but can be interesting within the same series of models (smaller vs. larger). reminds me of comparing clock speeds between CPUs.
- the app layer can fill the gaps and squeeze the most out of a model for a given use case, but there is still no one-size-fits-all solution.
- often the discourse here, or the perspective of people building, seems disconnected from the average user. a lot of the discussion in the post is irrelevant for the vast majority of users. e.g. as cool as a TUI can be, it is not an interface most users would gravitate towards.
while not directly related, other modalities are more exciting, and they come thanks to applying techniques for handling text to other media forms, or using them in conjunction with text.