With LLMs this fast, you could imagine using them as any old function in programs.

You could always have. Assuming you have an API or a local model.

Which was always the killer assumption, and this changes little.