openinterpreter has been doing this for a while, with a bunch of LLMs, glad to see first party support for this use case