When you make API calls to generate chat completions, you specify a list of tools. They can be MCP tools, or just arbitrary tool metadata.

The API will then respond when it needs the client code to compute a tool output.

got it, thanks!