llama-cpp provides an API server as well via llama-server (and a competent webgui too).