ollama is a wrapper on top of llama.cpp, and it makes llama.cpp slower, why use it?
Also Ollama has other issues (like forgetting what it really is - a wrapper).
ollama is a wrapper on top of llama.cpp, and it makes llama.cpp slower, why use it?
Also Ollama has other issues (like forgetting what it really is - a wrapper).