Hacker News

This story talks about MLX and Ollama but doesn't mention LM Studio - https://lmstudio.ai/

LM Studio can run both MLX and GGUF models but does so from an Ollama style (but more full-featured) macOS GUI. They also have a very actively maintained model catalog at https://lmstudio.ai/models

ZeroCool2u 11 hours ago [ - ]

LMStudio is so much better than Ollama it's silly it's not more popular.

thehamkercat 10 hours ago [ - ]

LMStudio is not open source though, ollama is

but people should use llama.cpp instead

smcleod 10 hours ago [ - ]

I suspect Ollama is at least partly moving away open source as they look to raise capitol, when they released their replacement desktop app they did so as closed source. You're absolutely right that people should be using llama.cpp - not only is it truly open source but it's significantly faster, has better model support, many more features, better maintained and the development community is far more active.

parthsareen 4 hours ago [ - ]

Desktop app is open-source now.

skhameneh 2 hours ago [ - ]

ik_llama is almost always faster when tuned. However, when untuned I've found them to be very similar in performance with varied results as to which will perform better.

But vLLM and Sglang tend to be faster than both of those.

behnamoh 10 hours ago [ - ]

> LMStudio is not open source though, ollama is

and why should that affect usage? it's not like ollama users fork the repo before installing it.

thehamkercat 10 hours ago [ - ]

It was worth mentioning.

nateb2022 9 hours ago [ - ]

> but people should use llama.cpp instead

MLX is a lot more performant than Ollama and llama.cpp on Apple Silicon, comparing both peak memory usage + tok/s output.

edit: LM Studio benefits from MLX optimizations when running MLX compatible models.

Abishek_Muthian 6 hours ago [ - ]

Besides optimizations specific to running locally lands in lamma.cpp first.

ekianjo 6 hours ago [ - ]

Ollama did not open source their GUI.

jmorgan 5 hours ago [ - ]

The source is available here: https://github.com/ollama/ollama/tree/main/app

thehamkercat 10 hours ago [ - ]

I think you should mention that LM Studio isn't open source.

I mean, what's the point of using local models if you can't trust the app itself?

rubymamis 17 minutes ago [ - ]

You can always use something like Little Snitch to not allow it to dial home.

behnamoh 10 hours ago [ - ]

> I mean, what's the point of using local models if you can't trust the app itself?

and you think ollama doesn't do telemetry/etc. just because it's open source?

parthsareen 4 hours ago [ - ]

You're welcome to go through the source: https://github.com/ollama/ollama/

thehamkercat 10 hours ago [ - ]

That's why i suggested using llama.cpp in my other comment.

satvikpendem 10 hours ago [ - ]

Depends what people use them for, not every user of local models is doing so for privacy, some just don't like paying for online models.

thehamkercat 10 hours ago [ - ]

Most LLM sites are now offering free plans, and they are usually better than what you can run locally, So I think people are running local models for privacy 99% of the time

midius 11 hours ago [ - ]

Makes me think it's a sponsored post.

Cadwhisker 10 hours ago [ - ]

LMStudio? No, it's the easiest way to run am LLM locally that I've seen to the point where I've stopped looking at other alternatives.

It's cross-platform (Win/Mac/Linux), detects the most appropriate GPU in your system and tells you whether the model you want to download will run within it's RAM footprint.

It lets you set up a local server that you can access through API calls as if you were remotely connected to an online service.

vunderba 10 hours ago [ - ]

FWIW, Ollama already does most of this:

- Cross-platform

- Sets up a local API server

The tradeoff is a somewhat higher learning curve, since you need to manually browse the model library and choose the model/quantization that best fit your workflow and hardware. OTOH, it's also open-source unlike LMStudio which is proprietary.

randallsquared 10 hours ago [ - ]

I assumed from the name that it only ran llama-derived models, rather than whatever is available at huggingface. Is that not the case?

fenykep 10 hours ago [ - ]

No, they have quite a broad list of models: https://ollama.com/search

[edit] Oh and apparently you can also directly run some models directly from HuggingFace: https://huggingface.co/docs/hub/ollama

evacchi 9 hours ago [ - ]

ramalama.ai is worth mentioning too

ekianjo 6 hours ago [ - ]

Lmstudio runs llama.cpp under the hood.

selcuka 5 hours ago [ - ]

They also run the Apple MLX engine on macOS.