This is the easiest set up on a Mac. You need at least 16gb on a MacBook:
https://github.com/ggml-org/llama.cpp/discussions/15396