Hacker News

Most LLMs can reason. They can't use software.

GhostDesk gives your agent a full Linux desktop and the motor skills to operate it like a human realistic mouse movement, natural typing, screenshot fallback for CAPTCHAs. It reads UIs semantically and behaves like a real user when sites try to detect bots.

Book a flight, scrape a site without selectors, operate legacy software with no API, run QA across an entire app one prompt. If a human can do it on a desktop, your agent can too.

Runs in Docker. Spin up multiple instances in parallel, each driven by a sub-agent. No real ceiling.

Works with Claude, GPT, Gemini, or any local model (Ollama, LM Studio). MIT.