Hacker News

> I've tried using qwen and deepseek but they can't even output documents

What agent harness did you use? Usually, "write_file", "shell_exec" or similar is two of the first tools you add to an agent harness, after read_file/list_files. If it doesn't have those tools, unsure if you could even call it a agent harness in the first place.

jxmesth 18 hours ago [ - ]

Sorry for the confusion, I was actually talking about their Web based chat. Since most of my work is governance and docs, I just use their Web chats and they just refuse to output proper documents like Claude or Chatgpt do.

embedding-shape 18 hours ago [ - ]

Aha... Well, I let Codex (Claude Code would work too) manage/troubleshoot .xlsx files too, seems to handle it just fine (it tends to un-archive them and browse the resulting XML files without issues), seen it do similar stuff for .app and .docx files too so maybe give that a try with other harnesses/models too, they might get it :)

jxmesth 7 hours ago [ - ]

Yeah, it's just way easier to do via the web/mobile app but I'll give using it via the CLI a try. Thanks :)

noduerme 16 hours ago [ - ]

You're not giving an AI command line access to your work computer? How do you expect to keep up? /s

dymk 16 hours ago [ - ]

You give it command line access in a VM...

ycui1986 12 hours ago [ - ]

i give it in real ubuntu, no vm, no docker. so long I don't ask it to organize files, it will behave. it has not screw me so far.

DeathArrow a few seconds ago [ - ]

I only run it with --dangerously-skip-permissions. YOLO!

dymk 12 hours ago [ - ]

Godspeed

koen_hendriks 15 hours ago [ - ]

You mean a VM like the one that contains a 0day that can escape the sandbox that gets found every year at pwn2own?

enneff 14 hours ago [ - ]

Presumably you’re also using a browser to view this web page. There have also been vulnerabilities in that. You have to draw a line somewhere.

andai 13 hours ago [ - ]

I run mine as a separate unprivileged user. (No VM.) Am I pwned?

dymk 12 hours ago [ - ]

Maybe, but the sort of 0days you're talking about aren't exploited in any meaningful way for almost all developers.

arcanemachiner 10 hours ago [ - ]

"Seatbelts don't save the life of everyone who gets into an accident, so why bother wearing one?"

chillfox 12 hours ago [ - ]

You can make a harness fully functional with just the "shell_exec" tool if you give it access to a linux/unix environment + playwright cli.