The best point in this thread: "intuition as evals." That's actually the crux. PMs have context about what the product should do that engineers often don't, and that intuition IS an eval. The problem is it doesn't scale once the system has millions of conversations no one is reading. The empathy from vibe coding is real, the prototype-as-spec culture is where it breaks down for AI products specifically.