It sounds like the goal is to get the code to human review without it being obviously broken in CI but the agent has no idea that's the case.

Yeah, it is about making sure that EVERY actionable PR comment gets addressed - whwther by fixing, resolving, creating a new issue, commenting that it is a will not fix, or blocking for human review - and then giving you a clear deterministic check you can do to reliably enforce your policy.