I'm not talking about keyboards or screen readers or any sort of input testing; I'm talking about how the software is used in practice.
If you disagree with that, I think the onus is on you to show me that an LLM could simulate the full context in which a user interacts with software. To me, that's a ridiculous claim.
Feel free to show literally any evidence for this claim.