If we want to close Human <-> Machine loop as much as possible(pre-neuralink).

Assuming that today the most efficient way for human to transfer information to machines is via voice. Assuming for machines to convey rich information to humans that's by printing html.

Then a combination of screen + eye tracking + voice is all you need. The mouse doesn't make sense anymore.

Links: https://x.com/trq212/status/2052809885763747935