Kind of? Even watching is probably a bit of a stretch here. The point of an MCP server is to be a sort of AI translator for whatever you're inputting. Here we're inputting an iframe that's running a wasm binary. So I imagine in theory all the AI sees is the actual iframe and whatever is in memory currently for the wasm game. Funny enough without some sort of screenshot tool on top of this I'm not sure the AI can actually 'see' the game at all.