I think it's a compelling argument. You would need a large dataset of completed games on which to train, which may have something to do with why the games considered solved by AI are also among those where exist a very rich and heavily annotated corpus of completed games in algebraic notation.
Of course - but in practice you won't be aiming towards fully a "solved" game or that kind of player skill for something like Civ - and even so, I severely doubt an LLM realistically can hope to even get in the vicinity unless the aforementioned "harness" does something similar anywayas part of its heavy lifting I mentioned.