>Some amount of knowledge is required for reasoning.

This is the root of problem. If you think about STEM universities, they don't really teach you things you need in the real world. They teach you what you need to know in order to go out there and accumulate the necessary information which can then be used to solve problems. Giving a person access to the internet or a super powerful calculator (like Mathematica) won't mean that they can do anything useful. They need tons of experience to use these tools in an effective way. That experience is basically all that implicit adjacent knowledge that we pick up along the way getting our degrees. And LLMs pick that up during pre-training. Drop this part and the outcome will be worthless.

Take mathematics as an example. Humanity has found math notation, which allowed to express math rules — distill them to the core. Before math was expressed in prose — a very inefficient way, very similar to current LLMs.

In my school, math teacher was giving me prose, which I was converting to math notation. I could argue, that this prose→reasoning conversion is not required at training, and can be obtained at inference time with search tools.