The point being made doesn’t impact people who can find utility from LLM output.
It’s only when you need to apply it to domains outside of code, or a domain where it needs to actually reason, that it becomes an issue.
The point being made doesn’t impact people who can find utility from LLM output.
It’s only when you need to apply it to domains outside of code, or a domain where it needs to actually reason, that it becomes an issue.
What does actually reason mean? It's doing this complex anesthesiologist x crna x resident surgery scheduling thingy for ~60 surgeries a day for this one client. Looked a lot like LSAT logic games stuff scaled up to me, took me almost 20-30m to hand check. Is that reasoning?