I think in general this lack affects almost all areas of human endeavor. All my speech teaching my kids how to think clearly, to young software engineers about how to build software in a some giant ass bureaucracy, how to debug some tricky problem, none of that sort of discovering truth one step at a time or teaching new stuff is in blogs or anything outside the moment.
When I do write something up, it is usually very finalized at that time; the process of getting to that point is not recorded.
The models maybe need more naturalistic data and more data from working things out.
The tech scales, but accessing the training data is a real problem. It's not like scraping the whole internet. And most of it is unlabeled.
I think in general this lack affects almost all areas of human endeavor. All my speech teaching my kids how to think clearly, to young software engineers about how to build software in a some giant ass bureaucracy, how to debug some tricky problem, none of that sort of discovering truth one step at a time or teaching new stuff is in blogs or anything outside the moment.
When I do write something up, it is usually very finalized at that time; the process of getting to that point is not recorded.
The models maybe need more naturalistic data and more data from working things out.
If you need more data to scale, and there is no data, it literally can't scale.
Scale is not always about trougput. You can be constrained by many things, in this case, data.
I was unclear. There is data, but it is expensive to access, so the value proposition is often not there without some beneficent entity.