I saw this on X/Twitter. I do not believe that human cooking, and all of its techniques and ingredients and the various ways that things can be prepared in different cultural contexts can be compressed in to 2 megabytes.
It is sort of like saying here is a 1GB model that can do tool calling and coding and then you try it out and it barely functions. Yes, it technically is a 1GB coding model, but it isn't a good one.
The space of palatable human food is small. There are only a few thousand ingredients and a few thousand preparation techniques. This could easily be compressed at high fidelity into a model.
I say prove it. If it was a coding bot we would see some metrics on how it did. Have this thing produce a bunch of on demand recipes and judge it against human chefs. Have the people preparing the food cook according to its descriptions faithfully as either robots or non-cooking people so that their knowledge doesn't leak into the preparation. And then let's judge those recipes.
IF you need experienced culturally knowledgeable chefs to prepare the food for it to work, then you haven't encoded all the techniques, just crib notes.
Citations needed