While this is very cool and llamafiles are quite universal there is anyway a nuance for Window systems which is the size limit for a Windows executable which is 4Gb maximum. As LLM models are tend to be quite large this limit is reached pretty fast. So for such cases llamafile.exe will be required (which is also universal and runs everywhere). And at the end it could be just llama.cpp tools which released for all platforms + the LLM model file itself.