A quick search say that this is a standard feature you cache the prefill and load it at PCIe bandwidth so it should be about 0.2s
A quick search say that this is a standard feature you cache the prefill and load it at PCIe bandwidth so it should be about 0.2s