If you're using the api cost of the model to estimate it's size, then you can't use this size estimate to estimate the inference cost.