Could go nicely with https://auge.franzai.com/ ( CLI on Apple Vision frameworks ) - do the first pass locally. If needed call their API for a more detailed analysis and then _finally_ we produce meaningful alt texts for images in HTML at a reasonable price ;)