Fal has the multi-image interface to test against. (Replicate might as well; I haven't checked yet.)

THIS MODEL ROCKS!

It's no gpt-image-1, but it's ridiculously close.

There isn't going to be a moat in images or video. I was so worried Google and OpenAI would win creative forever. Not so. Anyone can build these.