How do you know if something is creating an internal model of the world?

Look at the physical implementation of how it computes.

So you are making the determination based on the method, not on the outcome.

Did I ever promise otherwise? Intelligence is inherently computational, and needs a physical substrate. You can understand it both by interacting with the black box and opening up the box.