What would you say is your benchmark for calling something intelligent?

Can it solve problems?