Deep think still makes many many many more mistakes than gpt 5.5 pro on math