Yeah, attention sinks were applied to gpt-oss