Comparing GPT-5.5, GPT-5.2 - LLM benchmark results