Comparing GPT-5.2, GPT-5 - LLM benchmark results