Comparing Claude Sonnet 4.6, GPT-5.2 - LLM benchmark results