Comparing Claude Mythos Preview, GPT-5.4 Pro - LLM benchmark results