Comparing GPT-5.4 mini, Haiku 4.5 - LLM benchmark results