Comparing Muse Spark, GPT-5.4 - LLM benchmark results