Terminal Bench Hard Benchmark Details | LLM Leaderboard | DataLearnerAI