OSWorld-Verified Benchmark Details | LLM Leaderboard | DataLearnerAI