Skip to content

AI Leaderboard

41 models benchmarked · updated May 24, 2026

Agentic Coding

  • 🥇GPT-5.5 (xhigh)OpenAI59.1
  • 🥈Gemini 3.1 Pro PreviewGoogle55.5
  • 🥉Claude Opus 4.7 (Non-reasoning, High Effort)Anthropic53.1
  • 4Qwen3.7 MaxAlibaba50.1
  • 5DeepSeek V4 Pro (Reasoning, Max Effort)DeepSeek47.5
  • 6Muse SparkMeta47.5
  • 7Grok 4.20 0309 (Reasoning)xAI42.2
  • 8Mistral Medium 3.5Mistral35.4
  • 9NVIDIA Nemotron 3 Super 120B A12B (Reasoning)NVIDIA31.2

Conversation

  • 🥇GPT-5.5 (xhigh)OpenAI60.2
  • 🥈Claude Opus 4.7 (Adaptive Reasoning, Max Effort)Anthropic57.3
  • 🥉Gemini 3.1 Pro PreviewGoogle57.2
  • 4Qwen3.7 MaxAlibaba56.6
  • 5Grok 4.3 (high)xAI53.2
  • 6Muse SparkMeta52.2
  • 7DeepSeek V4 Pro (Reasoning, Max Effort)DeepSeek51.5
  • 8Mistral Medium 3.5Mistral39.2
  • 9NVIDIA Nemotron 3 Super 120B A12B (Reasoning)NVIDIA36.0

Image Editing

  • 🥇GPT Image 1.5 (high)OpenAI1264
  • 🥈Nano Banana Pro (Gemini 3 Pro Image)Google1240
  • 🥉grok-imagine-image-qualityxAI1229
  • 4Wan 2.7 ProAlibaba1202

Image to Video

  • 🥇grok-imagine-videoxAI1325
  • 🥈Veo 3.1 Fast PreviewGoogle1280
  • 🥉Wan 2.5 PreviewAlibaba1244
  • 4SoraOpenAI977

Text to Image

  • 🥇GPT Image 2 (high)OpenAI1339
  • 🥈Nano Banana 2 (Gemini 3.1 Flash Image Preview)Google1264
  • 🥉grok-imagine-image-qualityxAI1210
  • 4Qwen Image Max 2512Alibaba1159
  • 5Sana Sprint 1.6BNVIDIA941
  • 6Janus ProDeepSeek714

Text to Speech

  • 🥇Gemini 3.1 Flash TTSGoogle1209
  • 🥈TTS-1 HDOpenAI1095
  • 🥉Voxtral TTSMistral1070
  • 4Magpie-Multilingual 357M (Feb 2026)NVIDIA1060
  • 5Qwen3 TTS FlashAlibaba934

Text to Video

  • 🥇grok-imagine-videoxAI1233
  • 🥈Veo 3.1 PreviewGoogle1224
  • 🥉Wan 2.6Alibaba1192
  • 4Sora 2 ProOpenAI1187

Spoton orchestrates all of this for you

Why pick one when you can have the best of every AI, for every task?

Try Spoton