AI Leaderboard
41 models benchmarked · updated April 9, 2026
Agentic Coding
- 🥇GPT-5.4 (xhigh)OpenAI57.3
- 🥈Gemini 3.1 Pro PreviewGoogle55.5
- 🥉Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort)Anthropic50.9
- 4Muse SparkMeta47.5
- 5Qwen3.6 PlusAlibaba42.9
- 6Grok 4.20 0309 (Reasoning)xAI42.2
- 7DeepSeek V3.2 SpecialeDeepSeek37.9
- 8NVIDIA Nemotron 3 Super 120B A12B (Reasoning)NVIDIA31.2
- 9Mistral Small 4 (Reasoning)Mistral24.3
Conversation
- 🥇Gemini 3.1 Pro PreviewGoogle57.2
- 🥈GPT-5.4 (xhigh)OpenAI57.2
- 🥉Claude Opus 4.6 (Adaptive Reasoning, Max Effort)Anthropic53.0
- 4Muse SparkMeta52.1
- 5Qwen3.6 PlusAlibaba50.0
- 6Grok 4.20 0309 v2 (Reasoning)xAI49.3
- 7DeepSeek V3.2 (Reasoning)DeepSeek41.7
- 8NVIDIA Nemotron 3 Super 120B A12B (Reasoning)NVIDIA36.0
- 9Mistral Small 4 (Reasoning)Mistral27.2
Image Editing
- 🥇GPT Image 1.5 (high)OpenAI1271
- 🥈Nano Banana Pro (Gemini 3 Pro Image)Google1251
- 🥉grok-imagine-imagexAI1225
- 4Wan 2.6 ImageAlibaba1195
Image to Video
- 🥇grok-imagine-videoxAI1331
- 🥈Veo 3.1 FastGoogle1289
- 🥉Wan 2.5 PreviewAlibaba1252
- 4SoraOpenAI977
Text to Image
- 🥇GPT Image 1.5 (high)OpenAI1265
- 🥈Nano Banana 2 (Gemini 3.1 Flash Image Preview)Google1257
- 🥉grok-imagine-imagexAI1171
- 4Qwen Image Max 2512Alibaba1149
- 5Sana Sprint 1.6BNVIDIA935
- 6Janus ProDeepSeek704
Text to Speech
- 🥇TTS-1OpenAI1102
- 🥈Gemini 2.5 Flash Lite TTSGoogle1080
- 🥉Voxtral TTSMistral1029
- 4Magpie-Multilingual 357MNVIDIA1001
- 5Qwen3 TTSAlibaba932
Text to Video
- 🥇grok-imagine-videoxAI1229
- 🥈Veo 3.1 PreviewGoogle1227
- 🥉Sora 2 ProOpenAI1195
- 4Wan 2.6Alibaba1187
Spoton orchestrates all of this for you
Why pick one when you can have the best of every AI, for every task?
Try Spoton