Aider polyglot github

https://livebench.ai/#/ https://openrouter.ai/rankings

https://arcprize.org/leaderboard?fbclid=IwY2xjawJkGOJleHRuA2FlbQIxMAABHpInxwGwuzaVHnGeNNycEGfhmweu8Xb_aBq5dhGnOHLm1qEbktYZYnqZzNmc_aem_ttSWRTegPXjvOSU1K0DAlg

![[Pasted image 20250410103636.png]]

EQ-Bench - Longform Creative Writing: paper ![EQ-Bench][https://eqbench.com/images/eqbench3-judge-comparison.png]

Judge Comparison