The Intelligence Company (@Intelligence

The Intelligence Company

100 posts

The Intelligence Company

@Intelligence_ai

What’s the limit? Creators of @designarena, @predictionbench, @socialsarena

Joined January 2026

The Intelligence Company reposted
Design Arena
@Designarena
Jun 26
GLM-5.2 by @Zai_org is 5th in Mobile App Arena on Design Arena with an Elo of 1248. This is a 2 position jump from GLM-5.1, putting GLM-5.2 in the same performance band as Claude Sonnet 4.6 by @AnthropicAI. @Zai_org is the top open-weight lab in Mobile App Arena and the third
6.9K
The Intelligence Company reposted
Design Arena
@Designarena
Jun 25
Design Arena’s benchmarks now help power @OpenRouter’s MCP Get live model intelligence directly in your agent!
OpenRouter
@OpenRouter
Jun 25
Replying to @OpenRouter
The model performance rankings come from our new Benchmarks API, allowing your agent to query live benchmark scores (incl Artificial Analysis and Design Arena) Fun result: @Zai_org’s GLM-5.2 is the best available model for both coding & design Docs: openrouter.ai/docs/api/api-r…
6K
The Intelligence Company reposted
Design Arena
@Designarena
Jun 23
Article
What we've learned from 50,000+ AI slides generations… open sourced for you today
We've seen a massive uptick in demand for AI-generated slides at DesignArena - it’s become one of our fastest-growing creation categories. The harness-level implementation of slides is nuanced and...
22K
The Intelligence Company reposted
Design Arena
@Designarena
Jun 22
GLM-5.2 by @Zai_org is 2nd on Game Dev Arena on Design Arena with an Elo of 1368. This is a 6 position and 29 Elo jump from GLM-5.1, putting GLM-5.2 in the same performance band as Claude Fable 5 by @Anthropic. GLM-5.2 is the top open weight lab in Game Dev and second lab
65K
The Intelligence Company reposted
hilea
@hileamlak
Jun 19
Fable 5 vs GLM 5.2
Design Arena
@Designarena
Jun 19
Article
How GLM-5.2 Beat Fable 5 at Website Design
GLM 5.2 ranks 1st overall on Design Arena’s single-turn, HTML Web Design (Non-Agentic) evaluation, 5 places higher than its predecessor GLM-5.1. To do so, it beat Claude Fable 5, Opus 4.6, and Opus...
200K
The Intelligence Company reposted
Design Arena
@Designarena
Jun 19
Article
How GLM-5.2 Beat Fable 5 at Website Design
GLM 5.2 ranks 1st overall on Design Arena’s single-turn, HTML Web Design (Non-Agentic) evaluation, 5 places higher than its predecessor GLM-5.1. To do so, it beat Claude Fable 5, Opus 4.6, and Opus...
1.6M
The Intelligence Company reposted
Design Arena
@Designarena
Jun 16
BREAKING: Riverflow Pro 2.5, a reasoning model by @riverflow_ai that calls a mix of proprietary and open diffusion models, has scored 1st on Image Arena (Models + Routers), 1st on Graphic Design Arena, and 1st in Image Edit (Models + Routers). Riverflow Pro 2.5 averages 10 Elo
25K
The Intelligence Company
@Intelligence_ai
Jun 16
Congratulations to @Zai_org for establishing the new frontier!
Design Arena
@Designarena
Jun 16
BREAKING: GLM-5.2 is now 1st on Design Arena. With an Elo of 1360, GLM-5.2 has jumped ahead of the now unavailable Claude Fable 5. And it's open weights. This is an improvement of 4 positions and 27 Elo points to achieve one of the highest Elo scores in our code categories
1.9K
The Intelligence Company reposted
Design Arena
@Designarena
Jun 15
BREAKING: Reve 2.0 by @reve debuts at 2nd on Image Editing Arena with an Elo of 1325. Reve establishes a new Pareto frontier for Preference vs. Speed, faster than any model at this preference level with an average generation time of 86.8 seconds. Reve is now the highest-ranked
10K
The Intelligence Company reposted
Grace Li
@grx_xce
Jun 15
BREAKING: Le Chaton Fat has fully saturated our benchmark. We are at a loss for words. In response, we are retiring Design Arena. Congratulations to the @MistralAI team, and thanks for putting us on vacation.
92K
The Intelligence Company reposted
Design Arena
@Designarena
Jun 12
Article
Reve 2.0 establishes Reve as the top independent foundation image model lab
We are excited to introduce Reve 2.0 – Reve’s most capable image generation model to date. With this release, Reve becomes the highest-ranked independent foundation image model lab on Design Arena....
4.5K
The Intelligence Company reposted
Design Arena
@Designarena
Jun 12
Opus 4.8’s hyperfocus on agents may be making it worse at design. Opus 4.8 ranks 23rd overall on single-turn HTML Web Dev, a dramatic regression from Fable (1st), Opus 4.6 (2nd), and Opus 4.7 (3rd). This was particularly surprising as @AnthropicAI models have held the top spots
16K
The Intelligence Company reposted
Design Arena
@Designarena
Jun 10
BREAKING: Reve 2.0 by @reve is now 2nd overall on Image Arena with an Elo of 1354. Reve 2.0 establishes a 34 point Elo gap above GPT-Image 1.5 by @OpenAI in 3rd place. With this release, Reve is now the top independent foundation image model lab. Congratulations to the @reve
95K