7 articles

#model-release

Anthropic Releases Claude Opus 4.8 at Flat Pricing

Anthropic's Opus 4.8 raises SWE-bench to 69.2%, cuts Fast mode cost by 3×, and ships Dynamic Workflows for parallel sub-agent orchestration.

May 29, 20261 min read

Technologybreaking

Qwen 3.7 Max Ships, Tops Intelligence Index Above Gemini 3.5 Flash

Alibaba's Qwen 3.7 Max enters the top-10 intelligence index above Gemini 3.5 Flash and solves a 12-month-old proprietary puzzle benchmark on first attempt — generation-over-generation reasoning benchmark: 8 to 44.

May 27, 20261 min read

Technologybreaking

Gemini 3.5 Flash Disappoints on Coding; 3.5 Pro Delayed One Month

Gemini 3.5 Flash scores ~50% on Cursor Bench, 15 points below frontier at 4x the cost of Cursor Composer 2.5. Gemini 3.5 Pro delayed another month, narrowing Google's benchmark recovery window.

May 27, 20261 min read

Technologybreaking

Meta Debuts MuseSpark: First Proprietary Closed Model Near Top of Leaderboard

Meta releases MuseSpark, its first proprietary closed model at an estimated ~70B×16 parallel agents (>1T effective parameters), ranking just below Claude on major leaderboards.

May 20, 20261 min read

Technologybreaking

Google Gemma 4 Launches Under Apache 2.0: 31B Ranks #3 on LM Arena

Google DeepMind releases Gemma 4 under Apache 2.0: four models including a 31B dense ranking #3 globally on LM Arena and on-device E2B/E4B variants with audio, vision, and 256K context.

April 28, 20261 min read

Industrybreaking

Sam Altman Signals GPT-5.5 or GPT-6 Release for April 23

Sam Altman signaled via emoji reply that GPT-5.5 or GPT-6 may launch April 23 — confirmed independently by @swyx. No official announcement published yet.

April 23, 20261 min read

Technologybreaking

Anthropic Releases Claude Opus 4.7 at Unchanged Pricing

Anthropic releases Opus 4.7 with SWE-bench Verified up 7 points, vision ceiling tripled to 3.75 MP, a new xhigh reasoning tier, and no price increase.

April 17, 20261 min read

AI Intelligence Newsletter

Curated AI insights — sent when there's something worth your inbox.