7 articles

#on-device-ai

NVIDIA RTX Spark: 128GB Unified Memory for On-Device AI

NVIDIA RTX Spark brings ~1 PFLOP FP4 and up to 128GB unified memory to laptops and desktops — enough to run frontier-sized open models on-device. Shipping fall 2026.

June 7, 2026 · 1 min read

Tools · Breaking

Google Magenta RealTime 2: 200ms On-Device Music Generation

Google Magenta RealTime 2: open-weights real-time music at ~200ms latency, text/audio/MIDI control, 2.4B params — runs on a MacBook without a GPU. Previous latency: ~3 seconds.

June 7, 2026 · 1 min read

Technology · Breaking

Google Releases Gemma 4 12B: Encoder-Free Multimodal

Google's Gemma 4 12B is an encoder-free multimodal model (text, audio, video, image) that runs in 16GB VRAM and ships under Apache 2.0. Day-0 support in Transformers, llama.cpp, MLX, and Red Hat OpenShift.

June 7, 2026 · 1 min read

Technology · Breaking

Apple to Showcase On-Device AI at WWDC Using Distilled Gemini

Apple plans to use WWDC 2026 to showcase on-device AI via custom silicon and a distilled Gemini model — framing local inference as its competitive edge over cloud-dependent rivals.

May 30, 2026 · 1 min read

Technology · Breaking

Google Ships Chrome On-Device AI API Over W3C and Mozilla Objections

Google shipped the Chrome on-device Prompt API over objections from W3C, Mozilla, WebKit, and Microsoft, letting any site query Gemini Nano directly.

May 9, 2026 · 1 min read


Research · Notable

Talkie-LM: The 13B Model Frozen in 1931

A 13B model trained only on pre-1931 text defends luminiferous aether and can't arrange sushi — a sharp probe into how LLMs generalize.

April 28, 2026 · 2 min read

Strategy · Breaking

Apple's CEO Transition to Hardware Engineers Reads as On-Device AI Pivot

Apple puts two silicon engineers at the top, signaling a strategic pivot from cloud-AI velocity to on-device inference — a bet on winning a different race entirely.

April 27, 2026 · 1 min read
