# open-source

Gradio 6.16.0 Patches Path Traversal, OAuth, and SSRF

Gradio 6.16.0 patches path traversal (gr.FileExplorer), OAuth open-redirect bypass, and SSRF in Image/Gallery/Audio post-processing. Immediate upgrade recommended for all deployments.

June 7, 2026 · 1 min read

Industry · Breaking

Supabase Raises $500M at $10B Valuation

Supabase raises $500M at $10B — with a 25% vested-share cashout option and 10-year exercise window that sets a new bar for developer-friendly equity terms.

June 7, 2026 · 1 min read

LiteParse v2 Released: 100× Faster Open-Source PDF Parser in Rust

LiteParse v2 is a Rust rewrite of LlamaIndex's PDF parser achieving 100× speed gains, sub-second on 450-page docs, with WASM and multi-language bindings now live.

May 29, 2026 · 1 min read

Tencent Hy-MT2: 440MB Open-Source Model Beats Microsoft Translation API

Tencent Hy-MT2: open-source 33-language translation model at 440MB (1.25-bit quantized) outperforms Microsoft commercial API and runs on mobile chips. Three sizes: 1.8B, 7B, 30B-A3B.

llama.cpp Ships WebGPU Backend: Full Browser-Based GPU Inference, No Install

llama.cpp ships a WebGPU backend enabling GPU-accelerated LLM inference directly in any modern browser — no data leaves the device, no install required, 18 months in development.

HuggingFace Ships LeRobot Humanoid: Open-Source Bipedal Robot for $2,500

HuggingFace releases LeRobot Humanoid — a fully open-source bipedal robot platform at ~$2,500 build cost with 3D-printed hardware, runtime, and training environments included.

Perplexity Open-Sources Bumblebee: Security Scanner for AI Dev Environments

Perplexity open-sources Bumblebee (Apache 2.0) — a supply-chain security scanner explicitly targeting MCP config files, browser extensions, and packages in AI developer environments.

GLM 5.1: First Open-Source ~1 Trillion Parameter Frontier Model Ships

Z.ai releases GLM 5.1: the first open-source ~1 trillion parameter FP16 frontier model. EXO Labs ran it on a $40K 4-Mac-Studio cluster via RDMA-over-Thunderbolt at ~20 tok/s.

Cohere Launches Command A+ Under Apache 2.0 for Enterprise Agents

Cohere releases Command A+ under Apache 2.0 — their most powerful LLM yet, optimized for minimal hardware, available on HuggingFace and Microsoft Foundry with W4A4 quantization and near-zero perf loss.

agentmemory Crosses 11.6k Stars: Persistent Memory Daemon for Coding Agents

agentmemory hits 11.6k GitHub stars as a cross-agent persistent memory daemon: 92% fewer tokens/session, 95.2% retrieval accuracy, SQLite-only, Apache-2.0.

AgentWall: Open-Source MCP-Proxy Runtime Safety Layer at 92.9% Enforcement Accuracy

AgentWall is an open-source MCP-proxy runtime safety layer achieving 92.9% enforcement accuracy with sub-millisecond overhead across Claude, Cursor, and Windsurf deployments.

LangChain Deep Agents v0.6: Harness Profiles, In-Loop Code REPL, Streaming

LangChain Deep Agents v0.6 ships with harness profiles, an in-loop code REPL for dataset processing without bash, and streaming — the biggest LangChain OSS release yet.

NVIDIA Releases SANA: 20× Smaller, 100× Faster Than Flux-12B

NVIDIA's SANA family runs 20× smaller and 100× faster than Flux-12B, generating 1024px images in 0.1s on H100 with full Diffusers, ComfyUI, and SGLang integration on day one.

[Image: Two competing AI compute installations face each other across a luminous chasm, representing the US-China AI competition framed in Anthropic's 2028 essay]

Strategy · Notable

Anthropic's '2028' Essay Draws Battle Lines on US-China AI and Open Source

Anthropic's '2028' essay frames the AI finish line as recursive self-improvement, splitting the industry on compute restriction vs open-source export strategy.

May 15, 2026 · 2 min read

Cline Open-Sources Its Runtime as a Model-Agnostic Agent SDK

Cline open-sources its runtime as @cline/sdk: plugins, scheduled agents, persistent sessions, any model/provider. Repositions from AI IDE to agent infrastructure.

Resemble AI Releases Dramabox: Open-Source Voice Model with Cryptographic Provenance

Resemble AI Dramabox: first open-source voice model with built-in cryptographic provenance signature. Expressive performance plus verifiable ownership. Available now.

GLM 5.1 Now Leads Artificial Analysis Intelligence Index Over Closed Models

GLM 5.1 tops Artificial Analysis intelligence index over all closed models. Chinese open-weights model leads SWE-Bench Pro. Intelligence index growth exceeding Moore's Law.

Sulphur 2: Uncensored Open-Source Video Model Generates 10s Clips on 16GB VRAM

Sulphur 2: uncensored open-weights video model on LTX base, 10s/24fps clips on 16GB VRAM, 125K+ training videos. Available on HuggingFace via ComfyUI/Pinokio.

Datadog Toto 2.0: First Time Series Foundation Model with Reliable Scaling Laws

Datadog Toto 2.0: open-weights TSFM family 4M–2.5B params, Apache 2.0. First time series FM with reliable scaling laws. Leads BOOM, GIFT-Eval, TIME benchmarks.

Hugging Face Hub Crosses 1 Million Open Datasets

Hugging Face Hub hits 1M open datasets: the 500K-to-1M doubling took 8 months vs 4 years for the first half — CEO credits AI agents for the acceleration.

May 12, 2026 · 1 min read

ByteDance Open-Sources 7B Desktop GUI Control Agent

ByteDance open-sources a 7B desktop GUI control model capable of operating any application without app-specific training — 10k GitHub stars on day one.

May 12, 2026 · 1 min read

GGUF Ecosystem Hits 176K Models; Monthly Growth Nearly Doubled Since March

GGUF local models on Hugging Face hit 176K with monthly creation rates doubling since March — local AI adoption has crossed an inflection point.

HiDream-01 by Vivago Becomes Top Open-Source Image Model on Artificial Analysis

HiDream-01 ranks #8 on Artificial Analysis globally — the top open-source image model — with no-VAE pixel generation and best-in-class text rendering.

Zyra Releases Zia 1 8B: First Frontier-Tier Model Trained Entirely on AMD

Zyra's Zia 1 8B is the first frontier-tier model trained fully on AMD Instinct GPUs — Apache 2.0, fits consumer hardware, competes with 235B models.

Strategy · Breaking

AI Alliance Launches Project Tapestry for AI Sovereignty Across 7 Nations

AI Alliance's Project Tapestry convenes in Paris to build open, sovereign AI infrastructure for seven nations across Asia, Europe, and Southeast Asia.

Mistral Releases First Open-Source Frontier TTS Model, 17ms First-Audio

Mistral's first open-source frontier TTS model delivers 17ms first-audio latency on a single GPU — a step change for voice-enabled AI agents.

Reachy Mini Open-Source Backend: 3,000+ Robots Online in 48 Hours

Reachy Mini gets a fully open-source voice agent backend on Hugging Face infra — 3,000+ robots connected in 48 hours, costs cut from $20+/day to near zero.

May 9, 2026 · 1 min read

LangChain and Harvey Open-Source Legal Agent Benchmark LAB

LangChain and Harvey released LAB, an open-source benchmark for AI agents on long-horizon legal tasks covering research, analysis, and document drafting.

May 7, 2026 · 1 min read

OpenAI Opens MRC Networking Protocol for AI Clusters

OpenAI, together with AMD, Broadcom, Intel, Microsoft, and NVIDIA, released MRC, an open networking protocol that reduces wasted GPU time in large AI training clusters.

May 7, 2026 · 1 min read

Voice-Pro: Open-Source Local Pipeline Replaces $23-48/hr SaaS Dubbing

Voice-Pro OSS: a full YouTube-to-dubbed-video pipeline covering 100 languages that runs on a 4GB NVIDIA GPU, free and local, replacing $23-48/hr cloud dubbing services. 3,439 GitHub likes and spreading quickly.

May 6, 2026 · 1 min read

Strategy · Breaking

Turkey's Presidential Comms Directorate Joins Hugging Face, Triggering Sovereign AI Call

Turkey's Presidential Communications Directorate joins Hugging Face as the first public institution on the platform; HF CEO Clément Delangue calls for governments worldwide to pursue sovereign AI via open source.

May 6, 2026 · 1 min read

Superlinked Open-Sources 'Sie' Small-Model Inference Engine

Superlinked 'Sie' open-sourced: hot-swap multi-model inference on single GPU, per-family forward passes (BERT/Qwen/ColBERT), variable-length flash attention, KEDA auto-scaling. Addresses 'context rot' preprocessing gap.

May 6, 2026 · 1 min read

DocuSeal: Open-Source DocuSign Replacement Deployable in 30 Min for €5/mo

DocuSeal (AGPL-3.0, 11K+ stars) replicates DocuSign in 30 minutes on a €5/mo VPS — 13 field types, eIDAS/HIPAA compliance, vs DocuSign's $17,250/yr median contract.

May 5, 2026 · 1 min read

Nous Research Releases Hermes Agent v0.9.0: Skill-Compounding Provider-Agnostic CLI

Nous Research's Hermes Agent v0.9.0 records sessions as reusable skill trajectories — older installs outperform fresh ones as skills compound over time. Supports Claude, GPT, Kimi, Ollama, OpenRouter.

May 5, 2026 · 1 min read

DeepSeek V4 Shocks Users with Cost Differential vs Claude at 10M+ Tokens

After Claude rate limits, a researcher ran 10M+ tokens on DeepSeek V4 and was shocked by the cost gap. swyx: 'Efficiency is back on the menu again boys.'

May 4, 2026 · 1 min read

Alibaba's AgenticQwen-30B (3B Active) Matches Qwen3-235B on Tool-Use

AgenticQwen-30B-A3B scores 50.2 avg matching Qwen3-235B on tool-use benchmarks. Dual RL flywheels flip the cost curve for production agents.

May 4, 2026 · 1 min read

Industry · Breaking

Vibe-Kanban Shuts Down Live Onstage at AIE Europe with 30K MAU

Vibe-kanban shut down live onstage at AIE Europe despite 30K MAU; the CEO cited failure to sell to enterprises and failure to resell tokens. The project is now open-sourced.

Aiden Open-Sources Local AI OS: 1,500 Skills, No Cloud Required

Aiden open-sources a local AI OS for Windows/Linux: 1,500+ skills, 89+ tools, 6-layer knowledge graph memory, subagent swarms, voice, Discord/Telegram — Ollama-backed.

OpenKB Launches Wiki-Style Knowledge Base Without Chunking or Vectors

VectifyAI's OpenKB builds structured wikis from raw documents without chunking or vectors, using LLMs to accumulate cross-document links and contradiction detection.

Qwen3.6-27B Outperforms 10× Larger Qwen3.5-397B on Coding Tasks

Qwen3.6-27B (Apache 2.0) beats Qwen3.5-397B on coding benchmarks, suggesting that a targeted training recipe can outperform brute parameter scale on agentic coding tasks.

Ai2 Releases BAR: Modular MoE Post-Training for LLM Domain Updates

Ai2 releases BAR (Branch-Adapt-Route): modular MoE post-training with +16.5 coding and +13 math on BAR-5x7B, linear per-domain update cost. Apache 2.0, full checkpoints.

OpenMythos: Open-Source Hypothetical Claude Mythos Architecture

Kye Gomez releases OpenMythos: a ~770M open-source hypothetical Mythos architecture (Recurrent-Depth Transformer + MoE + LoRA adapters) gaining thousands of GitHub stars.

NVIDIA Releases Nemotron 3 Nano Omni: Open 30B Multimodal Model

NVIDIA open-releases Nemotron 3 Nano Omni (30B MoE/3B active): unified video/audio/image/text model with 9× video-reasoning capacity improvement vs. predecessors.

Tencent Releases Cube Sandbox: Sub-60ms VM Runtime for AI Agents

Tencent open-sources Cube Sandbox: a RustVMM/KVM agent VM runtime with sub-60ms cold start, under 5MB per instance, and drop-in E2B SDK compatibility.

HeyGen Releases HyperFrames: Agent-Native HTML-to-Video Framework

HeyGen open-sources HyperFrames: an HTML-to-video framework via Puppeteer/FFmpeg with native Claude Code, Cursor, Codex, and Gemini CLI skills. Apache 2.0, Node.js 22+.

Agent-Generated PRs on HuggingFace Transformers Quadrupled; Auto-Merge Showed Zero Regression

Agent-generated PRs to HuggingFace transformers quadrupled; bulk-merging hundreds of them showed zero benchmark regression on arc_challenge, gsm8k, hellaswag.

Mercor APEX-Agents Benchmark Gets Hugging Face Leaderboard for Open-Source Models

APEX-Agents benchmark for consultant/lawyer/banker-level AI work now has a Hugging Face leaderboard for open-source model evaluation.

NimbleList: Open-Source Visual Workspace for Claude Code and Codex CLI Orchestration

NimbleList is an open-source GUI layered over Claude Code and Codex CLI, adding Kanban, Mermaid, Excalidraw, and plan.md without a new SaaS subscription.

China Open-Sources Ling-2.6-1T: Trillion-Parameter Model Claiming Fewer Tokens Than US Peers

China's Ling-2.6-1T—a publicly available, inspectable trillion-parameter model—claims to use fewer tokens than comparable US efficient models.

Industry · Breaking

Hugging Face and Mistral AI Named in TIME100 Most Influential Companies 2026

TIME100 names Hugging Face (top-10 AI) and Mistral AI among 2026's most influential companies—both positioned explicitly against closed-model lock-in.

Warp Terminal Goes Fully Open-Source Under AGPL v3 with Agent-First Contribution Model

Warp terminal open-sources its full Rust codebase under AGPL v3, hitting 40K stars in hours, with OpenAI as founding sponsor powering its Oz agent system.

DeepSeek V4 Released: 1.6T Parameters, 1M Context, Open-Source

DeepSeek V4 is out: 1.6T open-source parameters, 1M token context, 3.7× fewer FLOPs than V3.2, scoring 120/120 on Putnam 2025.

[Image: Dark terminal with three glowing AI agent nodes and an open-source star constellation representing Warp launch]

Tools · Notable

Warp Goes Fully Open-Source: AGPL v3, Agent-First Contribution Model

Warp's full Rust codebase lands on GitHub under AGPL v3 with an agent-first contribution model: Oz handles coding, you review the output. OpenAI sponsors.

May 1, 2026 · 2 min read

BAML: Schema-Aligned Parsing Enables Typed Prompts Across Any LLM Model

BAML's Schema-Aligned Parsing extracts typed structured outputs from any LLM, including DeepSeek-R1 and o1, regardless of native tool-calling support.

April 29, 2026 · 1 min read

OpenAI Open-Sources Realtime Voice React Component Under Apache 2.0

OpenAI's open-source realtime-voice-component React package lets developers add GPT-powered voice control to apps via narrow action definitions.

April 29, 2026 · 1 min read

Shimmy v1.9.0: Single 4.8MB Binary Runs All GPU Backends for Local LLM Inference

Shimmy v1.9.0 is a 4.8MB single-binary OpenAI-compatible local inference server that bundles all GPU backends and claims 142x size advantage over Ollama.

April 29, 2026 · 1 min read

DeepSeek V4 Flash on 2-bit GGUF: First Frontier-Quality Local Inference

Developers running DeepSeek V4 Flash with 2-bit selective GGUF via llama.cpp describe it as 'the first time I feel I have a frontier model running on my computer' — a milestone for local AI.

Google Gemma 4 Launches Under Apache 2.0: 31B Ranks #3 on LM Arena

Google DeepMind releases Gemma 4 under Apache 2.0: four models including a 31B dense ranking #3 globally on LM Arena and on-device E2B/E4B variants with audio, vision, and 256K context.

OpenAI Open-Sources Symphony: Project Boards as Agent Control Planes

OpenAI open-sources Symphony, an agent orchestration spec that uses project management boards as control planes for coding agents — replacing bespoke orchestration glue with a shared standard.

smol-audio Launches: Local Audio Model Fine-Tuning Notebook Collection

HuggingFace releases smol-audio: an open notebook collection covering local fine-tuning of Whisper, Parakeet, Voxtral, Audio Flamingo 3, and Dia-1.6B TTS — a complete local audio AI toolkit.

Unsloth Surpasses Microsoft in HuggingFace Followers, Enters Top 10 Orgs

Unsloth, the fine-tuning optimization library, has surpassed Microsoft on HuggingFace to enter the top 10 most-followed organizations — reflecting open-source momentum over Big Tech on the platform.

DS2API Exposes DeepSeek Web as OpenAI/Claude/Gemini-Compatible APIs

DS2API exposes DeepSeek Web as OpenAI/Claude/Gemini wire-compatible APIs — reverse-engineered, disclaimer-heavy, but a clear signal of demand for free API access to frontier-adjacent models.

April 27, 2026 · 1 min read

Qwen3.6-27B Released: Strongest Dense Local Model Under Apache 2.0

Qwen3.6-27B drops quietly under Apache 2.0: AAII score 46, optimized for M-series local inference, strong agentic coding — the best dense local model available.

April 27, 2026 · 1 min read

[Image: Two precision modular stacks interlock on obsidian platform, cool-teal scatter, engraved 1M label]

Technology · Notable

DeepSeek-V4 and Kimi-K2.6 Shift the Open-Weights Agentic Baseline

DeepSeek-V4's MIT-licensed 1M-context MoE and Kimi-K2.6's multimodal orchestration create the first complete open-weights agentic deployment stack.

April 27, 2026 · 2 min read

Decepticon: Open-Source Autonomous AI Red-Team Agent Goes Public

Decepticon is an open-source autonomous AI red-team agent that reasons through attack paths and tests business logic—governed by strict scope, isolation, and logging requirements.

NVIDIA Ising: AI Models Compress Quantum Calibration from Days to Hours

NVIDIA Ising uses AI to shrink quantum computer calibration from days to hours and provides 3D neural network error correction faster than existing open-source methods.

Qwen3.6-35B Distilled from Claude Opus 4.6 Runs Locally at 13 GB RAM

Qwen3.6-35B-A3B distilled from Claude Opus 4.6 reasoning traces runs full agentic workflows locally at 13 GB in 2-bit mode—raising significant provider ToS questions.

RF-DETR Sets COCO Real-Time Detection SOTA at ICLR 2026, Apache 2.0

RF-DETR (ICLR 2026) sets new COCO detection SOTA; the Apache 2.0-licensed RF-DETR-L outperforms YOLO26-X on accuracy, ending AGPL lock-in for production CV.

Xiaomi MiMo 2.5 Pro: Tied #1 Open-Source Agentic Model

Xiaomi MiMo 2.5 Pro ties #1 on Artificial Analysis and autonomously built a full desktop video editor in 11.5 hours—open-source release incoming.

Rust-Native Spark Replacement Sail Claims 4× Faster, 94% Cost Reduction

lakehq/sail: Rust-native Spark replacement built on DataFusion and Arrow hits 4× faster TPC-H, 94% cost reduction, zero shuffle spill — PySpark code runs unchanged.

April 25, 2026 · 1 min read

llama.cpp Hits 100K Stars; Creator Predicts 90% of Agents Will Run Locally

llama.cpp hits 100K GitHub stars; creator @ggerganov predicts 90% of AI agents will run locally within 3–6 months as local model quality crosses the agentic threshold.

April 25, 2026 · 1 min read

DeepSeek V4-Pro Open-Sourced with 10x KV Cache Reduction

DeepSeek V4-Pro open-sourced with 1.6T params, 1M context window, and 10x KV cache reduction vs V3.2 — #1 HuggingFace trending in 43 minutes.

April 24, 2026 · 1 min read

Moonshot Ships Kimi K2.6: 300-Agent OSS Coding Model at $0.60/M Tokens

Moonshot's Kimi K2.6 runs 300 parallel sub-agents for 12+ hours autonomously at $0.60/M input tokens — open-weight, HuggingFace-available.

April 24, 2026 · 1 min read

Meta Releases Sapiens2: Vision Transformers Pretrained on 1B Human Images

Meta releases Sapiens2 — high-res vision transformers pretrained on 1B human images for pose estimation, segmentation, depth, and point maps.

April 24, 2026 · 1 min read

LlamaIndex Open-Sources LiteParse: Fast PDF Parser With No VLMs or ML

LlamaIndex open-sources LiteParse—a VLM-free, ML-free PDF parser using a 6-step grid projection algorithm that handles tables and complex layouts fast.

TrackioApp Launches Free Local-First Trace Logging for AI Agents

TrackioApp launches free local-first trace logging for AI agents, offering lightweight observability with no cloud dependency or cost.

HuggingFace ml-intern: Autonomous Post-Training Agent Pushes GPQA 10% → 32% in Under 10 Hours

HuggingFace's ml-intern autonomously runs the full ML research-to-training loop, lifting GPQA from 10% to 32% on a 1.7B model in <10 hours and beating Codex on HealthBench by 60%.

OpenAI Releases Privacy Filter — Its First Open-Weight Model of 2026

OpenAI's Privacy Filter — a 1.5B MoE model for PII detection under Apache 2.0 — is the company's first open-weight release of 2026, running on-device with 128k context.

Qwen3.6-27B: 27B Model Claims to Beat 397B MoE on All Coding Benchmarks

Qwen3.6-27B (Apache 2.0) claims to outperform the 397B Qwen3.5 MoE and Claude Opus 4.5 on coding benchmarks, running locally on 18GB RAM.

[Image: Green-glowing ML training terminal in darkness, reward curve ascending, no human present]

Tools

ml-intern: HuggingFace Releases a Full-Loop Autonomous Post-Training Agent

ml-intern reads arXiv, cleans datasets, runs SFT/GRPO, diagnoses failures, and iterates — pushing GPQA from 10% to 32% in under 10 hours for roughly $1 of compute.

April 23, 2026 · 2 min read

Technology

Qwen3.6-27B Surpasses a 397B Model on Coding Benchmarks

Alibaba's Apache 2.0 27B model outperforms Qwen3.5-397B-A17B on all major coding tasks and runs locally on 18 GB RAM — 'bye bye subscription era' claims are spreading.

April 23, 2026 · 2 min read