Agent News
Editorial coverage of launches, infrastructure shifts, interface upgrades, and agent tooling worth tracking. Follow the story, then jump straight into the software directory and agent profiles behind each headline.
OpenClaw 2026.6.10 makes the assistant feel faster without loosening the guardrails
OpenClaw 2026.6.10 is a runtime-quality release: fast mode for short conversational turns, tighter Zai and GLM routing, safer session and channel state, preserved trusted policies, and a provider onboarding fix.
Vercel Launches eve: The "Next.js for Agents" Is Here, and It's Open Source
Vercel's new filesystem-first framework treats every AI agent as a directory of files, bundling durable execution, sandboxed compute, and multi-channel deployment into a single open-source package.
OpenClaw 2026.6.9: 422 PRs of Telegram Delivery, Agent Recovery, and Codex Integration
OpenClaw's latest stable release improves Telegram HTML delivery, agent session recovery, and Codex plugin approvals, and makes provider plugins standalone npm packages.
Hermes Agent v0.17 pushes agent work beyond the terminal
Hermes Agent v0.17.0 expands the open-source agent runtime with iMessage via Photon, Raft, background subagents, image editing, dashboard profile building, automation templates, managed scope, and a broad security pass.
OpenClaw v2026.6.8 Released: 373 Commits of Richer Channels, Safer Routing, and Reliable Agents
OpenClaw ships v2026.6.8 with 185 merged PRs and 373 commits. Highlights include richer Telegram and WhatsApp delivery, safer model routing with GLM-5.2 and Claude Haiku 4.5, more reliable agent execution, native usage footers, and improved memory resilience.
Kimi-K2.7-Code: Moonshot Open-Sources 1T Coding Model with Strong Agentic Gains
Moonshot AI released Kimi-K2.7-Code today, an open-weight 1T-parameter MoE model showing +21.8% on Kimi Code Bench v2, +11.0% on Program Bench, and +31.5% on MLS Bench Lite over K2.6, with 30% lower reasoning token usage.
OpenClaw v2026.6.6 Tightens Security Boundaries Across MCP, Codex, and Channel Delivery
OpenClaw released v2026.6.6 today with 48 commits focused on hardening security boundaries and improving channel reliability across Telegram, iMessage, browser automation, and MCP.
xAI Opens the Grok Build Plugin Marketplace with MongoDB, Vercel, and Chrome DevTools at Launch
xAI turns its terminal-based coding agent into an extensible platform, shipping a built-in marketplace with plugins from six major vendors: MongoDB, Vercel, Sentry, Chrome DevTools, Cloudflare, and Superpowers.
OpenClaw v2026.6.5 Ships with Channel Hardening, Provider Fixes, and New CalVer Numbering
OpenClaw's first monthly patch under the new YYYY.M.PATCH scheme fixes QQBot reasoning leaks, Matrix voice and thread handling, Anthropic extended-thinking recovery, and MCP tool-result coercion, and moves auth and state into SQLite.
Hermes Agent v0.16.0: The Surface Release Puts a Native Desktop App in Your Hands
Nous Research shipped 874 commits, 542 merged PRs, and a brand-new Electron desktop app in one week. Hermes v0.16 adds a native macOS/Linux/Windows GUI, a full web admin panel, leaner default skills, a fuzzy model picker, /undo, and Simplified Chinese support.
xAI Ships Grok Build 0.1: A Purpose-Built Coding Model Enters the Agentic Race
xAI's new grok-build-0.1 model is now available via API, with 256K context, 100+ tok/s, and a 70.8% SWE-bench score. Here is what the benchmarks, reviews, and pricing actually say.
Nous Research Drops Hermes Desktop: A Native App for the Self-Improving AI Agent
The open-source Hermes Agent, already running in terminals, Discord servers, and Telegram chats, now has a polished native desktop client. Public preview is live for macOS, Windows, and Linux with streaming chat, side-by-side previews, voice, and full config portability.
MiniMax M3 Drops: Open-Weight Model With 1M Context, Frontier Coding Scores, and a Price Tag That Undercuts Closed Rivals
MiniMax shipped M3 on June 1, 2026 — the first open-weight model to combine frontier coding, a 1-million-token context window, and native multimodality. Here's the full benchmark picture, pricing breakdown, and where to access it.
Hermes Agent’s Velocity Release Turns the CLI Into a Multi-Agent Workbench
Hermes Agent v0.15.0, tagged v2026.5.28, is less about one flashy feature than a broader shift: a smaller agent core, stronger Kanban orchestration, faster local recall, promptware defenses, Bitwarden secrets, and a larger plugin surface.
Claude Opus 4.8 Is Anthropic’s New Agent Benchmark, With One Clear Caveat
Anthropic’s Claude Opus 4.8 release is less about a new chat personality and more about long-running agent work: stronger SWE-bench Pro results, better tool use, 1M-token context, mid-conversation system messages, cheaper fast mode, and Claude Code dynamic workflows.
OpenClaw 2026.5.26 Makes the Agent Gateway Faster, Safer, and Easier to Inspect
OpenClaw’s v2026.5.26 release is a production-focused May rollup: faster Gateway and reply paths, first-class transcript handling, better voice/Talk runtime state, safer content boundaries, steadier Codex/provider behavior, stronger channel reliability, and clearer observability for operators.
OpenClaw v2026.5.22: Performance Gains, Meeting Notes, and 100+ Fixes
OpenClaw's May 2026 release delivers major gateway performance improvements, a new Meeting Notes plugin with Discord voice support, expanded platform coverage, and over 100 bug fixes across agents, channels, and tooling.
xAI Brings Grok OAuth to Coding Agents and Personal Assistants
xAI is adding OAuth support to open-source agents. Your X Premium or SuperGrok subscription now works inside Hermes, OpenClaw, and OpenCode with a single login.
How OpenAI Actually Uses Codex Internally: 7 Workflows and the Rules That Make Them Work
OpenAI published a rare look at how its own engineers use Codex day-to-day. The PDF reveals seven specific workflows, direct quotes from engineers across six teams, and six prescriptive best practices that govern how the company treats its own AI coding tool.
Cursor Composer 2.5 Hits 63.2% on CursorBench for Just $0.55 Per Task
Cursor shipped Composer 2.5 with major intelligence and behavior improvements. It scores 63.2% on CursorBench 3.1 at an average cost of $0.55 per task, undercutting frontier models by 5x to 20x while delivering comparable performance.
Google Ships Gemini 3.5 Flash, Kills Gemini CLI, and Triples the Price
Google launched Gemini 3.5 Flash at I/O 2026 with strong benchmark numbers and a new Antigravity platform, but developers are angry about a 3x price hike, sky-high token usage, and the sudden sunset of Gemini CLI on June 18.
Grok Build Is Here, But It Costs $300 a Month
xAI launched Grok Build, a terminal-based AI coding agent with plugins, subagents, and Claude Code compatibility. But at $300 per month behind the SuperGrok Heavy paywall, the pricing may kill its chances with everyday developers.

