Agent News
NEWS
Editorial coverage of launches, infrastructure shifts, interface upgrades, and agent tooling worth tracking. Follow the story, then jump straight into the software directory and agent profiles behind each headline.
Loop Engineering: The Complete Guide to Building Self-Improving AI Agents
Stop prompting your coding agents one shot at a time. Here is how to design loops that prompt them for you—and when the extra complexity is worth it.
xAI Ships Grok Build 0.1: A Purpose-Built Coding Model Enters the Agentic Race
xAI's new grok-build-0.1 model is now available via API. 256K context, 100+ tok/s, and a 70.8% SWE-bench score. Here is what the benchmarks, reviews, and pricing actually say.
Claude Opus 4.8 Is Anthropic’s New Agent Benchmark, With One Clear Caveat
Anthropic’s Claude Opus 4.8 release is less about a new chat personality and more about long-running agent work: stronger SWE-bench Pro results, better tool use, 1M-token context, mid-conversation system messages, cheaper fast mode, and Claude Code dynamic workflows.
Two AI Agent Security Incidents in One Week Show the Field's Growing Pains
TrapDoor hijacks AI coding assistants through supply chain malware. Composio gets breached via an internal AI agent. Here's what happened and what to do.
Grok Build Is Here, But It Costs $300 a Month
xAI launched Grok Build, a terminal-based AI coding agent with plugins, subagents, and Claude Code compatibility. But at $300 per month behind the SuperGrok Heavy paywall, the pricing may kill its chances with everyday developers.

