Data Points
U.S. chatbot use passes 50 percent: AA-Briefcase benchmark measures knowledge work
ARD, an open spec for discovery. North Mini Code gains traction. Security experts criticize U.S. government. Apple Intelligence, beyond Siri.
A concise guide to the week in AI. Everything you need to know that isn't featured in The Batch.
Data Points
ARD, an open spec for discovery. North Mini Code gains traction. Security experts criticize U.S. government. Apple Intelligence, beyond Siri.
Data Points
OpenRouter’s model mix-and-match. Subject expertise trumps software skills. OpenAI loses share to Google, Anthropic. Google ruled liable for AI mistakes.
Data Points
Claude Fable 5 no longer silently degrades. Hermes Agent maker streamlines setup. Agents’ Last Exam pushes top models. Gemini-SQL2 translates database queries.
Data Points
Google’s voice translation model covers 70+ languages. OpenAI’s preliminary public-offering paperwork. NotebookLM, now powered by Gemini 3.5 agents. FrontierCode, a new code-quality benchmark from Cognition.
Data Points
The first working vaccine built by AI. Kimi CLI, Moonshot’s software engineering agent. The White House’s plans for an OpenAI stake. OpenJarvis, an open-source agent that learns on-device.
Data Points
How agents think about search. Hermes now a multi-platform desktop app. Qwen3.7-Plus, Alibaba’s midsized cloud model. OpenAI’s latest plugins for Codex.
Data Points
MiniMax M3, the new open-weights champ. Nvidia’s PC superchips. Cosmos 3, an all-modal world/action model. Nvidia’s latest robotics partnerships.
Data Points
DeepSeek’s permanent V4 price cuts. MAI-Image-2.5, currently third on the Arena Leaderboard. Mythos-1’s remarkable security skills. How MCP will change later this year
Data Points
Nvidia’s Gated DeltaNet-2, its latest attention alternative. Trump calls off executive order regulating U.S. AI models. Microsoft’s benchmark-topping suite of computer use agents. Cohere’s Command-A+, a local alternative to big cloud AI.
Data Points
Google’s remade Antigravity, an alternative to IDEs. Omni Flash, Gemini’s voice-to-video generator. Google’s AI overhaul of web search. Corti’s Symphony, a specialist in medical transcription.
Data Points
Europe’s new media licensing initiative. ArXiv’s defenses against AI-assisted mistakes. The Mythos effect on financial institutions. Ukraine’s evolving drone and data strategy.
Data Points
GitHub’s new Copilot pricing. Baidu’s smaller, faster flagship model. Google’s rethinking of mathematical research. RL Conductor’s orchestration of AI agents.