GPT-5.4 Makes A Splash, AI’s Growth on Mobile, Data Centers Go Off-Grid, Apple’s Diffusion Research
The Batch AI News and Insights: Should there be a Stack Overflow for AI coding agents to share their learnings with each other?
The Batch AI News and Insights: Should there be a Stack Overflow for AI coding agents to share their learnings with each other?
Meta’s purchase of Moltbook, a social network for AI agents. Iran’s attacks on data centers in the UAE and elsewhere. Google’s new multimodal embedding model. Grammarly’s AI ventriloquism using famous writers.
Microsoft’s new open-weights vision reasoning model. Yuan 3.0 Ultra, a document-retrieving juggernaut. OpenAI’s hardware leader’s resignation. Black Forest’s new training method for image models.
I’m thrilled to announce Context Hub, a new tool to give to your coding agents the API documentation they need to write correct code.
LLMs have achieved gold-medal performance in math competitions. An agentic system showed strength in mathematical research as well.
Managers need to understand how their subordinates get work done, what resources they require, and what they accomplish. OpenAI’s latest product aims to fulfill this need when the teammates are AI agents.
OpenAI signed a contract with the U.S. military to provide AI systems that securely process classified information, displacing Anthropic’s Claude. OpenAI negotiated limits on how its technology can be used, but they leave room for interpretation.
Google launched a cheaper, faster successor to its flagship image generator, delivering greater interactivity at roughly half the price.
The Batch AI News and Insights: I’m thrilled to announce Context Hub, a new tool to give to your coding agents the API documentation they need to write correct code.
Claude’s expanded memory feature, including imports. GPT-5.3 Instant, a terse update to ChatGPT. How LLMs can unmask pseudonymous online users. Alibaba’s Qwen leadership losses soon after 3.5 release.
Nano Banana 2, Google’s fast, powerful image generator. GitHub Copilot’s multi-model and security scanning updates. Perplexity Computer, a premium multipurpose agent. OpenAI’s multibillion-dollar partnership with Amazon.
We just released a Skill Builder tool to help you understand in which areas of AI you’re strong, where you can learn more, and what to do next to keep building your skills.
Machine Learning Research
Projected demand for output from large language models is spurring a massive buildout of data centers. Researchers asked whether smaller models running on local devices could meaningfully lighten that load.
Business
Makers of software that runs large companies saw their share prices plunge as investors worried that AI systems could undermine their businesses. This week, their stocks rebounded somewhat as Anthropic partnered with some of the same companies.
Business
The fourth global AI summit marked a decisive shift from focusing on theoretical hazards to spreading AI’s benefits throughout the world.
Machine Learning Research
Google updated its flagship Gemini model, topping several benchmarks while undercutting competitors on performance per dollar.
The Batch Newsletter
The Batch AI News and Insights: We just released a Skill Builder tool to help you understand in which areas of AI you’re strong, where you can learn more, and what to do next to keep building your skills.
Data Points
Remote Control, a tool to vibe-code from your phone. Anthropic’s charge that DeepSeek and others distilled its models. Qwen 3.5’s suite of medium MoE models. Protenix-v1, Bytedance’s open-source take on AlphaFold 3.
Data Points
Why some AI models give worse answers to non-native English speakers. First Proof, a frontier math challenge OpenAI may have partially solved. LLaDA2.1’s RL framework for diffusion language models. Anthropic’s study suggesting it’s better to let agents cook.
Machine Learning Research
Difficulty sleeping often precedes heart disease, psychiatric disorders, and many other illnesses. Researchers used data gathered during sleep studies to detect such conditions.
Machine Learning Research
Reasoning models in the 1 to 2 billion-parameter range typically require more than 1 gigabyte of RAM to run. Liquid AI released one that runs in less than 900 megabytes, and does it with exceptional speed and efficiency.
Letters
Will AI create new job opportunities? My daughter Nova loves cats, and her favorite color is yellow.
Business
Top tech and AI companies spent more than $100 million to influence government policy in 2025, the first time they exceeded that figure.
Machine Learning Research
Z.ai more than doubled the size of its flagship large language model to deliver outstanding performance among open-weights competitors.