The Batch | DeepLearning.AI

Image depicts GPU-hours usage, highlighting 82% on development, emphasizing pretraining and mid-data processes.

A Full Accounting of Models’ GPU Use: Researchers detail the energy and water needed to train a family of open weight models

Assessments of the environmental impact of large language models typically focus on their final training runs, but there’s a lot more to building AI systems.

Data center energy demand trend chart, showing planned capacity increase through 2040 with major tech investments.

Anthropic, OpenAI Fight for Compute: Anthropic and OpenAI both spend big to make deals for data center sites and hardware

Data center buildout plans reached a new order of magnitude as new partnerships form and old ones fade away in the search for capacity to train and deliver AI.

Analysis of 16,866 replayed cybersecurity attacks during active tailscale and source control phases with timelines.

OpenAI Models Hack Hugging Face: Inside the accidental cyberattack that compromised Hugging Face's systems

To measure how good its models were at hacking, OpenAI reduced guardrails and ran them against a benchmark’s problem set.

Table illustrates Opus 5 scoring 90.8% in agentic search, outperforming Fable 5 and Opus 4.8. Data comparison displayed.

Claude Debuts Another Opus: Anthropic addresses some (but not all) complaints about Fable with Claude Opus 5

After launching Claude Fable 5, the future of Anthropic’s once-flagship Opus line was uncertain, except as a fallback for the company’s premium models.

Fable 5's safeguards flag a request to do a security scan of OpenWorker

Open Models, Open Harnesses, Open Security: What we did when Claude Fable 5 and GPT-5.6 Sol refused our security requests

My team recently had our own version of Hugging Face’s experience when closed models failed to defend the company following an accidental cyberattack from OpenAI, leading Hugging Face to use the open weight GLM 5.2 instead.

Opus Outshines Even Fable, Inside the Hugging Face Hack, AI Companies Spend Big for Compute

The Batch News & Insights: My team recently had our own version of Hugging Face’s experience when closed models failed to defend the company following an accidental cyberattack from OpenAI, leading Hugging Face to use the open weight GLM 5.2 instead.

A man lowers himself on a rope into a data center

Kimi K3 weights are open (with an asterisk): A full timeline for the OpenAI/Hugging Face hack

Anthropic team uses Claude to break crypto algorithms. MAI-Cyber-1-Flash is Microsoft’s answer to big security models. Black Forest Labs adds robotic actions to FLUX 3. MCP moves to fully stateless architecture.

Diverse team collaborates on laptops in modern office with whiteboard flowchart, enhancing productivity and teamwork.

Claude Opus 5 is a workhorse: Big Tech agrees: The world needs open weight AI models

Independent benchmarks for Claude Opus 5. Task crossover, or using AI to complement your main job. The expenditure horizon, where agents cost less than humans. U.S. threatens sanctions, China threatens back.

Diagram showing research program, first article collection, then question generation, followed by model evaluation and analysis

Web Retrieval Flusters LLMs: AI agents searching online can struggle to retrieve correct info, researchers find

Large language models often are called upon to gather news. In this task, researchers found, their ability to find relevant reports is the weakest link.

Cloudflare diagram on "why the web is being crawled" showing 51.8 percent model training, 36.6 percent mixed use, and 8.6 percent search

Cloudflare’s Web Crawler Flare-Up: Cloudflare moves to block all AI training bots by default

Web publishers using Cloudflare will soon be able to separately control an AI bot’s access based on what it does, allowing search indexing while blocking AI training or agent activity.

Animated GIF of Meta's Muse Spark 1.1's performance on benchmarks, showing strong performance on tool use and roughly second-tier overall intelligence

Meta Sparks A Price War: Muse Spark 1.1 makes a jump in intelligence at a lower price than peers

With Llama, Meta marked itself as an open alternative to OpenAI. With its new closed models, Meta now positions itself as a low-cost, high-value competitor.

Kimi diagram of open frontier model size over time, with Moonshot AI's Kimi K3 at 2.8 trillion parameters, ahead of DeepSeek at 1.6T parameters

Kimi K3 Reveals How A Giant Frontier AI Model Works: Moonshot's latest model outshines all but GPT-5.6 Sol and Claude 5 Fable

Moonshot’s latest model leapfrogged the month-old GLM-5.2 and a host of proprietary competitors to finish just behind GPT-5.6 Sol and Claude Fable 5 on many benchmarks.

Weekly AI news for engineers, executives, and enthusiasts.

Latest