Letters
When to Fine-Tune — and When Not To: Many teams that fine-tune their models would be better off prompting or using agentic workflows. Here's how to decide.
Fine-tuning small language models has been gaining traction over the past half year.
The Batch Newsletter
The Batch AI News and Insights: Fine-tuning small language models has been gaining traction over the past half year.
Data Points
Nvidia’s Nemotron adds reasoning to Llama models. Does ChatGPT make frequent users more lonely? OpenAI’s o1-pro costs a pretty penny. Mistral Small 3.1 gives Gemma 3 27B some competition.
Data Points
Nvidia gives Project DIGITS a new name. AI models compete to build Minecraft items. Claude chatbot now includes search. A Moore’s law-like regularity for AI agents.
The Batch Newsletter
The Batch AI News and Insights: Last Friday on Pi Day, we held AI Dev 25, a new conference for AI developers.
Letters
Last Friday on Pi Day, we held AI Dev 25, a new conference for AI developers.
Science
Materials that have specific properties are essential to progress in critical technologies like solar cells and batteries. A machine learning model designs new materials to order.
Business
The United States Copyright Office determined that existing laws are sufficient to decide whether a given AI-generated work is protected by copyright, making additional legislation unnecessary.
Machine Learning Research
An AI agent synthesizes novel scientific research hypotheses. It's already making an impact in biomedicine.
Machine Learning Research
Multilingual AI models often suffer uneven performance across languages, especially in multimodal tasks. A pair of lean models counters this trend with consistent understanding of text and images across major languages.
Data Points
Google’s two new Gemini vision-language-action robotics models. Cohere’s Command A, another lightweight LLM. New China regulations require labels for AI content. Monitoring reasoning models for reward hacking or unwanted behavior.
Data Points
OpenAI’s new SDK and APIs for agentic workflows. OlympicCoder, two powerful open coding models. Alibaba applies RL to emotion detection. GPT-4.5 and Claude 3.7 Sonnet top a new agent leaderboard.