Machine Learning Research
More Factual LLMs: FactTune, a method to fine-tune LLMs for factual accuracy without human feedback
Large language models sometimes generate false statements. New work makes them more likely to produce factual output.
Machine Learning Research
Large language models sometimes generate false statements. New work makes them more likely to produce factual output.
Science
Scientists pledged to control their use of AI to produce potentially hazardous biological materials.
Hardware
Nvidia’s latest chip promises to boost AI’s speed and energy efficiency.
Tech & Society
Microsoft took over most of the once high-flying chatbot startup Inflection AI in an unusual deal.
Machine Learning Research
Humanoid robots can play football (known as soccer in the United States) in the real world, thanks to reinforcement learning.
Tech & Society
The United States military is using computer vision to target enemy positions in the Red Sea and elsewhere.
Science
Researchers used an AI system to identify animal cell types from gene sequences, including a cell type that conventional approaches had discovered only in the past year.
Tech & Society
AI agents are typically designed to operate a particular software environment. Recent work enabled a single agent to take actions in a variety of three-dimensional virtual worlds.
Letters
Last week, I described four design patterns for AI agentic workflows that I believe will drive significant progress this year: Reflection, Tool use, Planning and Multi-agent collaboration.
The Batch Newsletter
The Batch AI News and Insights: Last week, I described four design patterns for AI agentic workflows that I believe will drive significant progress this year...
Data Points
This week's top AI news and research stories featured an agent for many environments, an AI system to identify animal cell types from gene sequences, a system that analyzes satellite and geolocation data that has been used to identify targets in real-world conflicts...
Machine Learning Research
Research aims to help users select large language models that minimize expenses while maintaining quality.