
Machine Learning Research
OpenAI’s GPT-4.5 Goes Big: OpenAI releases GPT-4.5, its most powerful non-reasoning model and maybe its last
OpenAI launched GPT-4.5, which may be its last non-reasoning model.
Machine Learning Research
OpenAI launched GPT-4.5, which may be its last non-reasoning model.
Machine Learning Research
Typical large language models are autoregressive, predicting the next token, one at a time, from left to right. A new model hones all text tokens at once.
Data Points
Mercury debuts diffusion language models. Alibaba’s top video model is now free to download. A new model from Tencent is built for speed. IBM’s Granite 3.2 models are built for business.
Letters
The Voice Stack is improving rapidly. Systems that interact with users via speaking and listening will drive many new applications.
The Batch Newsletter
The Batch AI News and Insights: The Voice Stack is improving rapidly. Systems that interact with users via speaking and listening will drive many new applications.
Machine Learning Research
Although large language models can improve their performance by generating a chain of thought (CoT) — intermediate text tokens that break down the process of responding to a prompt into a series of steps.
Tech & Society
A viral deepfake video showed media superstars who appeared to support a cause — but it was made without their participation or permission.
Business
Top AI companies announced plans to dramatically ramp up their spending on AI infrastructure.
Science
To date, efforts to decode what people are thinking from their brain waves often relied on electrodes implanted in the cortex. New work used devices outside the head to pick up brain signals that enabled an AI system, as a subject typed, to accurately guess what they were typing.
Data Points
Figure’s Helix vision language action robotics model. Google fine-tunes its own family of open VL models. SuperGPQA may be the most challenging general knowledge test yet. Meta creates new framework to evaluate agentic LLMs.
Data Points
Researchers develop a highly capable language diffusion model. Google’s hypothesis-making agent is your new research partner. A new family of vision-language models for low-resource devices. HP will brick Humane’s Ai Pin and repurpose its tech for new devices.
Letters
Last month, a drone from Skyfire AI was credited with saving a police officer’s life after a dramatic 2 a.m. traffic stop.