Tech & Society
Amazon’s Next-Gen Voice Assistant: Alexa+ adds generative AI and agents, using Claude and other models
Amazon announced Alexa+, a major upgrade to its long-running voice assistant.
Machine Learning Research
Anthropic’s Claude 3.7 Sonnet implements a hybrid reasoning approach that lets users decide how much thinking they want the model to do before it renders a response.
Machine Learning Research
OpenAI launched GPT-4.5, which may be its last non-reasoning model.
Machine Learning Research
Typical large language models are autoregressive: they predict tokens one at a time, from left to right. A new model refines all text tokens at once.
Data Points
Mercury debuts diffusion language models. Alibaba’s top video model is now free to download. A new model from Tencent is built for speed. IBM’s Granite 3.2 models are built for business.
Letters
The Voice Stack is improving rapidly. Systems that interact with users via speaking and listening will drive many new applications.
The Batch Newsletter
The Batch AI News and Insights: The Voice Stack is improving rapidly. Systems that interact with users via speaking and listening will drive many new applications.
Machine Learning Research
Large language models can improve their performance by generating a chain of thought (CoT): intermediate text tokens that break down the process of responding to a prompt into a series of steps.
Tech & Society
A viral deepfake video showed media superstars who appeared to support a cause — but it was made without their participation or permission.
Business
Top AI companies announced plans to dramatically ramp up their spending on AI infrastructure.
Science
To date, efforts to decode what people are thinking from their brain waves have often relied on electrodes implanted in the cortex. New work used devices outside the head to pick up brain signals, enabling an AI system to accurately guess what a subject was typing as they typed.
Data Points
Figure’s Helix vision-language-action robotics model. Google fine-tunes its own family of open VL models. SuperGPQA may be the most challenging general knowledge test yet. Meta creates a new framework to evaluate agentic LLMs.