Machine Learning Research
Free Agents: OpenHands launches as an open toolkit for advanced code generation and automation
An open source package inspired by the commercial agentic code generator Devin aims to automate computer programming and more.
Machine Learning Research
An open source package inspired by the commercial agentic code generator Devin aims to automate computer programming and more.
Tech & Society
Some voters navigated last week’s United States elections with help from a large language model that generated output based on verified, nonpartisan information.
Tech & Society
Two top AI companies changed their stances on military and intelligence applications.
Machine Learning Research
A new open source large language model outperforms competitors, including the open-weights Llama 3.1 405B, on a variety of benchmarks.
Data Points
FrontierMath’s hard math problems baffle models. Nvidia partners with Hugging Face’s robotics platform. Mistral offers multilingual text moderation. Grok teases free access for New Zealand’s X users.
Data Points
Oasis builds interactive Minecraft-style games in real-time. Microsoft releases system to coordinate AI agents. OpenAI’s new predicted outputs feature speeds up generation. GitHub launches Spark, a platform to build and host micro-apps.
Tech & Society
Shipping ports are the latest front in the rising tension between labor unions and AI-powered automation.
Letters
Trump and the Republican party chalked up huge wins this week. Did manipulation of social media by generative AI play any role in this election?
The Batch Newsletter
The Batch AI News and Insights: Trump and the Republican party chalked up huge wins this week. Did manipulation of social media by generative AI play any role in this election?
Machine Learning Research
Coding agents are improving, but can they tackle machine learning tasks?
Tech & Society
A new study suggests that leading AI models may meet the requirements of the European Union’s AI Act in some areas, but probably not in others.
Machine Learning Research
API commands for Claude Sonnet 3.5 enable Anthropic’s large language model to operate desktop apps much like humans do. Be cautious, though: It’s a work in progress.