Machine Learning Research
Computer Use Gains Momentum: OpenAI’s Operator automates online tasks with a new AI agent
OpenAI introduced an AI agent that performs simple web tasks on a user’s behalf.
Machine Learning Research
Reinforcement learning is emerging as an avenue for building large language models with advanced reasoning capabilities.
Data Points
Claude’s Citations API makes it easier to track your sources. Browser Use challenges Computer Use, for free. How game developers both adopt and fear AI. Hunyuan’s new open model builds 3D assets with textures.
Data Points
ByteDance’s Doubao promises GPT-4o performance at cut-rate prices. Perplexity debuts a new API grounded in web search. Hugging Face’s SmolVLM gets even smaller. Benchmark-maker Epoch AI and OpenAI criticized for keeping funding deal under wraps.
Letters
The Batch AI News and Insights: Greetings from Davos, Switzerland! Many business and government leaders are gathered here again for the annual World Economic Forum to discuss tech, climate, geopolitics, and economic growth.
Machine Learning Research
Designing integrated circuits typically requires years of human expertise. Recent work set AI to the task with surprising results.
Tech & Society
Lawmakers in the U.S. state of Texas are considering stringent AI regulation.
Hardware
Chinese robot makers Unitree and EngineAI showed off relatively low-priced humanoid robots that could bring advanced robotics closer to everyday applications.
Machine Learning Research
A new open model rivals OpenAI’s o1, and it’s free to use or modify.
Data Points
Codestral matches or beats mid-sized fill-in-the-middle models. MiniMax’s “Lightning Attention” tries to improve on the transformer. Co-STORM now available for AI collaboration on encyclopedia articles. LlamaIndex uses agentic RAG to retrieve info from complex documents.
Data Points
Moondream’s lightweight vision model adds gaze detection. Fine-tuning Flux Pro’s generative model with just a few images. Copilot Chat brings simple agents to Microsoft 365. Google adds Gemini to all its Workspace plans.