
Data Points
The latest in AI from Feb. 29 to Mar. 6, 2024
This week's top AI news and research stories featured Mistral's new LLMs, a robot chemist, Google's open source LLMs, and a way to make LLMs better at math. But first:
The Batch Newsletter
The Batch AI News and Insights: Progress on LLM-based agents that can autonomously plan out and execute sequences of actions has been rapid, and I continue to see month-over-month improvements.
Tech & Society
Google asserted its open source bona fides with new models. Google released weights for Gemma-7B, an 8.5 billion-parameter large language model intended to run on GPUs, and Gemma-2B, a 2.5 billion-parameter version intended for deployment on CPUs and edge devices.
Science
A robot outperformed human chemists at synthesizing chemicals. Researchers at the University of Amsterdam built RoboChem, an integrated robotic system that learned to design light-activated chemical reactions while achieving optimal yields and throughput.
Tech & Society
European AI champion Mistral AI unveiled new large language models and formed an alliance with Microsoft.
Machine Learning Research
Reinforcement learning from human feedback (RLHF) is widely used to fine-tune pretrained models to deliver outputs that align with human preferences. New work aligns pretrained models without the cumbersome step of reinforcement learning.
Machine Learning Research
Language models that are equipped for retrieval-augmented generation can retrieve text from a database to improve their output. Further work extends this capability to retrieve information from any application that comes with an API.
Machine Learning Research
Pruning weights from a neural network makes it smaller and faster, but it can take a lot of computation to choose weights that can be removed without degrading the network’s performance.
Tech & Society
OpenAI is focusing on autonomous agents that take action on a user’s behalf. The maker of ChatGPT is developing applications designed to automate common digital tasks by controlling apps and devices, The Information reported.
Hardware
An upstart chip company dramatically accelerates pretrained large language models. Groq offers cloud access to Meta’s Llama 2 and Mistral.ai’s Mixtral at speeds an order of magnitude greater than other AI platforms. Registered users can try it.
Tech & Society
An update of Google’s flagship multimodal model keeps track of colossal inputs, while an earlier version generated some questionable outputs.
Data Points
This week's top AI news and research stories featured Google's troubled Gemini launch, OpenAI's next act, Groq's blazing inference speed, and a method for faster network pruning. But first: