
Machine Learning Research
A Transformer Alternative Emerges: Mamba, a new approach that may outperform transformers
An architectural innovation improves upon transformers — up to 2 billion parameters, at least...
Machine Learning Research
An architectural innovation improves upon transformers — up to 2 billion parameters, at least...
Machine Learning Research
Pruning weights from a neural network makes it smaller and faster, but it can take a lot of computation to choose weights that can be removed without degrading the network’s performance.
Machine Learning Research
GPT-4V introduced a large multimodal model that generates text from images and, with help from DALL-E 3, generates images from text. However, OpenAI hasn’t fully explained how it built the system. A separate group of researchers described their own method.
Machine Learning Research
Images in the wild may not come with labels, but they often include metadata. A new training method takes advantage of this information to improve contrastive learning.
Science
You can reduce your model’s carbon emissions by being choosy about when and where you train it.
Tech & Society
Officials in charge of protecting children stopped using a machine learning model designed to help them make decisions in difficult cases. The U.S. state of Oregon halted its use of an algorithm intended to identify children who may benefit from intervention.
Machine Learning Research
Let’s say you have a pretrained language model and a small amount of data to fine-tune it to answer yes-or-no questions. Should you fine-tune it to classify yes/no or to fill in missing words — both viable approaches that are likely to yield different results?
Machine Learning Research
A reinforcement learning system enabled a four-legged robot to amble over unfamiliar, rapidly changing terrain.
Machine Learning Research
Researchers have used neural networks to generate novel views of a 3D scene based on existing pictures plus the positions and angles of the cameras that took them. In practice, though, you may not know the precise camera
Tech & Society
Social biases are well documented in decisions made by supervised models trained on ImageNet’s labels. But they also crept into the output of unsupervised models pretrained on the same dataset.
Machine Learning Research
Neural networks are famously bad at interpreting input that falls outside the training set’s distribution, so it’s not surprising that some models are certain that cat pictures show symptoms of Covid-19. A new approach won’t mistakenly condemn your feline to a quarantine.
Machine Learning Research
Human teachers can teach more effectively by adjusting their methods in response to student feedback. It turns out that teacher networks can do the same.