Machine Learning Research
Reasoning for No Reason: Anthropic finds chain-of-thought reasoning traces may omit key influences
Does a reasoning model’s chain of thought explain how it arrived at its output? Researchers found that often it doesn’t.
Machine Learning Research
Does a reasoning model’s chain of thought explain how it arrived at its output? Researchers found that often it doesn’t.
Science
The U.S. government is using AI to predict the paths of hurricanes.
Machine Learning Research
Reducing the number of bits used to represent each parameter in a neural network from, say, 16 bits to 8 bits shrinks the network’s size and boosts its speed. Researchers took this approach to an extreme: They built a competitive large language model whose weights are limited to three values.
Machine Learning Research
An agent designed for broad biological research could accelerate the work of scientists in specialties from anatomy to zoology.
Machine Learning Research
Apple revamped two vision-language models in a bid to catch up with fast-moving competitors.
Machine Learning Research
In Northern California, old property deeds may still include racial clauses: language, made illegal decades ago, that was designed to ban people of color from owning or living in certain homes.
Machine Learning Research
OpenAI launched o3-pro, a more capable version of its most advanced reasoning vision-language model.
Machine Learning Research
Researchers reduced the number of tokens needed to represent video frames to be fed to a transformer.
Machine Learning Research
Same character, new background, new action. That’s the focus of the latest text-to-image models from Germany’s Black Forest Labs.
Machine Learning Research
Researchers identified a simple way to mislead autonomous agents based on large language models.
Machine Learning Research
DeepSeek updated its groundbreaking DeepSeek-R1 large language model to strike another blow for open-weights performance.
Machine Learning Research
DeepSeek made headlines late last year, when it built a state-of-the-art, open-weights large language model at a cost far lower than usual. The upstart developer shared new details about its method.