Machine Learning Research
The Year AI Went Industrial: The State of AI Report 2025 says AI’s barriers aren’t technological but social and material
A year-in-review report heralds the dawn of AI’s industrial era.
Machine Learning Research
A year-in-review report heralds the dawn of AI’s industrial era.
Machine Learning Research
A new image generator reasons over prompts to produce outstanding pictures.
Machine Learning Research
Large language models often memorize details in their training data, including private information that may appear only once, like a person’s name, address, or phone number. Researchers built the first open-weights language model that’s guaranteed not to remember such facts.
Machine Learning Research
An open-weights model from Shanghai-based MiniMax challenges top proprietary models on key benchmarks for coding and agentic tasks.
Machine Learning Research
For decades, AI developers have treated the web as an open faucet of training data. Now publishers are shutting the tap. Will web data dry up?
Machine Learning Research
Honing an agent’s prompt can yield better results than fine-tuning the underlying large language model via reinforcement learning.
Machine Learning Research
The ability to easily connect large language models to tools and data sources has made Model Context Protocol popular among developers, but it also opens security holes, research shows.
Machine Learning Research
Reasoning models typically learn to undertake a separate process of “thinking” through their output of before they produce final response. Ant Group built a top non-reasoning model that can take similar steps as part of its immediate response.
Machine Learning Research
Robot control systems that accept only text input struggle to translate words into motions in space. Researchers developed a system that enables robots to plan spatial paths before they execute text instructions.
Machine Learning Research
The first offering from Thinking Machines Lab, the startup founded by former OpenAI CTO Mira Murati, aims to simplify — and democratize — the process of fine-tuning AI models.
Machine Learning Research
DeepSeek’s latest large language model can cut inference costs by more than half and processes long contexts dramatically faster relative to its predecessor.
Machine Learning Research
The approach known as LoRA streamlines fine-tuning by training a small adapter that modifies a pretrained model’s weights at inference. Researchers built a model that generates such adapters directly.