Machine Learning Research
Joseph Gonzalez: General intelligence
In 2025, I expect progress in training foundation models to slow down as we hit scaling limits and inference costs continue to rise.
Machine Learning Research
For years, the best AI models got bigger and bigger. But in 2024, some popular large language models were small enough to run on a smartphone.
Business
Fierce competition among model makers and cloud providers drove down the price of access to state-of-the-art models.
Machine Learning Research
How do agents based on large language models compare to human experts when it comes to proposing machine learning research? Pretty well, according to one study.
Machine Learning Research
Google’s Gemini 2.0 Flash, the first member of its updated Gemini family of large multimodal models, combines speed with performance that exceeds that of its earlier flagship model, Gemini 1.5 Pro, on several measures.
Machine Learning Research
Microsoft updated its smallest model family with a single, surprisingly high-performance model.
Machine Learning Research
Large language models that remember more hallucinate less.
Machine Learning Research
OpenAI launched not only its highly anticipated o1 model but also an operating mode that enables the model to deliver higher performance — at a hefty price.
Machine Learning Research
Mistral AI unveiled Pixtral Large, which rivals top models at processing combinations of text and images.
Business
One of the world’s biggest payment processors is enabling large language models to spend real money.
Business
Amazon and Anthropic expanded their partnership, potentially strengthening Amazon Web Services’ AI infrastructure and lengthening the high-flying startup’s runway.
Machine Learning Research
An up-and-coming Hangzhou AI lab unveiled a model that implements run-time reasoning similar to OpenAI o1 and delivers competitive performance. Unlike o1, it displays its reasoning steps.