Machine Learning Research
Life After Llama: With Muse Spark, Meta pivots away from its open-weights Llama strategy
Meta pivoted from its open-weights strategy to deliver a closed alternative.
Machine Learning Research
Meta pivoted from its open-weights strategy to deliver a closed alternative.
Machine Learning Research
Simulating complex physical systems through traditional numerical methods is slow and expensive, and simulations based on machine learning are usually specialized for a specific type of system, such as water in a pipe or atmosphere surrounding a planet.
Science
An open-weights model could help scientists compare the impact of genetic variations, identify mutations that cause diseases, and develop treatments.
Machine Learning Research
Anthropic took unusual steps to prepare the world for a forthcoming large language model that it said poses extraordinary risks to cybersecurity.
Machine Learning Research
Large language models typically become less accurate and slower when they process longer contexts, but researchers enabled an LLM to keep accuracy stable and inference time constant as its context grew.
Machine Learning Research
Google added a music generator to Gemini and YouTube, putting a model that produces synthetic songs in front of hundreds of millions of users.
Machine Learning Research
The inner workings of the popular coding agent Claude Code are available for all to see.
Machine Learning Research
When processing long contexts, large language models often lose track of details or devolve into nonsense. Researchers reduced these effects by managing context externally.
Machine Learning Research
xAI launched a video generator that topped an independent quality ranking at a fraction of competitors’ prices.
Machine Learning Research
Nvidia, the dominant supplier of AI chips, released a competitive open-source large language model whose speed tops its size class — the first open-weights leader to come from the United States since last year, when Meta delivered Llama 4.
Machine Learning Research
Multimodal models typically use different tokenizers to embed different media types, and different encoders when training to generate media rather than classify it.
Business
DeepSeek, the Chinese developer of outstanding open-weights models, has withheld an upcoming update of its flagship model from U.S. chip makers, a move that intensifies the AI rivalry between the U.S. and China.