Machine Learning Research
Qwen3 Takes On DeepSeek-R1: Alibaba releases the Qwen3 family of open LLMs with optional reasoning
Alibaba’s new model family may unseat DeepSeek-R1’s four-month reign as the top open-weights large language model.
Machine Learning Research
Alibaba’s new model family may unseat DeepSeek-R1’s four-month reign as the top open-weights large language model.
Machine Learning Research
Large language models can improve systems that recommend items to purchase by inferring customer preferences.
Machine Learning Research
Google refreshed its experimental tools for composers and producers.
Machine Learning Research
ChatGPT’s image generator is available via API.
Machine Learning Research
Large language models excel at processing text but can’t interpret images, video, or audio directly without further training on those media types. Researchers devised a way to overcome this limitation.
Hardware
Hugging Face has made a name by providing open AI models. Now it’s providing an open robot.
Machine Learning Research
OpenAI refreshed its roster of models and scheduled the largest, most costly one for removal.
Machine Learning Research
Researchers built a model that’s more robust to noisy inputs like misspellings, smarter about character-level information like the number of R's in strawberry, and potentially better able to understand unfamiliar languages that might share groups of letters with familiar languages.
Business
OpenAI embraced Model Context Protocol, providing powerful support for an open standard that connects large language models to tools and data.
Machine Learning Research
Google’s new flagship model raised the state of the art in a variety of subjective and objective tests.
Machine Learning Research
If you have a collection of variables that represent, say, a cancer patient and you want to classify the patient’s illness as likely cancer or not, algorithms based on decision trees, such as gradient-boosted trees, typically perform better than neural networks.
Machine Learning Research
Alibaba’s latest open-weights system raises the bar for multimodal tasks in a relatively small model.