Machine Learning Research
Vision-Language, Compact and Open: Google releases Gemma 3 vision-language models with open weights
Google updated its open-weights family of large language models to include versions that handle image and video inputs.
Business
The United States Copyright Office determined that existing laws are sufficient to decide whether a given AI-generated work is protected by copyright, making additional legislation unnecessary.
Tech & Society
Large language models built by developers in China may, in some applications, be less useful outside that country because they avoid topics its government deems politically sensitive. A developer fine-tuned DeepSeek-R1 to widen its scope without degrading its overall performance.
Machine Learning Research
Most models that have learned to reason via reinforcement learning have been huge. A much smaller model now competes with them.
Machine Learning Research
Anthropic’s Claude 3.7 Sonnet implements a hybrid reasoning approach that lets users decide how much thinking they want the model to do before it renders a response.
Machine Learning Research
OpenAI launched GPT-4.5, which may be its last non-reasoning model.
Machine Learning Research
Typical large language models are autoregressive, predicting the next token, one at a time, from left to right. A new model hones all text tokens at once.
Machine Learning Research
Large language models can improve their performance by generating a chain of thought (CoT): intermediate text tokens that break down the process of responding to a prompt into a series of steps.
Business
Elon Musk and a group of investors made an unsolicited bid to buy the assets of the nonprofit that controls OpenAI, complicating the AI powerhouse’s future plans.
Machine Learning Research
Replit, an AI-driven integrated development environment, updated its mobile app so that it can generate other mobile apps on request.
Machine Learning Research
xAI’s new model family suggests that devoting more computation to training remains a viable path to building more capable AI.
Machine Learning Research
While Hangzhou’s DeepSeek flexed its muscles, Chinese tech giant Alibaba vied for the spotlight with new open vision-language models.