Machine Learning Research - The Batch | DeepLearning.AI (Page 5)

Bar chart comparing performance of Qwen3 models against others in diverse tasks, highlighting Qwen3-Max.

Machine Learning Research

Qwen3 Goes Big (and Smaller): Alibaba expands Qwen3 family with a 1 trillion-parameter Max model, open-weights Qwen3-VL, and the Qwen3-Omni voice model

Alibaba rounded out the Qwen3 family with its biggest large language model to date as well as smaller models that process text, images, video, and/or audio.

Comparison table highlighting Claude Sonnet 4.5's top scores in coding and reasoning benchmarks, featuring improved capabilities.

Machine Learning Research

Claude Levels Up: Anthropic launches Claude Sonnet 4.5 and the Claude Agent SDK, and overhauls Claude Code for developers

Anthropic updated its mid-size Claude Sonnet model, making it the first member of the Claude family to reach version 4.5. It also enhanced the Claude Code agentic coding tool with long-desired features.

Image illustrates data flow from raw satellite sources through processing to embeddings for climate tracking.

Machine Learning Research

Earth Modeled in 10-Meter Squares: Google’s AlphaEarth Foundations tracks the whole planet’s climate, land use, potential for disasters, in detail and at scale

Researchers built a model that integrates satellite imagery and other sensor readings across the entire surface of the Earth to reveal patterns of climate, land use, and other features.

Electron microscope image of bacteriophages with distinct hexagonal heads and tails on a gray background.

Machine Learning Research

AI Generates Viral Genomes: Researchers use genomic language models to create custom viruses

Researchers used AI models to create novel viruses from scratch.

Flowchart shows data reordering, probability sampling, and effective gradient updating in reinforcement learning.

Machine Learning Research

Faster Reinforcement Learning: New technique auto-selects training examples to speed up fine-tuning

Fine-tuning large language models via reinforcement learning is computationally expensive, but researchers found a way to streamline the process.

Chart details ChatGPT conversations. Writing (28.1%), info-seeking (21.3%), and guidance (28.3%) lead.

Machine Learning Research

What ChatGPT Users Want: ChatGPT users now more likely to be young, female, and seeking info, study shows

What do ChatGPT’s 700 million weekly active users do with it? OpenAI teamed up with a Harvard economist to find out.

Central AI agent icon links to merchant, cart, and payment symbols, illustrating agentic payments process.

Business

Agents of Commerce: Google’s AP2 gives developers new tools to build agentic payments

Google launched an open protocol for agentic payments that enables agents based on any large language model to purchase items over the internet.

Energy-Based Transformer refines predictions step by step, lowering energy for higher context compatibility.

Machine Learning Research

Transformers Energized: Energy-Based Transformers (EBTs) use gradient descent to gradually predict the next token

A new type of transformer can check its work. Instead of guessing the next output token in one shot like a typical transformer, it starts with a rough version of the token and improves it step by step.

Diagram of Qwen3-Next architecture with Mixture of Experts, Gated Attention, and Gated DeltaNet layers.

Machine Learning Research

Qwen3-Next Accelerates: Alibaba’s new model uses hybrid attention layers and a sparse MoE architecture for speed and performance

Alibaba updated its popular Qwen3 open-weights models with a number of fresh, speed-boosting tweaks.

Diagram comparing sliding window attention and ATLAS memory, showing wider context tracking in ATLAS.

Machine Learning Research

10 Million Tokens of Input Context: ATLAS, a transformer-like architecture, can process a context window as large as ten million tokens

An alternative to attention enables large language models to track relationships among words across extraordinarily wide spans of text.

Charts showing PromptGuard 2 blocking attacks, AlignmentCheck detecting goal hijacking, and CodeShield finding insecure code.

Machine Learning Research

Cybersecurity for Agents: Meta releases LlamaFirewall, an open-source defense against AI hijacking

Autonomous agents built on large language models introduce distinct security concerns. Researchers designed a system to protect agents from common vulnerabilities.

Table comparing DINO, DINOv2, DINOv3, SigLIP 2, and PE on segmentation, depth estimation, tracking, and classification tasks.

Machine Learning Research

Better Image Processing Through Self-Supervised Learning: Meta’s DINOv3 gets an updated loss term and improved vision performance

DINOv2 showed that a vision transformer pretrained on unlabeled images could produce embeddings that are useful for a wide variety of tasks. Now it has been updated to improve the performance of its embeddings in segmentation and other vision tasks.