
Data Points
Anthropic releases Claude 3.7 Sonnet as a hybrid reasoning model; DeepSeek’s FlashMLA is its first entry in OpenInfra week
Figure’s Helix vision-language-action robotics model. Google fine-tunes its own family of open VL models. SuperGPQA may be the most challenging general knowledge test yet. Meta creates new framework to evaluate agentic LLMs.