
Tech & Society
Benchmarks for Industry: Vals AI evaluates large language models on industry-specific tasks.
How well do large language models respond to professional-level queries in various industry domains? A new company aims to find out.
Tech & Society
Anthropic announced a suite of large multimodal models that set new states of the art in key benchmarks.
Business
Weeks after it announced a huge partnership deal with Amazon, Anthropic doubled down on its earlier relationship with Alphabet.
Business
Amazon cut a multibillion-dollar deal with AI startup Anthropic, giving it a powerful ally in the generative arms race. Amazon committed to investing as much as $4 billion in Anthropic. In return, Amazon Web Services (AWS) became the primary provider of Anthropic’s Claude and other models.
Tech & Society
In the absence of nationwide laws that regulate AI, major U.S. tech companies pledged to abide by voluntary guidelines — most of which they may already be following.
Tech & Society
A new online tool ranks chatbots by pitting them against each other in head-to-head competitions. Chatbot Arena allows users to prompt two large language models simultaneously and identify the one that delivers the better response.
Business
The demise of cryptocurrency exchange FTX threatens funding for some teams devoted to AI safety. FTX, the $32 billion exchange that plunged into bankruptcy last month amid allegations of fraud, had given or promised more than $530 million to over 70 AI-related organizations.