
Tech & Society
Benchmarks for Industry: Vals AI evaluates large language models on industry-specific tasks.
How well do large language models respond to professional-level queries in various industry domains? A new company aims to find out.
Tech & Society
How well do large language models respond to professional-level queries in various industry domains? A new company aims to find out.
Business
Amazon cut a multi billion-dollar deal with AI startup Anthropic, giving it a powerful ally in the generative arms race. Amazon committed to investing as much as $4 billion in Anthropic. In return, Amazon Web Services (AWS) became the primary provider of Anthropic’s Claude and other models.
Tech & Society
Anthropic, the startup behind the safety-focused Claude chatbot, teamed up with South Korea’s largest mobile phone provider.
Tech & Society
A new online tool ranks chatbots by pitting them against each other in head-to-head competitions. Chatbot Arena allows users to prompt two large language models simultaneously and identify the one that delivers the best responses.