Seedance creates new movie-making interface: Even top models struggle at identifying visual specifics
The future of ads in ChatGPT. AI stresses in high-tech workplaces. Big models’ curse of multilingual training. Opus 4.6’s expensive fast mode.
Hugging Face’s community approach to benchmarks. Four cloud companies’ plans to spend $650 billion this year. A new proof of an old math problem. Frontier, OpenAI’s enterprise agent system.
Job seekers in the U.S. and many other nations face a tough environment.
Mistral compressed Mistral Small 3.1 into much smaller versions, yielding a family of relatively small, open-weights vision-language models that perform better by some measures than competing models of similar size. The method combines pruning and distillation.
On its 25th anniversary, Wikipedia celebrated with high-profile deals that make its data easier for AI companies to use in training their models, in exchange for financial support.
An open source vision-language model unleashes minion agents that enable it to perform tasks more quickly and effectively.
The OpenClaw open-source AI agent became a sudden sensation, inspiring excitement, worry, and hype about the agentic future.
The Batch AI News and Insights: Job seekers in the U.S. and many other nations face a tough environment.
xAI’s merger with SpaceX. Microsoft’s plan for AI companies to pay publishers. Xcode’s incorporation of Claude and Codex. Nvidia’s quantization method for reasoning models.
OpenAI’s internal data analysis agent. SERA, open-weights coding models built for agents. Google’s gene prediction research in Nature. GPT-4o’s swan song.
Individuals and organizations increasingly use large language models to produce media that helps them compete for attention. Does fine-tuning LLMs to encourage engagement, purchases, or votes affect their alignment with social values? Researchers found that it does.
Artificial Analysis, which tests AI systems, updated the component evaluations in its Intelligence Index to better reflect large language models’ performance in real-world use cases.