Machine Learning Research
Grok 4 Shows Impressive Smarts, Questionable Behavior: Grok 4 launches with benchmark records and idiosyncratic behavior
xAI updated its Grok vision-language model and published impressive benchmark results. But, like earlier versions, Grok 4 showed questionable behavior right out of the gate.