2025-08-07
In the ARC-AGI-2 benchmark, which is designed to measure a language model's general reasoning skills, GPT-5 (High) scored 9.9 percent at a cost of $0.73 per task, according to ARC Prize.
The article Grok 4 edges out GPT-5 in complex reasoning benchmark ARC-AGI appeared first on THE DECODER.
[...]2025-10-08
The trend of AI researchers developing new, small open source generative models that outperform far larger, proprietary peers continued this week with yet another staggering advancement.Alexia Jolicoe [...]
2025-09-11
Grok has once again been caught spreading blatant misinformation on X. In several bizarre exchanges, the chatbot repeatedly claimed that Charlie Kirk was "fine" and that gruesome videos of h [...]
2025-07-12
The team behind Grok has issued a rare apology and explanation of what went wrong after X's chatbot began spewing antisemitic and pro-Nazi rhetoric earlier this week, at one point even calling it [...]
2025-02-18
xAI has launched its Grok 3 models during a livestream with Elon Musk, who said they were "an order of magnitude more capable than Grok 2." The Grok 3 mini model can answer questions quickly [...]
2025-07-09
One day after Grok posted a series of antisemitic and pro-Nazi rants on X, Elon Musk is seemingly trying to blame rogue users for the chatbot's unhinged posts. "Grok was too compliant to use [...]
2025-07-11
Grok 4 aligns its answers with Elon Musk's when it comes to controversial issues, users have discovered shortly after the company launched the new model. Some users posted screenshots on X asking [...]