BIG-Bench, developed in 2021 as a universal benchmark for testing large language models, has reached its limits as current models achieve over 90% accuracy. In response, Google DeepMind has introduced BIG-Bench Extra Hard (BBEH), which reveals substantial weaknesses even in the most advanced AI models.<br /> The article OpenAI beats Deepseek by a surprisingly wide margin in Google's lat [...]
In a funding round that signals a significant leap forward for AI-assisted software development, Bito has raised a $5.7 million seed extension to further advance its agentic AI platform for code review. The round, led by Vela Partners with backing from NextView Ventures, Maxitech Ventures, Eniac Ventures, and others, brings Bito’s total seed-stage funding to […]<br /> The post Bito Raise [...]
A WIRED investigation found that dozens of YouTube channels are using generative AI to depict cartoon cats and minions being beaten, starved, and sexualized—sparking fears of a new Elsagate wave. [...]