Patronus AI, the artificial intelligence evaluation startup backed by $20 million from investors including Lightspeed Venture Partners and Datadog, unveiled a new training architecture Tuesday that it says represents a fundamental shift in how AI agents learn to perform complex tasks.The technology, which the company calls "Generative Simulators," creates adaptive simulation environments that continuously generate new challenges, update rules dynamically, and evaluate an agent's performance as it learns — all in real time. The approach marks a departure from the static benchmarks that have long served as the industry standard for measuring AI capabilities but have increasingly come under fire for failing to predict real-world performance."Traditional benchmarks measur [...]
Artificial intelligence agents powered by the world's most advanced language models routinely fail to complete even straightforward professional tasks on their own, according to groundbreaking re [...]
Baseten, the AI infrastructure company recently valued at $2.15 billion, is making its most significant product pivot yet: a full-scale push into model training that could reshape how enterprises wean [...]
Researchers at Meta, the University of Chicago, and UC Berkeley have developed a new framework that addresses the high costs, infrastructure complexity, and unreliable feedback associated with using r [...]
OpenAI introduced a new paradigm and product today that is likely to have huge implications for enterprises seeking to adopt and control fleets of AI agent workers.Called "Workspace Agents," [...]
Resolve AI, the production-operations startup backed by Greylock and Lightspeed Venture Partners, today announced a sweeping expansion of its platform that introduces always-on background agents, a re [...]
Microsoft today announced the general availability of Agent 365 and Microsoft 365 Enterprise 7, two products designed to bring security and governance to the rapidly growing population of AI agents op [...]
Mistral AI on Monday launched Forge, an enterprise model training platform that allows organizations to build, customize, and continuously improve AI models using their own proprietary data — a move [...]
Multimodal AI is transforming the field of artificial intelligence by combining different types of data, such as text, images, video, and audio, to provide a deeper understanding of information. This [...]
3TS Capital Partners led the round with Grazia Equity and existing investors. 25,000 users, 85% daily wear rate, 500,000+ emergency calls handled. The company is building an AI companion for the watch [...]