Peektastic.com

wired

Pioneers of Reinforcement Learning Win the Turing Award

Having machines learn from experience was once considered a dead end. It's now critical to artificial intelligence, and work in the field has won two men the highest honor in computer science. [...]

Discover Copy

Rating

Innovation

Pricing

Technology

Usability

We have discovered similar tools to what you are looking for. Check out our suggestions for similar AI tools.

venturebeat

Self-improving language models are becoming reality with MIT's updated SEAL technique

Researchers at the Massachusetts Institute of Technology (MIT) are gaining renewed attention for developing and open sourcing a technique that allows large language models (LLMs) — like those underp [...]

More Copy

Match Score: 95.56

Algorithms from the 1980s power today's AI breakthroughs, earn Turing Award for researchers

Andrew Barto and Richard Sutton have won the 2024 A.M. Turing Award for developing key technologies that power modern artificial intelligence, including recent breakthroughs in large reasoning models. [...]

More Copy

Match Score: 74.57

fastcompany

AI pioneers win the Turing Award, tech’s top prize

Andrew Barto and Richard Sutton, are the winners of this year’s A.M. Turing Award, the tech world’s equivalent of the Nobel Prize. [...]

More Copy

Match Score: 63.86

venturebeat

Nvidia researchers boost LLMs reasoning skills by getting them to 'think' during pre-training

Researchers at Nvidia have developed a new technique that flips the script on how large language models (LLMs) learn to reason. The method, called reinforcement learning pre-training (RLP), integrates [...]

More Copy

Match Score: 63.71

venturebeat

MIT's new fine-tuning method lets LLMs learn new skills without losing old ones

When enterprises fine-tune LLMs for new tasks, they risk breaking everything the models already know. This forces companies to maintain separate models for every skill.Researchers at MIT, the Improbab [...]

More Copy

Match Score: 63.34

venturebeat

Google finds that AI agents learn to cooperate when trained against unpredictable opponents

Training standard AI models against a diverse pool of opponents — rather than building complex hardcoded coordination rules — is enough to produce cooperative multi-agent systems that adapt to eac [...]

More Copy

Match Score: 51.20

venturebeat

Google’s new AI training method helps small models tackle complex reasoning

Researchers at Google Cloud and UCLA have proposed a new reinforcement learning framework that significantly improves the ability of language models to learn very challenging multi-step reasoning task [...]

More Copy

Match Score: 50.81

venturebeat

AI agents fail 63% of the time on complex tasks. Patronus AI says its new 'living' training worlds can fix that.

Patronus AI, the artificial intelligence evaluation startup backed by $20 million from investors including Lightspeed Venture Partners and Datadog, unveiled a new training architecture Tuesday that it [...]

More Copy

Match Score: 49.30

AI pioneers warn OpenAI's corporate overhaul could betray its original mission for humanity

A group of former OpenAI employees, researchers, and nonprofit organizations is urging regulators to block OpenAI’s proposed corporate restructuring, arguing it threatens the company’s founding mi [...]

More Copy

Match Score: 42.62