wired

2025-03-05

Pioneers of Reinforcement Learning Win the Turing Award

Having machines learn from experience was once considered a dead end. It's now critical to artificial intelligence, and work in the field has won two men the highest honor in computer science. [...]

venturebeat

2025-10-13

Self-improving language models are becoming reality with MIT's updated SEAL technique

Researchers at the Massachusetts Institute of Technology (MIT) are gaining renewed attention for developing and open sourcing a technique that allows large language models (LLMs) — like those underp [...]

Match Score: 117.89

Destination

2025-03-05

Algorithms from the 1980s power today's AI breakthroughs, earn Turing Award for researchers

Andrew Barto and Richard Sutton have won the 2024 A.M. Turing Award for developing key technologies that power modern artificial intelligence, including recent breakthroughs in large reasoning models. [...]

Match Score: 83.33

venturebeat

2025-10-09

Nvidia researchers boost LLMs reasoning skills by getting them to 'think' during pre-training

Researchers at Nvidia have developed a new technique that flips the script on how large language models (LLMs) learn to reason. The method, called reinforcement learning pre-training (RLP), integrates [...]

Match Score: 78.59

fastcompany

2025-03-05

AI pioneers win the Turing Award, tech’s top prize

Andrew Barto and Richard Sutton are the winners of this year’s A.M. Turing Award, the tech world’s equivalent of the Nobel Prize. [...]

Match Score: 71.05

venturebeat

2025-11-14

Google’s new AI training method helps small models tackle complex reasoning

Researchers at Google Cloud and UCLA have proposed a new reinforcement learning framework that significantly improves the ability of language models to learn very challenging multi-step reasoning task [...]

Match Score: 62.39

venturebeat

2025-10-29

Vibe coding platform Cursor releases first in-house LLM, Composer, promising 4X speed boost

The vibe coding tool Cursor, from startup Anysphere, has introduced Composer, its first in-house, proprietary coding large language model (LLM) as part of its Cursor 2.0 platform update. Composer is d [...]

Match Score: 50.32

venturebeat

2025-10-24

Thinking Machines challenges OpenAI's AI scaling strategy: 'First superintelligence will be a superhuman learner'

While the world's leading artificial intelligence companies race to build ever-larger models, betting billions that scale alone will unlock artificial general intelligence, a researcher at one of [...]

Match Score: 48.97

Destination

2025-04-23

AI pioneers warn OpenAI's corporate overhaul could betray its original mission for humanity

A group of former OpenAI employees, researchers, and nonprofit organizations is urging regulators to block OpenAI’s proposed corporate restructuring, arguing it threatens the company’s founding mi [...]

Match Score: 46.11

venturebeat

2025-11-21

Google’s ‘Nested Learning’ paradigm could solve AI's memory and continual learning problem

Researchers at Google have developed a new AI paradigm aimed at solving one of the biggest limitations in today’s large language models: their inability to learn or update their knowledge after trai [...]

Match Score: 44.85