2025-10-09
Researchers at Nvidia have developed a new technique that flips the script on how large language models (LLMs) learn to reason. The method, called reinforcement learning pre-training (RLP), integrates [...]
2025-03-05
Andrew Barto and Richard Sutton have won the 2024 A.M. Turing Award for developing key technologies that power modern artificial intelligence, including recent breakthroughs in large reasoning models. [...]
2025-04-21
AI researcher Sebastian Raschka has published a new analysis that looks at how reinforcement learning is used to improve reasoning in large language models (LRMs).<br /> The article Go read this [...]
2025-06-24
A research team from Singapore and China has introduced LongWriter-Zero, an AI model that uses reinforcement learning to write texts longer than 10,000 words—without relying on synthetic training da [...]
2025-09-01
Prime Intellect, a San Francisco AI startup, has launched the Environments Hub, an open platform for building and sharing reinforcement learning (RL) environments. The aim is to counter the closed sys [...]
2025-04-23
A group of former OpenAI employees, researchers, and nonprofit organizations is urging regulators to block OpenAI’s proposed corporate restructuring, arguing it threatens the company’s founding mi [...]
2025-08-30
Andrej Karpathy, a former Tesla and OpenAI researcher, is part of a growing movement in the AI community calling for a new approach to building large language models (LLMs) and AI systems.<br /> [...]
2025-09-29
DeepSeek continues to push the frontier of generative AI...in this case, in terms of affordability.The company has unveiled its latest experimental large language model (LLM), DeepSeek-V3.2-Exp, that [...]