Destination

2025-06-07

Apple study finds "a fundamental scaling limitation" in reasoning models' thinking abilities

A new Apple study shows that current reasoning models such as Claude 3.7 Thinking or DeepSeek-R1 not only fail at complex logic tasks but, paradoxically, think less as difficulty increases. The models show three performance regimes: on simple tasks, classic language models without a dedicated thinking function are more accurate; at medium complexity, reasoning models have the advantage; at high complexity, all models break down completely, regardless of the available compute budget. The researchers speak of a fundamental scaling limit of the reasoning approach and do not see any generalizable problem-so [...]

We have discovered similar tools to what you are looking for. Check out our suggestions for similar AI tools.

venturebeat

2025-10-08

Samsung AI researcher's new, open reasoning model TRM outperforms models 10,000X larger — on specific problems

The trend of AI researchers developing new, small open source generative models that outperform far larger, proprietary peers continued this week with yet another staggering advancement. Alexia Jolicoe [...]

Match Score: 110.16

venturebeat

2025-10-09

Nvidia researchers boost LLMs' reasoning skills by getting them to 'think' during pre-training

Researchers at Nvidia have developed a new technique that flips the script on how large language models (LLMs) learn to reason. The method, called reinforcement learning pre-training (RLP), integrates [...]

Match Score: 95.76

venturebeat

2025-10-02

'Western Qwen': IBM wows with Granite 4 LLM launch and hybrid Mamba/Transformer architecture

IBM today announced the release of Granite 4.0, the newest generation of its homemade family of open source large language models (LLMs) designed to balance high performance with lower memory and cost [...]

Match Score: 84.20

Destination

2025-07-04

Apple's claims about large reasoning models face fresh scrutiny from a new study

A replication study of Apple's controversial "The Illusion of Thinking" paper confirms some of its main criticisms, but challenges the study's central conclusion. The a [...]

Match Score: 81.72

venturebeat

2025-10-08

AI21’s Jamba Reasoning 3B Redefines What “Small” Means in LLMs — 250K Context on a Laptop

The latest addition to the small model wave for enterprises comes from AI21 Labs, which is betting that bringing models to devices will free up traffic in data centers. AI21’s Jamba Reasoning 3B, a [...]

Match Score: 76.84

Destination

2025-09-26

Today's best iPad deals include a record-low price on the latest iPad Air M3

Apple's four iPad models each have their value — the mini is super portable, the standard model with the A16 chip is ideal for casual use while the Pros can handle complex tasks better than som [...]

Match Score: 74.67

Destination

2025-10-10

Today's best iPad deals include the iPad A16 for $279

We generally consider Apple’s iPads to be the best tablets for most people, but most of them don’t come cheap. To help you get the most value possible, we’re keeping a constant eye on sale price [...]

Match Score: 70.55

Destination

2025-02-26

The best Apple Watch in 2025

If you know you want an Apple Watch, but aren’t sure which one to get, this guide is here to explain the differences between the three models. The company’s flagship Apple Watch Series 10 has robu [...]

Match Score: 70.01

venturebeat

2025-10-01

Thinking Machines' first official product is here: meet Tinker, an API for distributed LLM fine-tuning

Thinking Machines, the AI startup founded earlier this year by former OpenAI CTO Mira Murati, has launched its first product: Tinker, a Python-based API designed to make large language model (LLM) fin [...]

Match Score: 69.90