Destination

2025-07-20

Alibaba's Qwen2.5 only excels at math thanks to memorized training data

Calculator with four question marks on the display against a green 3D grid background.


A new study finds that Alibaba's Qwen2.5 models achieve high math scores mainly by memorizing training data rather than through genuine reasoning.


The article Alibaba's Qwen2.5 only excels at math thanks to memorized training data appeared first on THE DECODER.< [...]

Rating

Innovation

Pricing

Technology

Usability

We have discovered similar tools to what you are looking for. Check out our suggestions for similar AI tools.

Destination

2025-03-26

Alibaba's Qwen2.5-VL-32B matches larger models with just 32B parameters

Alibaba has added a multimodal visual language model to its Qwen2.5 series, marking another step in the Chinese tech company's effort to compete in the commercial AI space.<br /> The articl [...]

Match Score: 121.95

venturebeat

2025-10-09

Nvidia researchers boost LLMs reasoning skills by getting them to 'think' during pre-training

Researchers at Nvidia have developed a new technique that flips the script on how large language models (LLMs) learn to reason. The method, called reinforcement learning pre-training (RLP), integrates [...]

Match Score: 75.31

Destination

2025-02-13

Apple will use Alibaba's generative AI for its iPhones in China

Apple will use Alibaba's generative AI to power artificial intelligence features for iPhones meant for sale in the Chinese market. Joe Tsai, Alibaba Group's Chairman, has confirmed the compa [...]

Match Score: 71.73

Destination

2025-07-10

How exactly did Grok go full 'MechaHitler?'

Earlier this week, Grok, X's built-in chatbot, took a hard turn toward antisemitism following a recent update. Amid unprompted, hateful rhetoric against Jews, it even began referring to itself as [...]

Match Score: 66.17

venturebeat

2025-10-02

'Western Qwen': IBM wows with Granite 4 LLM launch and hybrid Mamba/Transformer architecture

IBM today announced the release of Granite 4.0, the newest generation of its homemade family of open source large language models (LLMs) designed to balance high performance with lower memory and cost [...]

Match Score: 63.59

Destination

2025-01-01

Tech that can help you stick to your New Year’s resolutions

Regardless of how 2024 went for you, 2025 is another chance for all of us to make the new year better than the one that came before it. New Year’s resolutions are usually set with the best intention [...]

Match Score: 57.98

Destination

2025-01-28

Alibaba’s Qwen2.5-Max challenges U.S. tech giants, reshapes enterprise AI

Alibaba's Qwen2.5-Max AI model sets new performance benchmarks in enterprise-ready artificial intelligence, promising reduced infrastructure costs and improved efficiency for business application [...]

Match Score: 57.69

Destination

2025-07-08

Best Amazon Prime Day 2025 deals: Our top picks on headphones, TVs, robot vacuums and more

Amazon Prime Day 2025 has arrived and it has brought a slew of discounts across the entirety of Amazon’s online storefront. As expected, Amazon’s site is pretty overwhelming at the moment and will [...]

Match Score: 57.47

Destination

2025-07-11

The best Amazon Prime Day deals for the last day: Our top picks on headphones, TVs, robot vacuums and more

Amazon Prime Day is almost over, so now’s the time for members to stock up on discounted home essentials, clothing, shoes, and of course, tech. It’s safe to say that Amazon’s website has been ov [...]

Match Score: 57.18