Destination

2025-04-19

GPT-4o makes beautiful images but fails basic reasoning tests, UCLA study finds

Despite the introduction of the new multimodal model GPT-4o, image generation in ChatGPT is still based on DALL-E 3, but OpenAI currently seems to be working on the image generator.


A new study by the University of California, Los Angeles shows: GPT-4o produces impressive images, but fails at tasks that require real image understanding, contextual thinking and logical reasoning.


The article Discover Copy

Rating

Innovation

Pricing

Technology

Usability

We have discovered similar tools to what you are looking for. Check out our suggestions for similar AI tools.

venturebeat

2025-11-17

Phi-4 proves that a 'data-first' SFT methodology is the new differentiator

AI engineers often chase performance by scaling up LLM parameters and data, but the trend toward smaller, more efficient, and better-focused models has accelerated. The Phi-4 fine-tuning methodology [...]

Match Score: 136.05

venturebeat

2025-11-14

Google’s new AI training method helps small models tackle complex reasoning

Researchers at Google Cloud and UCLA have proposed a new reinforcement learning framework that significantly improves the ability of language models to learn very challenging multi-step reasoning task [...]

Match Score: 125.22

venturebeat

2025-11-06

Moonshot's Kimi K2 Thinking emerges as leading open source AI, outperforming GPT-5, Claude Sonnet 4.5 on key benchmarks

Even as concern and skepticism grows over U.S. AI startup OpenAI's buildout strategy and high spending commitments, Chinese open source AI providers are escalating their competition and one has e [...]

Match Score: 115.63

Destination

2025-10-22

Private Internet Access VPN review: Both more and less than a budget VPN

I came into this review thinking of Private Internet Access (PIA) as one of the better VPNs. It's in the Kape Technologies portfolio, along with the top-tier ExpressVPN and the generally reliable [...]

Match Score: 107.88

venturebeat

2025-10-30

Meta researchers open the LLM black box to repair flawed AI reasoning

Researchers at Meta FAIR and the University of Edinburgh have developed a new technique that can predict the correctness of a large language model's (LLM) reasoning and even intervene to fix its [...]

Match Score: 100.62

Destination

2025-06-27

NordVPN Review 2025: Innovative features, a few missteps

When we say that NordVPN is a good VPN that's not quite great, it's important to put that in perspective. Building a good VPN is hard, as evidenced by all the shovelware VPNs flooding the ma [...]

Match Score: 99.71

venturebeat

2025-11-12

OpenAI reboots ChatGPT experience with GPT-5.1 after mixed reviews of GPT-5

ChatGPT is about to become faster and more conversational as OpenAI upgrades its flagship model GPT-5 to GPT-5.1.OpenAI announced two updates to the GPT-5 series: GPT-5.1 Instant and GPT-5.1 Thinking. [...]

Match Score: 96.20

Destination

2025-07-26

Surfshark VPN review: A fast VPN for casual users

Surfshark is one of the youngest major VPNs, but it's grown rapidly over the last seven years. Since 2018, it's expanded its network to 100 countries, added a suite of apps to its Surfshark [...]

Match Score: 94.51

Destination

2025-08-07

GPT-5 is here and it's free for everyone

A couple of days after announcing its first open-weight models in six years, OpenAI is releasing the long-awaited GPT-5. What's more, you can start using it today, even if you're a free user [...]

Match Score: 83.94