Peektastic.com

Google releases open-source LMEval to benchmark language and multimodal models

LMEval aims to standardize benchmarks and streamline safety analysis for large language and multimodal models.<br /> The article Google releases open-source LMEval to benchmark language and multimodal models appeared first on THE DECODER. [...]

Discover Copy

Rating

Innovation

Pricing

Technology

Usability

We have discovered similar tools to what you are looking for. Check out our suggestions for similar AI tools.

venturebeat

Microsoft built Phi-4-reasoning-vision-15B to know when to think — and when thinking is a waste of time

Microsoft on Tuesday released Phi-4-reasoning-vision-15B, a compact open-weight multimodal AI model that the company says matches or exceeds the performance of systems many times its size — while co [...]

More Copy

Match Score: 164.27

venturebeat

Z.ai debuts open source GLM-4.6V, a native tool-calling vision model for multimodal reasoning

Chinese AI startup Zhipu AI aka Z.ai has released its GLM-4.6V series, a new generation of open-source vision-language models (VLMs) optimized for multimodal reasoning, frontend automation, and high-e [...]

More Copy

Match Score: 135.58

venturebeat

Baidu unveils proprietary ERNIE 5 beating GPT-5 performance on charts, document understanding and more

Mere hours after OpenAI updated its flagship foundation model GPT-5 to GPT-5.1, promising reduced token usage overall and a more pleasant personality with more preset options, Chinese search giant Bai [...]

More Copy

Match Score: 134.78

blogspot

How I Get Free Traffic from ChatGPT in 2025 (AIO vs SEO)

Three weeks ago, I tested something that completely changed how I think about organic traffic. I opened ChatGPT and asked a simple question: "What's the best course on building SaaS with Wor [...]

More Copy

Match Score: 124.38

venturebeat

What to be thankful for in AI in 2025

Hello, dear readers. Happy belated Thanksgiving and Black Friday!This year has felt like living inside a permanent DevDay. Every week, some lab drops a new model, a new agent framework, or a new “th [...]

More Copy

Match Score: 122.57

venturebeat

Baidu just dropped an open-source multimodal AI that it claims beats GPT-5 and Gemini

Baidu Inc., China's largest search engine company, released a new artificial intelligence model on Monday that its developers claim outperforms competitors from Google and OpenAI on several visio [...]

More Copy

Match Score: 111.55

venturebeat

Alibaba's small, open source Qwen3.5-9B beats OpenAI's gpt-oss-120B and can run on standard laptops

Despite political turmoil in the U.S. AI sector, in China, the AI advances are continuing apace without a hitch.Earlier today, e-commerce giant Alibaba's Qwen Team of AI researchers, focused prim [...]

More Copy

Match Score: 111.50

venturebeat

Black Forest Labs launches Flux.2 AI image models to challenge Nano Banana Pro and Midjourney

It's not just Google's Gemini 3, Nano Banana Pro, and Anthropic's Claude Opus 4.5 we have to be thankful for this year around the Thanksgiving holiday here in the U.S.No, today the Germ [...]

More Copy

Match Score: 104.53

venturebeat

Frontier models are failing one in three production attempts — and getting harder to audit

AI agents are now embedded in real enterprise workflows, and they're still failing roughly one in three attempts on structured benchmarks. That gap between capability and reliability is the defin [...]

More Copy

Match Score: 97.97