venturebeat
Ai2's new Olmo 3.1 extends reinforcement learning training for stronger reasoning benchmark performance

The Allen Institute for AI (Ai2) recently released what it calls its most powerful family of models yet, Olmo 3. But the company kept iterating on the models, expanding its reinforcement learning (RL) runs, to create Olmo 3.1.

The new Olmo 3.1 models focus on efficiency, transparency, and control for enterprises. Ai2 updated two of the three versions of Olmo 3: Olmo 3.1 Think 32B, the flagship model optimized for advanced research, and Olmo 3.1 Instruct 32B, designed for instruction following, multi-turn dialogue, and tool use. Olmo 3 has a third version, Olmo 3-Base, geared toward programming, comprehension, and math; it also works well for continued fine-tuning.

Ai2 said that to upgrade Olmo 3 Think 32B to Olmo 3.1, its researchers extended its best RL run with a longer training schedule. "Afte [...]

venturebeat
Ai2’s Olmo 3 family challenges Qwen and Llama with efficient, open reasoning and customization

The Allen Institute for AI (Ai2) hopes to take advantage of an increased demand for customized models and enterprises seeking more transparency from AI models with its latest release.Ai2 made the late [...]

venturebeat
Bolmo’s architecture unlocks efficient byte‑level LM training without sacrificing quality

Enterprises that want tokenizer-free multilingual models are increasingly turning to byte-level language models to reduce brittleness in noisy or low-resource text. To tap into that niche — and make [...]

venturebeat
Microsoft built Phi-4-reasoning-vision-15B to know when to think — and when thinking is a waste of time

Microsoft on Tuesday released Phi-4-reasoning-vision-15B, a compact open-weight multimodal AI model that the company says matches or exceeds the performance of systems many times its size — while co [...]

venturebeat
Ai2 releases MolmoWeb, an open-weight visual web agent with 30K human task trajectories and a full training stack

Engineers building browser agents today face a choice between closed APIs they cannot inspect and open-weight frameworks with no trained model underneath them. Ai2 is now offering a third option.The S [...]

venturebeat
Phi-4 proves that a 'data-first' SFT methodology is the new differentiator

AI engineers often chase performance by scaling up LLM parameters and data, but the trend toward smaller, more efficient, and better-focused models has accelerated. The Phi-4 fine-tuning methodology [...]

venturebeat
Nvidia researchers boost LLMs' reasoning skills by getting them to 'think' during pre-training

Researchers at Nvidia have developed a new technique that flips the script on how large language models (LLMs) learn to reason. The method, called reinforcement learning pre-training (RLP), integrates [...]

venturebeat
Google’s new AI training method helps small models tackle complex reasoning

Researchers at Google Cloud and UCLA have proposed a new reinforcement learning framework that significantly improves the ability of language models to learn very challenging multi-step reasoning task [...]

venturebeat
Ai2’s Molmo 2 shows open-source models can rival proprietary giants in video understanding

Fresh off releasing the latest version of its Olmo foundation model, the Allen Institute for AI (Ai2) launched its open-source video model, Molmo 2, on Tuesday, aiming to show that smaller, open model [...]

venturebeat
New training method boosts AI multimodal reasoning with smaller, smarter datasets

Researchers at MiroMind AI and several Chinese universities have released OpenMMReasoner, a new training framework that improves the capabilities of language models in multimodal reasoning.The framewo [...]
