2025-10-23
In a striking act of self-critique, one of the architects of the transformer technology that powers ChatGPT, Claude, and virtually every major AI system told an audience of industry leaders this week [...]
2025-07-07
The Japanese AI startup Sakana AI has developed a new method that lets multiple large language models, such as ChatGPT and Gemini, work together on the same problem. Early tests suggest this collabora [...]
2025-11-04
When the transformer architecture was introduced in 2017 in the now-seminal Google paper "Attention Is All You Need," it became an instant cornerstone of modern artificial intelligence. Ever [...]
2025-10-13
Enterprises often find that when they fine-tune models, one effective approach to making a large language model (LLM) fit for purpose and grounded in data is to have the model lose some of its abiliti [...]
2025-06-01
With the Darwin-Gödel Machine (DGM), Sakana AI introduces an AI system that can iteratively improve itself through self-modification and open-ended exploration. Early results look promising, but the [...]
2025-05-18
Sakana AI, a Tokyo-based startup, has introduced a new kind of AI system designed to mimic how the brain processes time.
2025-06-21
Japanese company Sakana AI built an AI agent that can tackle complex optimization problems used in industry. In a live competition, its AI went head-to-head with more than 1,000 human programmers. [...]
2025-10-13
Researchers at the Massachusetts Institute of Technology (MIT) are gaining renewed attention for developing and open sourcing a technique that allows large language models (LLMs) — like those underp [...]