Destination

2025-08-03

Persona vectors allow Anthropic to steer language model behaviors like sycophancy and evil


Anthropic has developed a technique for monitoring, controlling, and even preventing specific personality traits in language models.


The article Persona vectors allow Anthropic to steer language model behaviors like sycophancy and evil appeared first on THE DECODER.

[...]

Rating

Innovation

Pricing

Technology

Usability

We have discovered similar tools to what you are looking for. Check out our suggestions for similar AI tools.

venturebeat

2025-12-03

AWS claims 90% vector cost savings with S3 Vectors GA, calls it 'complementary' - analysts split on what it means for vector databases

Vector databases emerged as a must-have technology foundation at the beginning of the modern gen AI era. What has changed over the last year, however, is that vectors, the numerical representations o [...]

Match Score: 298.69

venturebeat

2025-10-15

Anthropic is giving away its powerful Claude Haiku 4.5 AI for free to take on OpenAI

Anthropic released Claude Haiku 4.5 on Wednesday, a smaller and significantly cheaper artificial intelligence model that matches the coding capabilities of systems that were considered cutting-edge ju [...]

Match Score: 183.64

Destination

2025-02-28

Engadget Podcast: iPhone 16e review and Amazon's AI-powered Alexa+

The keyword for the iPhone 16e seems to be "compromise." In this episode, Devindra chats with Cherlynn about her iPhone 16e review and try to figure out who this phone is actually for. Also, [...]

Match Score: 177.92

venturebeat

2025-12-08

Anthropic's Claude Code can now read your Slack messages and write code for you

Anthropic on Monday launched a beta integration that connects its fast-growing Claude Code programming agent directly to Slack, allowing software engineers to delegate coding tasks without leaving the [...]

Match Score: 167.27

Destination

2025-08-20

Resident Evil Requiem feels very familiar, but it's so well made that I respect the hell out of it

For nearly 30 years, developer Capcom has been redefining its particular brand of survival horror for the Resident Evil series. Despite its tone shifting between action-horror games and more pure horr [...]

Match Score: 146.87

venturebeat

2025-10-27

Anthropic rolls out Claude AI for finance, integrates with Excel to rival Microsoft Copilot

Anthropic is making its most aggressive push yet into the trillion-dollar financial services industry, unveiling a suite of tools that embed its Claude AI assistant directly into Microsoft Excel and c [...]

Match Score: 120.01

venturebeat

2025-10-16

How Anthropic’s ‘Skills’ make Claude faster, cheaper, and more consistent for business workflows

Anthropic launched a new capability on Thursday that allows its Claude AI assistant to tap into specialized expertise on demand, marking the company's latest effort to make artificial intelligenc [...]

Match Score: 115.53

venturebeat

2025-11-24

Anthropic’s Claude Opus 4.5 is here: Cheaper AI, infinite chats, and coding skills that beat humans

Anthropic released its most capable artificial intelligence model yet on Monday, slashing prices by roughly two-thirds while claiming state-of-the-art performance on software engineering tasks — a s [...]

Match Score: 113.07

venturebeat

2025-12-04

Anthropic vs. OpenAI red teaming methods reveal different security priorities for enterprise AI

Model providers want to prove the security and robustness of their models, releasing system cards and conducting red-team exercises with each new release. But it can be difficult for enterprises to pa [...]

Match Score: 107.40