Anthropic today announced the launch of its latest AI model, Claude Opus 4.8. Anthropic claims the model is a "more effective collaborator" with improvements in agentic coding, multidisciplinary reasoning, agentic computer use, knowledge work, and agentic financial analysis.
Testers have found Opus 4.8 to be "more reliable and sharper in its judgement" when doing agentic tasks, and the model also made gains in honesty.
Early testers report that Opus 4.8 is more likely to flag uncertainties about its work and less likely to make unsupported claims. Anthropic says its own evaluations bear this out, showing that Opus 4.8 is around four times less likely than its predecessor to allow flaws in code it has written to pass unremarked.
Alignment assessments suggest the model hits new highs on measures of prosocial traits like supporting user autonomy and acting in the user's best interest. Rates of misaligned behavior like deception are lower than those of Opus 4.7 and similar to those of the Claude Mythos Previe