Claude Opus 4.8 Debuts as Anthropic Pushes for ‘Improved Honesty’

A split-screen minimalist graphic with handwritten loops on grid paper beside a black outline hand pointing to a white neural network icon.

Image: Anthropic

Écrit par

May 28, 2026

3 minute read

eWeek Le contenu et les recommandations de produits sont indépendants de la rédaction. Nous pouvons gagner de l'argent lorsque vous cliquez sur des liens vers nos partenaires. En savoir plus

Anthropic says its latest AI model is better at admitting when it might be wrong.

The artificial intelligence lab on Thursday released Claude Opus 4.8, an upgraded version of its leading generally available model. According to the company, the release delivers measurable improvements across coding, reasoning, and practical knowledge work benchmarks while keeping standard costs identical to its predecessor, Opus 4.7.

A primary focus of the Opus 4.8 rollout centers on model reliability and what Anthropic describes as improved honesty. The company noted that AI models frequently "jump to conclusions", confidently claiming to have made progress in their work despite the evidence being thin."

To combat this, the new model was trained to more readily surface gaps in its own confidence and avoid making unsupported assertions. Internal evaluations indicate that Opus 4.8 is roughly 4x less likely than Opus 4.7 to allow flaws in code it has written to pass unremarked.

Early testers have reported a greater tendency for the system to flag uncertainties about its work rather than projecting unearned confidence.

The upgrade also registered across-the-board gains on published benchmarks:

Agentic Coding (Terminal-Bench 2.1): Rose from 64.3% to 69.2%.
Multidisciplinary Reasoning with Tools: Improved from 54.7% to 57.9%.
Agentic Computer Use: Shifted upward from 82.8% to 83.4%.
Knowledge Work: Scores increased from 1,753 to 1,890.

Anthropic's alignment team said the model "reaches new highs on our measures of prosocial traits like supporting user autonomy and acting in the user’s best interest," lowering rates of deceptive or misaligned behaviors down to levels near its restricted preview models.

New controls and parallel subagents

Launching alongside the model are several tool updates aimed at giving users more command over operational costs and computing power.

A newly introduced effort control panel on claude.ai and Cowork allows users to manually adjust the intensity of Claude's processing. Selecting higher effort configurations prompts the model to think more frequently and deeply for complex assignments, consuming more tokens. Lower effort settings accelerate response times and preserve rate limits for simpler tasks.

For software development, Anthropic introduced a research preview feature called "dynamic workflows" in Claude Code. The functionality allows the system to organize, plan, and execute massive projects by deploying hundreds of parallel subagents in a single session.

The company claims this allows Claude to execute codebase-scale migrations spanning hundreds of thousands of lines of code from kickoff to merge, using existing test suites as the benchmark for validation.

For developers using the Messages API, Anthropic added the capability to accept system entries inside the messages array. This allows engineers to adjust instructions, environment contexts, or token budgets mid-task without interrupting the prompt cache or waiting for a user turn.

Mythos looms in the background

Even as Anthropic rolls out Opus 4.8, the company is already pointing toward a more advanced family of AI systems.

Anthropic said it plans to release “a new class of model with even higher intelligence than Opus” in the coming weeks. Those systems are tied to Project Glasswing and Claude Mythos Preview, which are currently being tested by a small number of organizations for cybersecurity work.

According to Anthropic, Mythos-class models require stronger safeguards before wider deployment because of their advanced cybersecurity capabilities.

Also read: Google’s Gemini 3.5 Flash and Antigravity 2.0 add another enterprise coding challenge for Anthropic and OpenAI.

Aminu Abdullahi

Aminu Abdullahi is a B2C and B2B technology and finance writer with more than six years of experience covering enterprise IT, cybersecurity, cloud computing, artificial intelligence, fintech, business software, and emerging technologies. His work has appeared in publications including TechRepublic, eWEEK, Channel Insider, Geekflare, Enterprise Networking Planet, eSecurity Planet, CIO Insight, and Webopedia. With a technical background in computer science, he specializes in translating complex technology topics into clear, accessible content for business leaders and decision-makers.

Claude Opus 4.8 Debuts as Anthropic Pushes for ‘Improved Honesty’

New controls and parallel subagents

Mythos looms in the background

Aminu Abdullahi

Entreprise

Catégories