Alvin Lang
Feb 05, 2026 18:35
Claude Opus 4.6 scores 23% larger on monetary evaluation benchmarks, provides Excel and PowerPoint integrations for funding banking workflows.
Anthropic dropped Claude Opus 4.6 on February 5, positioning its newest AI mannequin as a direct play for monetary providers workflows. The headline quantity: a 23 share level enchancment over Claude Sonnet 4.5 on the corporate’s inside Actual-World Finance benchmark, which assessments roughly 50 funding and monetary evaluation use instances.
The mannequin now scores 60.7% on Vals AI’s Finance Agent benchmark—a 5.47% soar from Opus 4.5—which evaluates efficiency on SEC submitting analysis. It additionally hits 76% on TaxEval, one other exterior benchmark testing tax-related reasoning.
The place Analysts Truly Work
The true story right here is not simply benchmark scores. Anthropic is pushing Claude immediately into the instruments finance professionals use day by day: Excel and PowerPoint.
Claude in Excel now handles pivot tables, chart modifications, conditional formatting, and what Anthropic calls “finance-grade formatting.” The mixing helps multi-file drag-and-drop and auto-compaction for lengthy conversations—addressing the copy-paste hell that plagues anybody constructing complicated monetary fashions throughout a number of tabs.
Claude in PowerPoint launches in beta for Max, Staff, and Enterprise customers. The AI reads present layouts, fonts, and grasp slides earlier than producing new content material, theoretically letting analysts construct client-ready decks with out ranging from scratch.
The Productiveness Declare
Anthropic’s advertising and marketing supplies present side-by-side comparisons of economic due diligence outputs—the type of acquisition evaluation work they are saying “would usually take a senior analyst two to 3 weeks to finish.” First-pass high quality has improved noticeably, based on companions already testing the system.
“Creating monetary PowerPoints that used to take hours now takes minutes,” mentioned Aabhas Sharma, CTO at Hebbia. Nico Christie, co-founder of Shortcut AI, known as it “a watershed second for spreadsheet brokers.”
Lloyd Hilton, Head of Hg Catalyst, famous the mannequin handles “unstructured knowledge and intelligently working with minimal prompting to meaningfully automate complicated evaluation.”
What’s New Below the Hood
Opus 4.6 ships with a 1-million-token context window, letting it course of large datasets in single classes. The mannequin additionally improved on BrowseComp and DeepSearchQA benchmarks, which check info extraction from giant, unstructured doc units—important for anybody doing earnings name evaluation or regulatory submitting critiques.
Cowork, Anthropic’s desktop app characteristic, now lets finance groups kick off a number of analyses concurrently whereas steering Claude’s method on every deliverable. A company finance plugin offers pre-built workflows for journal entries, variance analyses, and reconciliation.
The Superb Print
Anthropic is not claiming full autonomy. “Customers ought to proceed to assessment Claude’s outputs to make sure it meets their specs; significantly for high-stakes work, human judgment stays important,” the corporate famous in its launch.
For crypto and fintech companies evaluating AI integration, Opus 4.6 represents the clearest sign but that basis mannequin corporations are shifting past chatbot interfaces towards embedded enterprise instruments. The query now: how rapidly will competing fashions from OpenAI and Google match these finance-specific capabilities?
Claude Opus 4.6 is offered now on all paid Claude plans. The PowerPoint integration stays in analysis preview for higher-tier subscribers.
Picture supply: Shutterstock


