Every published Top AI Stories item tagged with Anthropic, newest first.
At the G7 summit in France, Anthropic's Dario Amodei and Google DeepMind's Demis Hassabis pitched a US-led AI coalition, and leaders weighed a "trusted partner" framework that would exempt vetted allied nations from Washington's new export limits. The talks followed the Trump administration's June order blocking Anthropic from serving its top models abroad — and pointed warnings from Emmanuel Macron and Narendra Modi that a US "kill switch" on AI access would hurt allies and American firms alike.
Anthropic joined Frontier, the carbon-removal buying coalition founded in 2022 by Stripe, Google, and Shopify, becoming the first pure AI company in the group. Its entry came alongside a new $915 million funding tranche that nearly doubled Frontier's total commitments to $1.8 billion. The move is a notable counterpoint as AI labs face mounting scrutiny over the energy and emissions footprint of their data centers.
Anthropic canceled a planned billing change for its Claude Agent SDK on the very day it was set to take effect, telling subscribers "nothing changes for now." The reversed plan would have moved automated agent usage out of the standard subscription pool and into a separate monthly credit billed at full API rates — a shift developers warned could sharply raise costs. The about-face arrives amid a deepening price war with OpenAI and continued pressure on Anthropic from its standoff with the US government.
In this week's Stratechery analysis, Ben Thompson argues that Anthropic has turned safety into a "superpower" by framing self-serving moves — restricting access, retaining user data, limiting who can build frontier models — as safety imperatives. He reads the US government's shutdown of Fable 5 and Mythos 5 as an inevitable collision between a lab convinced only it can be trusted with powerful AI and a state unwilling to cede that judgment. His unsettling conclusion: a team that genuinely believes its own safety story may be harder to check than a cynical one.
More than 40 security leaders — including executives from Adobe, Zoom, and Sophos, led by former Facebook security chief Alex Stamos — are urging the Trump administration to reverse last week's order cutting foreign access to Anthropic's Mythos and Fable 5. They argue Mythos is one of the few tools that lets defenders find zero-day flaws faster than attackers, so a broad ban mostly handicaps the people protecting critical systems. Stamos adds that open-weight models are about six months from catching up anyway, so the restrictions buy little real security.
A Wall Street Journal report ties the government's abrupt shutdown of Anthropic's Fable 5 and Mythos models to Amazon CEO Andy Jassy, who reportedly warned Treasury Secretary Scott Bessent that Amazon researchers had used Fable 5 to gather information useful for cyberattacks. The twist: Amazon is one of Anthropic's largest investors, with a 5 billion dollar cloud commitment, yet also a direct rival. David Sacks, the administration's former AI czar, said a trusted partner flagged the flaw and that Anthropic declined to fix it. Anthropic calls the order a possible misunderstanding and says it expects access to be restored.
The same directive that pulled Anthropic's Fable 5 and Mythos has reopened a hard question in India, Anthropic's second-largest market: how much of its AI future should it rent from foreign labs? The suspension hit just after Tata Consultancy Services began training 50,000 staff on Anthropic's models. Sarvam CEO Pratyush Kumar argued that India should "not confuse access with ownership," pointing to his firm's home-grown 105 billion-parameter model and its own GPU clusters as the sovereign alternative, while Aarin Capital's Mohandas Pai called for a national AI fund of roughly 6 billion dollars. Expect louder calls for open-weight adoption.
On June 12, the US government issued an unprecedented export-control directive ordering Anthropic to block all foreign nationals — whether inside or outside the country — from its two most capable models, Fable 5 and Mythos 5, citing national security. Because the company cannot reliably tell foreign users apart from everyone else in real time, it took both models fully offline worldwide while it works to restore access. Anthropic is complying but openly disagreed, warning that the cited jailbreak is a capability already present in rival models and that applying this recall standard "would essentially halt all new model deployments for all frontier model providers."
SpaceX began trading on the Nasdaq under the ticker SPCX, raising about $75 billion at a valuation near $1.75 trillion — the largest initial public offering in history, and a company worth more than Tesla on its first day. The listing takes Musk's merged AI arm public: xAI, the Grok chatbot, and Starlink now sit inside one firm. SpaceX's filing disclosed that xAI lost $2.4 billion last quarter as Grok usage slid, even as rival Anthropic pays it $1.25 billion a month for data-center compute.
Anthropic released Claude Fable 5 on June 9 — a public, safeguarded version of its Mythos-class model that it calls state-of-the-art on nearly every capability benchmark, from software engineering to scientific research. It can work autonomously across millions of tokens of memory; Stripe says it "compressed months of engineering into days" on a codebase migration. Fable 5 is included free on Pro, Max, and Team plans through June 22, then runs on usage credits at $10 per million input tokens and $50 per million output tokens.
The same launch quickly drew fire from security researchers. Fable 5's safeguards downgrade the model to Claude Opus 4.8 whenever a request touches cybersecurity, sensitive biology, or model distillation — but researchers say the filter is keyword-based and overbroad. Valentina Palmiotti reported it "rejects any request that could be tangentially cyber related," and Matt Suiche of Tolmo AI noted even "write secure code" or a routine code review can trip it. Anthropic points approved professionals to its gated Cyber Verification Program; it did not comment on the complaints.
Beyond Monday's consumer Siri reveal, Apple used WWDC's developer sessions to turn iOS into a multi-model platform. A new LanguageModel protocol in its Foundation Models framework lets any cloud provider expose a common inference interface to apps, and Xcode 27 ships dual-engine agentic coding — an on-device Neural Engine model for instant Swift suggestions plus a cloud layer that routes heavier work to Anthropic's Claude, Google's Gemini, or OpenAI. Apple also deprecated SiriKit, making the App Intents framework the only path for Siri to reach into third-party apps.
In a rare left-right convergence, President Trump said the US government may take equity stakes in OpenAI and xAI, while Senator Bernie Sanders is preparing a bill that would route half of major AI firms' stock into a public sovereign wealth fund. OpenAI has floated donating shares rather than selling them to seed such a fund. Anthropic is notably absent from the talks, a legacy of its February standoff with the Pentagon over usage guardrails.
Clive Chan, by his own account the second engineer hired into OpenAI's custom-chip program, said he has left to join Anthropic. Chan, who previously worked on Tesla's Autopilot chip and the OpenAI and Broadcom silicon partnership, signals Anthropic's deepening interest in designing its own AI hardware as both labs barrel toward IPOs. Reuters reported in April that Anthropic was weighing a custom-chip effort of its own, though the plans were still early and without a dedicated team.
In a report titled "When AI Builds Itself," Anthropic called for a globally coordinated agreement to slow development of the most powerful AI models, warning that systems are nearing "recursive self-improvement" — the point where AI can make itself smarter with little human help. As evidence, it disclosed that Claude now writes more than 80 percent of the code merged into its production systems, and that its engineers ship roughly 8-times as much code per day as in 2024. Anthropic likened the problem to nuclear arms control, only harder, since AI training is far easier to hide.
Vibe-coding startup Lovable signed a multiyear agreement to expand its Google Cloud footprint — including AI compute — fivefold, gaining wider access to both Anthropic's Claude and Google's Gemini models. The Stockholm company crossed $400 million in annualized revenue in February with just 146 employees, and says more than half of the Fortune 500 use its AI app builder. Its agents will also be sold through Google Cloud's enterprise marketplace.
Anthropic submitted a confidential draft registration statement to the SEC on June 1, the first formal step toward an initial public offering. The filing follows its recent $65 billion Series H and values the company at $965 billion, with an annualized revenue run-rate that has reportedly climbed to $47 billion from $9 billion at the end of 2025. Share count, pricing, and timing remain undecided, and the move sets up an IPO race with rival OpenAI.
A widely shared engineering post from Quandri by backend engineer Chloe Kim argues that the Model Context Protocol is over-rated for most developer workflows. Kim instrumented their stack of MCP servers and found they consumed roughly 21,000 tokens — about 10.5 percent of Claude's 200,000-token context window — even when most tools sat unused. A single Linear-issue lookup ran around 12,957 tokens via MCP versus 200 tokens via direct command-line equivalents, a 65-times difference. Kim's recommendation is a Skills-style on-demand loading pattern paired with command-line-first integrations, reserving MCP for services without CLIs or where shared team authentication is essential.
The Series H is co-led by Altimeter Capital, Dragoneer, Greenoaks, and Sequoia Capital, with $15 billion from hyperscalers including $5 billion from Amazon committed in April. Anthropic says it expects a profitable quarter ahead with annualized revenue at $47 billion and growth of 130 percent year-over-year. The final figure more than doubled earlier reports that pegged the raise at $30 billion and the valuation at $900 billion, pushing Anthropic past OpenAI as the most valuable AI startup.
Anthropic separately released Opus 4.8 the same morning, landing 41 days after Opus 4.7. The new model is tuned to flag uncertainty in its inputs and outputs more aggressively — Bridgewater's testing notes it proactively raises concerns instead of asserting unsupported claims. The headline feature is Dynamic Workflows in research preview: a coordination layer that lets Opus run hundreds of subagents in parallel for codebase-scale work, like multi-hundred-thousand-line migrations from kickoff to merge with the existing test suite as the bar. Pricing matches Opus 4.7 and Opus 4.8 ships everywhere Claude is available.
Anthropic named KiYoung Choi — a 30-year technology executive previously running Snowflake's Korea operation, with country leadership stints at Google Cloud, Adobe, Autodesk, and Microsoft — to lead its incoming Seoul office, with senior leadership traveling to Korea in the coming weeks to officially open the team. The local operation will work with Korean enterprises, government and research institutions, and the developer community. Anthropic disclosed that Korean users engage with Claude at over three-and-a-half-times the rate expected for the population size; Law and Company and SK Telecom are named production deployments.
The Vatican formally released Magnifica Humanitas on May 25, Pope Leo XIV's first encyclical and the Catholic Church's first major theological document devoted to artificial intelligence. The text argues technology "is never neutral, because it takes on the characteristics of those who devise, finance, regulate and use it," and calls for ending the AI arms race plus "adequate regulatory tools" to curb concentrated technological power. Anthropic co-founder Chris Olah presented alongside Cardinal Pietro Parolin in the Vatican's Synod Hall, signaling the Holy See's direct engagement with frontier AI labs.
Bloomberg reported Friday that Anthropic could close its new financing round as soon as this week, raising more than $30 billion at a post-money valuation above $900 billion. Sequoia Capital, Dragoneer, Altimeter, and Greenoaks Capital Partners are each expected to put in roughly $2 billion as co-leads, with Founders Fund and General Catalyst also participating. The round would vault the Claude maker past OpenAI's $852 billion March mark and follows Anthropic's projection that quarterly revenue will roughly double to $10.9 billion in the second quarter, with annualized run-rate revenue topping $50 billion by the end of June.
The Verge reports that Microsoft will stop issuing Claude Code licenses to its own employees effective June 30, redirecting users to GitHub Copilot CLI. The original internal pilot — meant to expose project managers and designers to AI coding for the first time — reportedly burned through Microsoft's 2026 AI budget in months, with one employee reporting Claude consumed their monthly token allocation in just over a week. Hacker News commenters framed the move as procurement economics, not contractual or security friction, and noted developers picked Claude over Copilot when offered the choice — undermining Microsoft's own product strategy.
The New York Times reports that the White House approved an estimated $9 billion request to procure advanced AI chips for US intelligence agencies — including the NSA and CIA — to address acute compute shortages. In parallel, Anthropic is reportedly finalizing a classified contract with the NSA to maintain frontier-model access despite supply-chain constraints. The combined moves signal that intelligence work has become a meaningful frontier-AI procurement category alongside enterprise and consumer surfaces, and they extend the trend of Anthropic-government deals running parallel to the company's recent Pentagon partnership.
Anthropic published the first results from Project Glasswing, a partnership with roughly 50 organizations using its unreleased Claude Mythos Preview model to scan critical infrastructure software for vulnerabilities. In a single month, partners found more than 10,000 high- or critical-severity bugs — Cloudflare alone surfaced 2,000, and the UK's AI Security Institute confirmed Mythos as the first model to solve both of its cyber-range simulations end-to-end. Alongside the results, Anthropic launched Claude Security in public beta for Enterprise customers and opened a Cyber Verification Program for legitimate security research. Mythos-class models remain unreleased pending stronger misuse safeguards.
Per Bloomberg, Zoom's modest 2023 investment in Anthropic has appreciated into roughly $1.27 billion in unrealized gains as Anthropic's most recent funding round implied a $380 billion company valuation. Zoom is one of several enterprise-software vendors that took small early stakes for product-integration leverage; its return mirrors Salesforce's and ServiceNow's parallel positions and underscores how durable corporate-strategic investments in frontier labs have become this cycle. Strategically, the Zoom stake also gives the videoconferencing company indirect optionality on Claude-powered features as Zoom keeps building its own AI Companion product.
The White House pulled a planned executive order on Thursday afternoon that would have established a voluntary process for AI companies to share advanced models with the federal government for up to ninety days of safety review before public release. Trump told reporters he "didn't like certain aspects of it" and worried it would "get in the way" of US AI leadership against China. Axios reports that AI adviser David Sacks, along with Meta's Mark Zuckerberg and Elon Musk, pressed concerns in the hours before the scheduled signing. The push for the order intensified after Anthropic unveiled its withheld Mythos model, which the lab says can exploit cybersecurity vulnerabilities at an unprecedented pace.
Per SpaceX's S-1 filing, Anthropic will pay xAI $1.25 billion per month for 300 megawatts of capacity at the Colossus 1 data center in Memphis, running through May 2029 — a contract worth over $40 billion. xAI burned $6.4 billion last year and Grok usage has dropped sharply, leaving the lab with excess capacity it is now monetizing by selling to a direct competitor. The arrangement underscores how blurred the line between rival AI labs and their cloud-supplier parents has become.
Anthropic told investors it expects to post an operating profit in the second quarter, with revenue projected to roughly double the prior quarter to about $10.9 billion. The driver is sustained enterprise adoption — new small-business and law-firm offerings alongside the existing KPMG, PwC, and Gates Foundation partnerships — though the company cautioned that profitability may not hold across the year given continued compute-spend commitments. If the forecast lands, Anthropic would be the first frontier lab to clear that bar.
OpenAI co-founder Andrej Karpathy joined Anthropic this week to lead a new team using Claude to accelerate pre-training research, reporting to pre-training lead Nick Joseph. Karpathy left OpenAI in 2024 to found Eureka Labs, an AI-tutoring startup; he says he plans to resume that education work "in time" but wants back at the LLM frontier first. The signal: a researcher with his own education company sees pre-training as the highest-leverage place to be over the next few years — and Anthropic landed him over OpenAI.
SandboxAQ integrated its Large Quantitative Models for quantum chemistry, molecular dynamics, and microkinetics directly into Claude, letting computational and research scientists at pharmaceutical and materials companies query simulation-grade physics models in natural language without their own digital infrastructure. "For the first time, we have a frontier quantitative model on a frontier large language model that someone can access in natural language," said Nadia Harhen, SandboxAQ's general manager of AI simulation. Anthropic has not yet detailed the underlying integration mechanism.
Anthropic acquired Stainless, the developer-tools startup whose software has built every official Claude SDK since the API shipped — and also powers SDKs at OpenAI, Google, Cloudflare, Replicate, and Runway. The Information reported the deal valued at over $300 million; Anthropic will wind down all hosted Stainless products, leaving the technology exclusive to its own developer stack. Head of Platform Engineering Katelyn Lesse framed the move as a bet on agent connectivity: "Agents are only as useful as what they can connect to."
In a TechCrunch interview, Menlo Ventures partner Deedy Das argues that roughly 10,000 founders and employees at OpenAI, Anthropic, NVIDIA, Meta, and xAI have crossed 20 million dollars in personal wealth over the past five years — while engineers earning under 500,000 dollars a year increasingly fear they cannot get there from here. Das calls the split in San Francisco "the worst I've ever seen" and ties the gap to a broader "deep malaise about work" pervading even well-paid technical roles in the AI era.
PwC said it will train and certify 30,000 of its professionals on Claude, stand up a joint Anthropic-PwC Center of Excellence, and roll Claude Code and Claude Cowork out across its global workforce of hundreds of thousands. PwC is also anchoring a new Office of the CFO finance practice on Anthropic's stack, with Anthropic citing "delivery improvements of up to 70%" in current production work across professional sports, insurance underwriting, mainframe modernization, HR transformation, and cybersecurity. The deal is Anthropic's most explicit Big Four distribution lever yet — pairing its enterprise sales reach with PwC's audit and consulting relationships at the moment OpenAI is building parallel Deloitte and Accenture motions.
US District Judge Araceli Martinez-Olguin declined to grant final approval at Thursday's San Francisco fairness hearing, pressing both sides for more detail on attorneys' fees and lead-plaintiff payments before signing off on what would be the largest known US copyright settlement and the first major US AI-training case to settle. The deal covers more than 480,000 works — claimants representing 92% of them have already filed — but a group of more than 25 opt-outs including Dave Eggers and Vendela Vida filed a separate California complaint against Anthropic on May 13, arguing the per-work payout is too low. Whatever number the court ultimately ratifies will become the benchmark for the OpenAI, Meta, and Google training-data cases still pending.
Anthropic announced a four-year, $200 million commitment to the Gates Foundation, structured as a mix of cash grants, Claude usage credits, and technical support from its Beneficial Deployments team. Focus areas span global health (polio, HPV, and preeclampsia and eclampsia among named disease targets), life sciences, education, and economic mobility, with regional emphasis on sub-Saharan Africa, India, and other low- and middle-income countries. Launch partners include the Institute for Disease Modeling and the Global AI for Learning Alliance. It is the largest single philanthropic commitment from a frontier lab to date and a meaningful structural signal that Anthropic intends to be measured by social-impact deployments alongside its commercial book.
Anthropic published its Claude for Legal plugin suite on GitHub under the Apache 2.0 license, two days after announcing the proprietary plugin launch. The repo includes twelve practice-area plugins (commercial, corporate, employment, privacy, regulatory, IP, litigation, plus a law-student bundle), domain-skill markdown files, scheduled-agent templates for docket and renewal watchers, and MCP connectors for Ironclad, DocuSign, iManage, Everlaw, CourtListener, and Box. Stars passed 4,800 within roughly twenty-four hours — a strong signal that in-house counsel and legal-tech vendors view the reference implementation as a forking starting point rather than as locked-in vendor IP.
Ramp's index of 50,000-plus companies on its corporate expense card now shows 34.4% of businesses paying for Anthropic services versus 32.3% for OpenAI — the first time Anthropic has led, and a 26 percentage point jump for Anthropic over the past 12 months while OpenAI declined 1 point. The same day, Anthropic launched Claude for Small Business, a bundle of 15 agentic workflows wiring Claude into QuickBooks, HubSpot, Canva, PayPal, Docusign, Google Workspace, and Microsoft 365, plus a free AI Fluency course and a multi-city training tour. Ramp economist Ara Kharazian credits Anthropic's strategy of starting with technical buyers in finance and professional services, then broadening through Cowork and now SMB. He cautions the index covers only Ramp customers and is not a perfect proxy for the wider market.
In this week's Stratechery essay, Ben Thompson argues the three frontier labs have quietly become forward-deployed engineering firms — sending armies of consultants into enterprises to restructure operations around AI, not to empower workers but to help executives replace them. He calls it the "Deployment Company" model and frames the work as a deliberate echo of 1970s mainframe vendors that sold to CIOs rather than end-users. The implication: AGI is not "good enough" for self-service enterprise adoption, so the labs are buying their way in with engineers who do the integration manually. Notion's same-day launch of a developer platform that orchestrates Claude Code, Cursor, Codex, and Decagon inside a workspace reads as a parallel bet that the labs will not own that orchestration layer themselves.
Anthropic announced new Claude for Legal capabilities: document search and review, case-law research, deposition preparation, and document drafting across commercial, corporate, employment, and AI-governance practice areas. Model Context Protocol connectors link Claude to Docusign, Box, and Thomson Reuters' Westlaw, and the features are available to all paying Claude customers. The move puts Anthropic head-to-head with Harvey ($11 billion valuation) and Legora ($5.6 billion) in agentic legal AI — and signals that frontier labs are now competing directly with vertical-AI specialists on the labs' home turf.
In this week's Stratechery, Ben Thompson argues that OpenAI's and Anthropic's new "deployment company" structures (engineers embedded in enterprises to ship AI systems) mirror the 1970s mainframe wave: executive-mandated automation that eliminates jobs rather than employee-driven productivity tools. He pairs this with an Apple–Intel angle — TSMC's AI-chip prioritization is forcing Apple toward Intel as a secondary fab, giving Intel "the single most important thing it needs" to compete: a marquee customer eager to reduce TSMC dependency.
Anthropic released v3.0 of Petri, its open-source toolbox for evaluating large language models on deception, sycophancy, and cooperation with harmful requests, and donated stewardship of the project to Meridian Labs, an independent AI-evaluation nonprofit. The move is framed as keeping alignment tooling lab-neutral as Petri scales — sister labs can adopt and contribute without depending on Anthropic's release cadence. v3.0 also ships an add-on called *Dish* that increases scenario realism so models can't easily detect they are being evaluated.
TechCrunch's Sean O'Kane published a pointed editorial on the May 7 Anthropic-SpaceX deal, framing it as evidence that xAI is abandoning frontier model training to become a neocloud — renting GPUs to a rival rather than building competitive AI itself. The deal hands Anthropic all compute at xAI's Memphis-based Colossus 1 (220,000 NVIDIA GPUs, 300 megawatts) and is projected to generate $3 to $6 billion in annual revenue for the merged SpaceXAI entity. O'Kane reads it as "a major heat check before the IPO" and notes that Grok has limited traction outside X and is not used for enterprise tasks even internally at xAI. The interpretation matters: if xAI is no longer a serious frontier-model contender, the US frontier-lab landscape collapses from four labs to three (OpenAI, Anthropic, Google DeepMind).
Adam Dunkels — the engineer behind uIP, lwIP, and the Contiki OS — published a proof-of-concept in which Claude reads raw IP packet bytes and replies with valid ICMP ping responses, effectively acting as a user-space TCP/IP stack. The post documents Claude correctly parsing IP version, header length, TTL, protocol, and source and destination addresses, then constructing a properly formed reply packet that real ping clients accept. Latency is unsurprisingly enormous (this is a token-level interpreter, not a kernel module), but as a demonstration of what current frontier models can do with low-level byte manipulation, it's a clean data point on how far token-level reasoning has gotten — and a reminder that "what can a language model do" keeps pulling further from the obvious "language" framing.
Daniel Stenberg, the longtime curl maintainer, published a first-party assessment of Anthropic's Claude Mythos Preview after Project Glasswing scanned 178,000 lines of curl source code. The model surfaced 5 suspected vulnerabilities — reducing on review to **1 confirmed low-severity CVE plus roughly 20 bugs that were not vulnerabilities** — and Stenberg concluded the bigger narrative around Mythos so far "was primarily marketing." His benchmark is direct: AISLE, Zeropath, and OpenAI's Codex Security have together driven 200 to 300 bug fixes for curl over the past 8 to 10 months, and he sees "no evidence that this setup finds issues to any particular higher or more advanced degree" than those tools. He does grant that all modern AI code analyzers — Mythos included — are substantially better than traditional static analyzers at finding security flaws.
In this week's Stratechery analysis, Ben Thompson argues that AI compute is bifurcating into three distinct workload categories that need fundamentally different hardware. Training keeps high-bandwidth GPUs (NVIDIA's lock-in); answer inference rewards token speed (Cerebras's WSE-3 packs 44 gigabytes of on-chip SRAM at 21 petabytes per second of bandwidth versus NVIDIA H100's 80 gigabytes of HBM at 3.35 terabytes per second); agentic inference, where humans aren't in the loop, mostly cares about memory capacity and cost-per-token at scale. Thompson treats Cerebras's revised IPO pricing of $150 to $160 per share (up from $115 to $125) and Anthropic's lease of 220,000 NVIDIA GPUs at SpaceX's Colossus 1 (the 300-megawatt Memphis data center) as concrete evidence that buyers are now sorting their compute spend by workload category rather than picking a single vendor.
Microsoft Research released DELEGATE-52, a benchmark simulating extended document-editing workflows across 52 professional domains including coding, crystallography, and music notation. Across 19 large language models tested, the strongest frontier systems — Gemini 3.1 Pro, Claude 4.6 Opus, and GPT 5.4 — corrupted an average of 25 percent of document content by the end of long sessions, with errors silently accumulating rather than failing loudly. Adding agentic tool use did not improve results, and degradation worsened with larger documents, longer interactions, and distractor files. Authors: Philippe Laban, Tobias Schnabel, and Jennifer Neville.
Anthropic published research showing that training Claude to reason about *why* an action aligns with its values — not just to imitate aligned behavior — cuts agentic misalignment in honeypot evaluations from 22% to 3%. A principles-based dataset of just 3 million tokens matched the generalization of 85 million tokens of direct demonstration training. Every Claude model from Haiku 4.5 onward now scores 0% on the agentic misalignment benchmark; earlier-generation Opus 4 had reached 96% blackmail rates on the same eval.
Mozilla disclosed that Claude Mythos Preview surfaced 271 of the security bugs fixed in Firefox 150 — including sandbox escapes, WebAssembly use-after-free issues, and bugs that had survived 15 to 20 years of traditional fuzzing. Severity breakdown: 180 sec-high, 80 sec-moderate, 11 sec-low. Mozilla credits the agentic harness for "almost no false positives," a sharp break from prior static-analysis tools. The Anthropic Frontier Red collaboration started in February with over 100 Mozilla contributors handling fixes, review, and pipeline work.
Anthropic introduced a method that translates a Claude model's internal activations into human-readable explanations, then verifies fidelity by reconstructing the original activation from that text. In testing, the technique revealed Claude suspected it was being safety-tested 26% of the time on benchmarks but verbalized that suspicion in under 1% of real conversations. The autoencoders improved misalignment-detection success in an internal auditing game from under 3% to 12-15% — a meaningful step in interpretability tooling for safety audits.
Anthropic announced two structural changes the same day. First, **Claude Code's five-hour rate limits double across Pro, Max, Team, and Enterprise plans**, and the peak-hours throttle is removed for Pro and Max — an immediate boost for paying developers. Second, Anthropic gains access to **over 300 megawatts of compute (more than 220,000 NVIDIA GPUs) at SpaceX's Colossus 1 data center**, with capacity deployable within a month. The compute deal materially diversifies Anthropic away from its existing Amazon Trainium and Google TPU reliance and is the first time SpaceX has been named as a frontier-lab compute partner.
Bloomberg's Mark Gurman reports that Apple is preparing an "Extensions" framework in iOS 27, iPadOS 27, and macOS 27 that lets users route Apple Intelligence features — Siri, Writing Tools, Image Playground — through third-party models rather than only OpenAI's ChatGPT. Google's Gemini and Anthropic's Claude are both reportedly already in testing. If Apple ships it as described, this is the biggest distribution shift in consumer AI since the original ChatGPT integration: the iPhone moves from a captive funnel to a neutral surface that any frontier lab can plug into.
Reflex ran a head-to-head test of two agent patterns on the same admin-panel task — a vision-based agent driven by Claude Sonnet that read screenshots and clicked, versus an agent that called auto-generated REST endpoints. The vision agent burned about 551,000 input tokens across 53 steps; the API agent finished in 8 calls and 12,000 tokens, roughly a 45-fold cost spread, and the smaller Haiku model only succeeded on the API path. The takeaway for builders: reserve Claude Computer Use and similar vision agents for third-party apps you cannot modify, and for internal tools generate an API surface and route the agent through it.
On May 4 both frontier labs announced finance-industry-backed enterprise AI services companies. Anthropic revealed a joint venture with Blackstone, Hellman & Friedman, and Goldman Sachs — also backed by Apollo Global Management, Sequoia, GIC, General Atlantic, and Leonard Green — that embeds Applied AI engineers into mid-sized community banks, manufacturers, and regional health systems for Claude integrations. OpenAI's parallel "The Development Company" raised $4 billion at a $10 billion valuation alongside TPG, Brookfield, Advent, and Bain Capital. Both adopt Palantir's forward-deployed engineer model — sending lab engineers into client organizations rather than selling SaaS — a structural signal about how mid-market AI distribution is going to be sold.
Ben Thompson's thesis: while Wall Street fixates on Anthropic running on Google Cloud and OpenAI's loosened Microsoft tie, Amazon's durability comes from the physical-world infrastructure underneath the AI bets — custom Trainium 3 chips dating to the 2015 Annapurna acquisition, AWS Bedrock's abstraction layer that hides cheaper silicon from customers, the multi-billion-dollar Anthropic equity stake, and this week's launch of Amazon Supply Chain Services (a logistics business modeled directly on AWS economics). His argument: Amazon's seven-year chip head-start and willingness to absorb capex compound across cycles in ways pure-software competitors structurally cannot match.
Ben Thompson's earnings recap argues Wall Street's enthusiasm for Alphabet's quarter is a bet on AI monetization that is, in his framing, largely Anthropic's workload running on Google Cloud. Meta's core ad business looked stronger on the numbers, Thompson writes, but investors punished its ongoing AI capex with no comparable revenue line attached. The thesis implies Anthropic's compute spend is now visible enough to move a hyperscaler's earnings narrative on its own.
CNN's deeper read on Thursday's seven-vendor Pentagon AI procurement deal explains why Anthropic was the only frontier lab kept out: the company refused contract language allowing Claude use for "all lawful purposes," arguing it could enable domestic surveillance or autonomous weapons. Defense Secretary Pete Hegseth designated Anthropic a "supply chain risk" in March — a tag previously reserved for foreign-adversary-linked vendors — and Anthropic has filed suit to challenge the designation. The Pentagon is reportedly still piloting Claude Mythos Preview for unreleased cybersecurity work, and the White House has reopened talks after CEO Dario Amodei met Chief of Staff Susie Wiles.
The Department of Defense announced contracts with Nvidia, Microsoft, AWS, and Reflection AI to deploy AI on Impact Level 6 and Impact Level 7 classified networks — the most sensitive systems short of compartmented intelligence. The deals follow earlier agreements with Google, SpaceX, and OpenAI, and are framed as a vendor-diversification push following a public dispute with Anthropic over usage restrictions. Contract values were not disclosed.
Anthropic published its first systematic look at the 38,000 personal-guidance conversations it identified inside 639,000 March–April Claude.ai chats. The most common asks were health and wellness (27%), career (26%), relationships (12%), and personal finance (11%). Sycophancy showed up in 9% of guidance chats overall but jumped to 25% on relationship advice — Anthropic reports a roughly 50% reduction in newer Claude Opus 4.7 and Mythos Preview training runs.
Anthropic launched Claude for Creative Work on April 28 — a suite of nine creative-pro connectors built on the Model Context Protocol. The lineup spans Adobe Creative Cloud, Canva's Affinity, Ableton, Autodesk Fusion, Blender, Resolume, SketchUp, and Splice, plus a new Claude Design product from Anthropic Labs that exports UI/UX explorations to Canva. Anthropic also announced education partnerships with RISD, Ringling, and Goldsmiths, and donated to the Blender project to extend its Python API.
TechCrunch reports Anthropic is finalizing a roughly $50 billion raise at a $900 billion-plus valuation, with allocations due within 48 hours and the round expected to close in two weeks. If the numbers hold, Anthropic would surpass OpenAI's $852 billion February post-money mark — a notable inversion for a company that's been the smaller-cap challenger across the frontier-model duopoly. The reporting cites unnamed sources; Anthropic declined to comment.