Every published Top AI Stories item tagged with Cloud Infrastructure & Data Processing, newest first.
Amazon made its Graviton5 processor generally available, pitching it as purpose-built for the real-time reasoning, code generation, and multi-step orchestration that AI agents demand. AWS says the chip delivers up to 25 percent better compute performance than the prior generation, with 192 cores per processor and 33 percent lower inter-core latency. Uber and Snowflake are among the early adopters, joining more than 120,000 customers already running on Graviton. The launch underscores how the cloud giants are racing to design their own silicon and cut their dependence on Nvidia for the inference work that agents run around the clock.
OpenAI and Oracle said Oracle Cloud Infrastructure customers will be able to apply their existing Oracle Universal Credits toward OpenAI's frontier models and the Codex coding agent, putting that access inside the purchasing workflow enterprises already use. Availability is expected in the coming weeks. It is the latest thread in the deep Stargate compute relationship between the two companies, and it gives OpenAI another enterprise distribution channel — a reminder that the IPO-bound lab is still busy widening the on-ramps that drive its revenue.
Meta will build a 168-megawatt AI data center in Jamnagar, Gujarat, with Reliance — its first AI infrastructure investment in India, expandable over time and slated to open within two years. Meta is covering all energy and water costs, drawing on renewable power and seawater cooling, and has contracted nearly a gigawatt of additional clean capacity through separate deals. The agreement deepens a partnership that began with Meta's $5.7 billion stake in Reliance's Jio Platforms in 2020.
Bloomberg reports that Beijing is preparing to spend roughly 2 trillion yuan, about $295 billion, over five years on a network of interconnected AI computing hubs run mostly by state firms like China Mobile and China Telecom. The blueprint, drafted by agencies including the National Development and Reform Commission, would source at least 80 percent of the hardware, including AI chips, from domestic suppliers such as Huawei — effectively designing Nvidia and AMD out of the country's largest buildout. It is the clearest signal yet that China intends to win the AI compute race on homegrown silicon.
A City Council proposal on the June 9 agenda would direct El Paso to negotiate terminating the tax incentives behind Meta's planned $10 billion, one-gigawatt AI data center, the largest such fight yet over a hyperscale campus. The site would draw on city water for cooling and rely partly on hundreds of gas-fired generators, fueling community opposition. City attorneys have warned that breaking the 2023 agreement without cause could expose taxpayers to liability above $1 billion, and the sponsoring representative needs four more votes to prevail — making the outcome a national test case for local pushback against AI infrastructure.
Amazon signed a multiyear, multibillion-dollar agreement for Corning to supply optical fiber, cable, and connectivity for its US data centers, with production centered at Corning's North Carolina plants. The deal creates about 1,000 advanced-manufacturing jobs plus a new technician-training program, and underscores how fiber has become a bottleneck for AI clusters — optical links move far more data between thousands of chips than copper, with less power loss. It is Corning's third AI mega-deal of 2026, after a $6 billion Meta agreement in January and a $3.2 billion Nvidia partnership in May.
Alphabet unveiled a plan to raise $80 billion through equity offerings to fund AI infrastructure and compute — its largest capital raise to date. The package pairs $30 billion in underwritten public offerings with a $40 billion at-the-market program, plus a $10 billion private placement from Warren Buffett's Berkshire Hathaway. Alphabet expects 2026 capital spending of $180 billion to $190 billion, and says 2027 will rise significantly.
OpenAI's frontier models and its Codex coding agent are now generally available on Amazon Bedrock, graduating from the limited preview that opened in April. GPT-5.5 runs in AWS's US East region and GPT-5.4 in US East and US West, callable through the Responses API with pricing that matches OpenAI's first-party rates. The move lets AWS customers reach OpenAI models inside their existing security, logging, and compliance controls — a notable thaw between OpenAI and a cloud rival to its main backer, Microsoft.
Environmental advocate Erin Brockovich has launched a campaign pressing for transparency in how data centers get sited near communities. After inviting reports in April, she received about 4,000 submissions in a month — citing noise, water use, and utility costs, but most often a lack of openness, with projects revealed only after permits are signed and some local officials bound by non-disclosure agreements. She stresses she opposes the secrecy, not data centers or AI themselves.
Google Research deployed a new privacy-preserving analytics technique into Android SafetyCore that lets safety-model developers measure the true-positive rate of on-device classifiers across a global fleet without ever seeing the private content that triggered each alert. The technique combines a lattice-based cryptographic protocol that allows a single encrypted submission per device, instead of the multi-round handshakes prior secure-aggregation methods required, with trusted execution environments that attest the server code. It is now used to detect model drift, surface hidden classifier biases, and measure Smart Reply error rates at production scale.
In this week's Stratechery analysis, Ben Thompson argues that SpaceX's rumored IPO at a $2 trillion valuation only makes sense if Starship enables data centers in orbit. His core thesis: terrestrial data center expansion is now constrained more by community zoning opposition than by power generation, and the existing Starlink V2 Mini satellite form factor — about 7.4 meters by 2.7 meters — is comparable to NVIDIA's NVL72 rack. Combined with Starlink's laser interconnects, the constellation already has the network topology required for distributed orbital compute; power dissipation and radiation hardening become engineering problems rather than fundamental obstacles to agentic-inference workloads.
OpenRouter, the multi-model API gateway founded in 2023, raised a $113 million Series B led by CapitalG — Google's growth venture fund — at a $1.3 billion post-money valuation, more than doubling from its $547 million Series A last June. The company routes requests across more than 400 models from Anthropic, Google, OpenAI, xAI, and DeepSeek, and now serves roughly 8 million users at around 100 trillion tokens per month — a 5-times jump in throughput over the past six months. The funding signals that multi-model routing has moved from optional plumbing to default architecture for production AI deployments.
A Gallup poll released this month found that 70 percent of Americans oppose construction of an AI data center in their local community, including 48 percent who say they are *"strongly opposed."* Half of opponents cite excessive use of resources — 18 percent each name water and energy use specifically — while another 16 percent cite pollution and noise. Two-thirds of supporters cite economic benefits, mostly local job creation. The first national survey of public sentiment on the issue lands as Senator Bernie Sanders has introduced moratorium legislation, Maine's legislature has passed and then watched a governor's veto kill a construction ban, and protests have escalated from Tremonton, Utah, to Monterey Park, California — a sign that the *"AI is good for jobs"* pitch is not yet outweighing local concern about water, grid, and noise.
Nvidia reported $81.6 billion in revenue for the quarter ending April 26, a 20% sequential jump, with data-center revenue at a record $75.2 billion. The earnings disclosure also surfaced $43 billion in non-marketable startup equity — nearly double the prior quarter — including a previously-undisclosed []0 billion commitment to OpenAI. On the call, Jensen Huang positioned the new Vera CPU as a "brand new toolIds: 00 billion total addressable market" built for autonomous-acting agents.
Per SpaceX's S-1 filing, Anthropic will pay xAI $1.25 billion per month for 300 megawatts of capacity at the Colossus 1 data center in Memphis, running through May 2029 — a contract worth over $40 billion. xAI burned $6.4 billion last year and Grok usage has dropped sharply, leaving the lab with excess capacity it is now monetizing by selling to a direct competitor. The arrangement underscores how blurred the line between rival AI labs and their cloud-supplier parents has become.
Stratechery's Sunday essay tackles the political backlash forming around hyperscale data-center sites — a theme that's recurred across this week's newsletter (NAACP versus xAI in Mississippi, the Pennsylvania town hall, Peter Thiel's bet on offshore wave-powered data centers via Panthalassa). Ben Thompson's framework: local opposition is rational, because the upside (compute capacity, AI capability) accrues to coastal labs and shareholders while the downsides (noise, water draws, transmission lines, falling property values) hit the communities directly. His proposed fix is the politically uncomfortable but mathematically simple one — pay opponents directly rather than promising jobs that don't materialize.
A May 13 virtual town hall organized by the Better Path Coalition and No False Climate Solutions PA pulled in roughly 225 viewers and over 20 speakers from multiple counties, all describing the same set of grievances: surging electricity bills, heavy water draws, noise, industrial-scale builds next to homes and schools, and what one speaker called being "bulldozed over" by projects whose permitting moves faster than community input. State Senator Katie Muth is pushing a three-year moratorium on hyperscale development, and Chester County's North Coventry Township is now drafting a local data-center ordinance. The Pennsylvania backlash echoes the NAACP's suit against xAI's Mississippi turbines from this week — community pushback against AI compute siting is becoming a named political issue, not just a permitting headache.
Oregon-based Panthalassa raised $140 million led by Peter Thiel's personal fund — with Founders Fund a returning backer from 2018 — pushing the company's valuation to roughly $1 billion. The pitch is unsubtle: pair massive floating orbs that harvest wave energy with on-board AI compute, use cold ocean water for cooling, and uplink results to land via low-Earth-orbit satellites. CEO Garth Sheldon-Coulson is targeting a northern Pacific Ocean-3 pilot deployment later this year and commercial nodes in 2027. The bet — placed the same week Pennsylvanians filled a town hall demanding onshore data centers slow down — is that ocean siting bypasses transmission costs, water-rights fights, and the community veto that keeps blocking gigawatt-scale builds on land.
Cerebras Systems priced its IPO at $185 per share — above its $115 to $160 range — and raised roughly $5.5 billion before the stock more than doubled on debut Thursday, closing at $311 for a $66 billion market cap. The S-1 names OpenAI, Group 42, Saudi Arabia's MBZUAI, and Amazon Web Services as top customers, and Cerebras swung to profitability on $510 million of 2025 revenue. The pop validates the thesis that frontier labs will pay a premium for Nvidia alternatives if inference economics work — and gives Cerebras a public-market currency to chase Nvidia's data-center share more aggressively.
The Southern Environmental Law Center, on behalf of the NAACP, filed suit alleging xAI is operating 46 natural gas turbines at its Mississippi data center after state regulators permitted only 15 of them. Mississippi treats the trailer-mounted turbines as "mobile" equipment, which it argues exempts them from federal air-pollution standards for a year — but the complaint contends federal law allows states to regulate trailer-mounted power plants as stationary sources, and that Mississippi is simply declining to enforce. The NAACP frames the unchecked emissions as worsening air quality in an already polluted region. xAI has not responded publicly. The case is the first major regulatory test of the "neocloud" pattern of standing up frontier-scale data centers on temporary fossil-fuel power.
TechCrunch's Sean O'Kane published a pointed editorial on the May 7 Anthropic-SpaceX deal, framing it as evidence that xAI is abandoning frontier model training to become a neocloud — renting GPUs to a rival rather than building competitive AI itself. The deal hands Anthropic all compute at xAI's Memphis-based Colossus 1 (220,000 NVIDIA GPUs, 300 megawatts) and is projected to generate $3 to $6 billion in annual revenue for the merged SpaceXAI entity. O'Kane reads it as "a major heat check before the IPO" and notes that Grok has limited traction outside X and is not used for enterprise tasks even internally at xAI. The interpretation matters: if xAI is no longer a serious frontier-model contender, the US frontier-lab landscape collapses from four labs to three (OpenAI, Anthropic, Google DeepMind).
In this week's Stratechery analysis, Ben Thompson argues that AI compute is bifurcating into three distinct workload categories that need fundamentally different hardware. Training keeps high-bandwidth GPUs (NVIDIA's lock-in); answer inference rewards token speed (Cerebras's WSE-3 packs 44 gigabytes of on-chip SRAM at 21 petabytes per second of bandwidth versus NVIDIA H100's 80 gigabytes of HBM at 3.35 terabytes per second); agentic inference, where humans aren't in the loop, mostly cares about memory capacity and cost-per-token at scale. Thompson treats Cerebras's revised IPO pricing of $150 to $160 per share (up from $115 to $125) and Anthropic's lease of 220,000 NVIDIA GPUs at SpaceX's Colossus 1 (the 300-megawatt Memphis data center) as concrete evidence that buyers are now sorting their compute spend by workload category rather than picking a single vendor.
Nvidia has now committed more than $40 billion to equity investments in AI companies in 2026, including a single $30 billion stake in OpenAI, up to $3.2 billion in glassmaker Corning, and up to $2.1 billion in data-center operator IREN. Wedbush analyst Matthew Bryson described the pattern as "squarely circular" — money cycling between chip vendor, model customer, and infrastructure provider. The chipmaker has also closed roughly two dozen private startup rounds plus 67 venture deals across 2025, intensifying scrutiny of how concentrated the AI capital stack has become around a single supplier.
Matthew Prince and Michelle Zatlyn told staff Cloudflare is laying off more than 1,100 employees — roughly 20% of headcount — even as Q1 2026 revenue hit a record $639.8 million, up 34% year over year. Prince explicitly framed the cut as structural, not cost-driven, citing internal AI usage up over 600% in three months and per-employee productivity gains of 2 to 100x since November 2025. Affected roles skew toward support and back-office functions; sales staff with revenue quotas were spared.
Ben Thompson's weekly argues that Apple, Amazon, Meta, Google, and Microsoft are running rationally disciplined — not reckless — AI investment programs, even as their combined Q1 capex topped three times the inflation-adjusted cost of the entire Manhattan Project. Wall Street rewarded Google over Meta this cycle because Google is monetizing inference today; Amazon is recast as well-positioned for the inference era despite missing the training era; Microsoft is rolling out an agentic business model while Apple wrestles with chip and memory constraints.
Anthropic announced two structural changes the same day. First, **Claude Code's five-hour rate limits double across Pro, Max, Team, and Enterprise plans**, and the peak-hours throttle is removed for Pro and Max — an immediate boost for paying developers. Second, Anthropic gains access to **over 300 megawatts of compute (more than 220,000 NVIDIA GPUs) at SpaceX's Colossus 1 data center**, with capacity deployable within a month. The compute deal materially diversifies Anthropic away from its existing Amazon Trainium and Google TPU reliance and is the first time SpaceX has been named as a frontier-lab compute partner.
Cerebras Systems filed to sell 28 million shares priced between $115 and $125 per share, targeting $3.5 billion in proceeds at a $26.6 billion valuation — the largest US tech IPO of 2026 so far. OpenAI is one of the chipmaker's largest customers under a multi-year contract worth more than $10 billion signed in January, and holds a $1 billion secured loan plus warrants for over 33 million shares, potentially making OpenAI a major shareholder post-listing. The offering puts Cerebras' Wafer-Scale Engine 3 against Nvidia in a public-market test of GPU pricing power and frontier-lab compute lock-in.
Ben Thompson's thesis: while Wall Street fixates on Anthropic running on Google Cloud and OpenAI's loosened Microsoft tie, Amazon's durability comes from the physical-world infrastructure underneath the AI bets — custom Trainium 3 chips dating to the 2015 Annapurna acquisition, AWS Bedrock's abstraction layer that hides cheaper silicon from customers, the multi-billion-dollar Anthropic equity stake, and this week's launch of Amazon Supply Chain Services (a logistics business modeled directly on AWS economics). His argument: Amazon's seven-year chip head-start and willingness to absorb capex compound across cycles in ways pure-software competitors structurally cannot match.
AWS and OpenAI announced on April 28 that GPT-5.5, GPT-5.4, the Codex coding agent, and a new Amazon Bedrock Managed Agents capability are now in limited preview for enterprise customers via Bedrock — OpenAI's first major distribution outside the seven-year Microsoft Azure exclusivity. Customers can evaluate and deploy OpenAI models alongside Anthropic, Meta, Mistral, Cohere, and Amazon's own models in a single Bedrock console with unified security, governance, and cost controls. The launch follows last week's amended Microsoft-OpenAI agreement that ended exclusive cloud rights through 2032 and signals a multi-cloud distribution era for frontier models.