Filtered by company

27 stories about NVIDIA

Every published Top AI Stories item tagged with NVIDIA, newest first.

Jun 16, 2026Top AI Stories

Nvidia raises $25 billion in its first bond sale since 2021 as orders top $85 billion

Nvidia sold $25 billion of investment-grade notes across seven tranches maturing as far out as 2056 — its first corporate bond deal since a $5 billion raise in 2021. Demand reached more than $85 billion, over three times the offering, and the company upsized the sale from an initial $20 billion. For a firm sitting on a roughly $5 trillion market value and ample cash, tapping the debt market signals just how much it plans to spend building out AI data-center capacity.

Jun 16, 2026Top AI Stories

A Loft Orbital satellite runs Google's Gemma model in orbit to find targets on its own

Loft Orbital says its YAM-9 satellite became the first spacecraft to run a vision-language model in orbit, using Google DeepMind's Gemma 3 to answer plain-English queries — classifying where wilderness meets development, or spotting infrastructure near rail hubs — without beaming raw imagery to ground analysts first. The model ran on an Nvidia Jetson edge chip paired with NAVI-Orbital software from NASA's Jet Propulsion Laboratory. Onboard triage like this could sharply cut how much data satellites need to downlink, and Loft says 50 to 100 such craft would give near-real-time eyes on Earth.

Jun 15, 2026Top AI Stories

Neura Robotics raises $1.4 billion, the largest round ever for a robot maker

Germany's Neura Robotics raised $1.4 billion in a Series C round backed by Amazon, NVIDIA, Qualcomm, Bosch, and the European Investment Bank, valuing the cognitive-robotics company at about $7 billion. It is the biggest single round ever raised by a full-stack robotics maker, and a sign that strategic players are betting hard on humanoids for warehouse and factory work. Neura already holds more than $1 billion in pre-orders for its two-armed, two-legged 4NE-1 robot, whose first units are due to ship late this year.

Jun 12, 2026Top AI Stories

Nvidia opens orders for its Vera AI CPU to Chinese customers

Nvidia told Chinese cloud customers that its new Vera data-center CPU is now open for orders and could ship as early as August, according to Reuters. The pitch is a workaround: after US export curbs choked Nvidia's GPU sales in China, the company is leaning on standalone CPUs — which face looser restrictions — to defend the market. Nvidia's finance chief has said Vera CPU revenue could approach $20 billion in the coming fiscal year.

Jun 11, 2026Top AI Stories

Nvidia and Hyundai expand their alliance to put physical AI on the factory floor

Jensen Huang and Hyundai chair Chung Euisun used a meeting in Seoul to widen their partnership from research prototypes toward factory-ready robotics, spanning mobility, manufacturing, and autonomous driving. Hyundai will stand up an AI supercomputer built on 50,000 Nvidia Blackwell GPUs to train models for self-driving and smart factories, and the two companies will work with the Korean government on new AI technology centers. Huang called Hyundai one of the best robotics companies in the world. The deal is one of the largest national commitments yet to physical AI — the effort to move machine intelligence off the screen and into machines that move.

Jun 9, 2026Top AI Stories

China drafts a $295 billion plan to build nationwide AI data centers and cut out Nvidia

Bloomberg reports that Beijing is preparing to spend roughly 2 trillion yuan, about $295 billion, over five years on a network of interconnected AI computing hubs run mostly by state firms like China Mobile and China Telecom. The blueprint, drafted by agencies including the National Development and Reform Commission, would source at least 80 percent of the hardware, including AI chips, from domestic suppliers such as Huawei — effectively designing Nvidia and AMD out of the country's largest buildout. It is the clearest signal yet that China intends to win the AI compute race on homegrown silicon.

Jun 7, 2026Top AI Stories

Google will pay SpaceX 920 million dollars a month to rent 110,000 Nvidia AI chips

Google will pay SpaceX about $920 million a month from October 2026 through June 2029 — roughly $30 billion in total — for access to around 110,000 Nvidia chips, capacity SpaceX first built for its own xAI division. Google Cloud called it short-term "bridge capacity" for its Gemini Enterprise platform while its own data centers scale up. The deal lands days before SpaceX's IPO, where Google already holds about a 5 percent stake, and echoes an earlier SpaceX compute arrangement with Anthropic worth roughly $1.25 billion a month.

Jun 4, 2026Top AI Stories

Ben Thompson: in the age of AI agents, 'thin is in' and the cloud is the hub

In this week's Stratechery analysis, Ben Thompson argues that AI agents shift the center of gravity from the device to the cloud — making Nvidia's GPU-heavy RTX Spark AI PC a poor fit, since agents want strong local CPUs that call out to cloud inference, while praising Microsoft's Project Solara, which treats the cloud as the hub and phones and PCs as interchangeable spokes. He frames Microsoft's in-house MAI models as a way for cautious enterprises to own custom agents without handing their workflows to frontier labs.

Jun 2, 2026Top AI Stories

Intel details Crescent Island, a memory-heavy AI inference GPU to undercut Nvidia

At Computex, Intel detailed Crescent Island, a data-center GPU built on its Arc Xe3P architecture and aimed at AI inference. Partner cards can carry up to 480 gigabytes of LPDDR5X memory — more than Nvidia's and AMD's flagships — while skipping the scarce, expensive high-bandwidth memory those chips rely on, which Intel says makes the card cheaper to produce and run on a 350-watt air cooler. Intel is targeting a second-half 2026 launch as it tries to chip away at Nvidia's data-center lead.

Jun 1, 2026Top AI Stories

NVIDIA jumps into PC chips with the Arm-based N1X, debuting in Microsoft, Dell, and HP laptops

NVIDIA used its Computex keynote to enter the personal-computer market, unveiling the N1X — an Arm-based chip co-designed with MediaTek that pairs a 20-core processor with a Blackwell graphics processor and 6,144 CUDA cores. It will debut in Windows laptops from Microsoft, Dell, HP, ASUS, Lenovo, and MSI before the 2026 holidays, with performance models priced above $2,000 to challenge Apple's MacBook Pro and longtime chip leaders Intel and AMD. A lower-power N1 variant will start under $1,500.

Jun 1, 2026Top AI Stories

NVIDIA also launches Nemotron 3 Ultra, a 550-billion-parameter open-weights model, at Computex

Separately at the same Computex keynote, NVIDIA launched Nemotron 3 Ultra, the largest model in the open-weights family it first previewed in December. The hybrid Mamba-Transformer mixture-of-experts (MoE) model carries roughly 550 billion total parameters with about 50 billion active per token, and NVIDIA says it tops US open-weights rankings while running about 30 percent cheaper than leading alternatives. The weights and training recipes are free to download.

May 30, 2026Top AI Stories

Groq raises 650 million dollars to fund inference cloud after Nvidia talent deal

Groq is raising 650 million dollars to refocus the company on its inference neocloud — the on-demand cloud platform powered by Groq's own AI chips — after a December 2025 arrangement saw Nvidia pay 20 billion dollars for senior Groq engineering talent and a hardware license. Existing investors are leading the round, with Disruptive and Infinitium committed to fill any unsubscribed shares. Adam Winter is now interim CEO and Matt Eng interim CFO. The company argues inference is a much bigger market than training right now, and the wedge for the smaller team that remains.

May 28, 2026Top AI Stories

Nvidia pledges $150 billion a year for Taiwan, calling it the 'epicenter' of the AI revolution

At the groundbreaking for a new Taipei headquarters, Jensen Huang announced that Nvidia will spend $150 billion a year in Taiwan, up from $10 to $15 billion annually four to five years ago. The headquarters will employ 4,000 people and target a 2030 opening, anchoring Nvidia closer to TSMC, which fabricates its chips, and Foxconn, which assembles them into server racks. The pledge lands as Washington has spent two years pushing chipmakers to build US capacity, and it cements Taiwan rather than the United States as the structural center of advanced AI manufacturing for at least the rest of the decade.

May 27, 2026Top AI Stories

Ben Thompson: SpaceX's IPO valuation is a bet on orbital AI data centers

In this week's Stratechery analysis, Ben Thompson argues that SpaceX's rumored IPO at a $2 trillion valuation only makes sense if Starship enables data centers in orbit. His core thesis: terrestrial data center expansion is now constrained more by community zoning opposition than by power generation, and the existing Starlink V2 Mini satellite form factor — about 7.4 meters by 2.7 meters — is comparable to NVIDIA's NVL72 rack. Combined with Starlink's laser interconnects, the constellation already has the network topology required for distributed orbital compute; power dissipation and radiation hardening become engineering problems rather than fundamental obstacles to agentic-inference workloads.

May 26, 2026Top AI Stories

Ben Thompson reads Nvidia's new reporting taxonomy as a commoditization fight

In this week's Stratechery analysis, Ben Thompson argues that Nvidia's revised reporting taxonomy — splitting hyperscaler revenue from everyone else — reveals the contours of an emerging commoditization fight at the top of the stack. Nvidia is "fighting commoditization" with its largest customers, where hyperscalers increasingly design their own silicon, while it "runs the whole stack" for AI clouds, sovereigns, and enterprises. The reframe lands one week after Nvidia's record $81.6 billion quarter on May 20, when hyperscaler revenue held at roughly half of the $75.2 billion data-center total.

May 25, 2026Top AI Stories

NVIDIA CEO Jensen Huang says NVIDIA has 'largely conceded' China's AI chip market to Huawei as the company's share falls toward zero

At a press event in Taipei this week, NVIDIA CEO Jensen Huang told reporters the company has *"largely conceded"* China's AI accelerator market to Huawei, with NVIDIA's share now near zero after the US H200 export-clearance stalemate dragged into a second month. Huawei expects roughly $12 billion in 2026 revenue from its Ascend line — up from $7.5 billion in 2025 — on orders already placed by Alibaba, ByteDance, and Tencent, all of which deployed DeepSeek V4 services within hours of the model's Ascend-optimized release in April. Huang said he still expects Beijing to eventually allow H200 imports, but for now the homegrown stack is shipping while NVIDIA's clearance letters sit in customs.

May 24, 2026Top AI Stories

NVIDIA releases open-weights Nemotron Diffusion, a 14-billion-parameter tri-mode model with a 2.2-times speedup at matched accuracy

NVIDIA Labs has quietly posted Nemotron Diffusion to Hugging Face — a 14-billion-parameter language model that switches between autoregressive decoding, parallel diffusion decoding, and a *"self-speculation"* mode that drafts with diffusion and verifies with autoregression, all without changing model weights. The accompanying technical report claims a 2.2-times throughput lift over the comparable Qwen 3 8-billion-parameter baseline at matched accuracy, scaling to 850 tokens per second on a GB200 (a 3.3-times lift). Base, instruct, and vision-language variants are all open-weight; the architecture is positioned as a path from memory-bound to compute-bound inference as GPUs keep outrunning memory bandwidth.

May 21, 2026Top AI Stories

Nvidia posts record $81.6 billion quarter and names Vera CPU a toolIds: 00 billion agentic market

Nvidia reported $81.6 billion in revenue for the quarter ending April 26, a 20% sequential jump, with data-center revenue at a record $75.2 billion. The earnings disclosure also surfaced $43 billion in non-marketable startup equity — nearly double the prior quarter — including a previously-undisclosed []0 billion commitment to OpenAI. On the call, Jensen Huang positioned the new Vera CPU as a "brand new toolIds: 00 billion total addressable market" built for autonomous-acting agents.

May 17, 2026Top AI Stories

NVIDIA Labs open-sources SANA-WM, a 2.6 billion parameter one-minute video world model

NVIDIA Labs released SANA-WM, an open-source 2.6 billion parameter world model under Apache 2.0 that generates one-minute videos at 720p resolution with 6 degrees of camera-pose control. The team reports training on roughly 213,000 video clips in 15 days on 64 H100 GPUs, and says a distilled variant runs on a single consumer GPU. The paper positions SANA-WM as a baseline for embodied-AI and robotics research at a fraction of closed-model compute budgets.

May 17, 2026Top AI Stories

Menlo's Deedy Das: AI gold rush has minted 10,000 multi-millionaires and 'malaise' for everyone else

In a TechCrunch interview, Menlo Ventures partner Deedy Das argues that roughly 10,000 founders and employees at OpenAI, Anthropic, NVIDIA, Meta, and xAI have crossed 20 million dollars in personal wealth over the past five years — while engineers earning under 500,000 dollars a year increasingly fear they cannot get there from here. Das calls the split in San Francisco "the worst I've ever seen" and ties the gap to a broader "deep malaise about work" pervading even well-paid technical roles in the AI era.

May 11, 2026Top AI Stories

Ben Thompson: AI compute is splitting into training, answer inference, and agentic inference

In this week's Stratechery analysis, Ben Thompson argues that AI compute is bifurcating into three distinct workload categories that need fundamentally different hardware. Training keeps high-bandwidth GPUs (NVIDIA's lock-in); answer inference rewards token speed (Cerebras's WSE-3 packs 44 gigabytes of on-chip SRAM at 21 petabytes per second of bandwidth versus NVIDIA H100's 80 gigabytes of HBM at 3.35 terabytes per second); agentic inference, where humans aren't in the loop, mostly cares about memory capacity and cost-per-token at scale. Thompson treats Cerebras's revised IPO pricing of $150 to $160 per share (up from $115 to $125) and Anthropic's lease of 220,000 NVIDIA GPUs at SpaceX's Colossus 1 (the 300-megawatt Memphis data center) as concrete evidence that buyers are now sorting their compute spend by workload category rather than picking a single vendor.

May 10, 2026Top AI Stories

Nvidia commits over $40 billion to equity AI deals in early 2026, led by $30 billion OpenAI bet

Nvidia has now committed more than $40 billion to equity investments in AI companies in 2026, including a single $30 billion stake in OpenAI, up to $3.2 billion in glassmaker Corning, and up to $2.1 billion in data-center operator IREN. Wedbush analyst Matthew Bryson described the pattern as "squarely circular" — money cycling between chip vendor, model customer, and infrastructure provider. The chipmaker has also closed roughly two dozen private startup rounds plus 67 venture deals across 2025, intensifying scrutiny of how concentrated the AI capital stack has become around a single supplier.

May 10, 2026Top AI Stories

NVIDIA Nemotron Elastic packs three nested reasoning models in a single checkpoint

NVIDIA Research published Nemotron Elastic, a post-training method that embeds 30, 23, and 12 billion-parameter nested reasoning models inside a single 30 billion-parameter parent — extractable via zero-shot slicing without further fine-tuning. The recipe achieves a 360-times token reduction over pretraining from scratch, and the 30 billion checkpoint compresses to 18.7 gigabytes under NVFP4 quantization. The 23-to-30 billion configuration advances the accuracy-and-latency Pareto frontier with up to 16 percent higher accuracy and 1.9-times lower latency than the default Nemotron Nano v3 budget control. All three precision variants (BF16, FP8, NVFP4) are available on Hugging Face under nvidia/NVIDIA-Nemotron-Labs-3-Elastic-30B-A3B.

May 5, 2026Top AI Stories

Cerebras files largest US tech IPO of 2026 with OpenAI as top customer

Cerebras Systems filed to sell 28 million shares priced between $115 and $125 per share, targeting $3.5 billion in proceeds at a $26.6 billion valuation — the largest US tech IPO of 2026 so far. OpenAI is one of the chipmaker's largest customers under a multi-year contract worth more than $10 billion signed in January, and holds a $1 billion secured loan plus warrants for over 33 million shares, potentially making OpenAI a major shareholder post-listing. The offering puts Cerebras' Wafer-Scale Engine 3 against Nvidia in a public-market test of GPU pricing power and frontier-lab compute lock-in.

May 2, 2026Top AI Stories

Pentagon adds Nvidia, Microsoft, AWS, Reflection AI to classified-network AI fleet

The Department of Defense announced contracts with Nvidia, Microsoft, AWS, and Reflection AI to deploy AI on Impact Level 6 and Impact Level 7 classified networks — the most sensitive systems short of compartmented intelligence. The deals follow earlier agreements with Google, SpaceX, and OpenAI, and are framed as a vendor-diversification push following a public dispute with Anthropic over usage restrictions. Contract values were not disclosed.

May 1, 2026Top AI Stories

Mistral ships Medium 3.5 open-weights with 256k context + async cloud coding agents

Mistral released Mistral Medium 3.5, a 128-billion-parameter dense model with a 256,000-token context window, available open-weights under a modified MIT license at $1.50 / $7.50 per million input/output tokens. The release pairs with Vibe remote agents — async cloud coding agents launched from CLI or Le Chat that handle refactors and test generation in parallel. Reported benchmarks: 77.6% on SWE-Bench Verified, 91.4 on τ³-Telecom, and self-hosting on as few as four GPUs.

May 1, 2026Top AI Stories

Legora hits $5.6 billion valuation as legal AI battle with Harvey escalates

Swedish legal-AI startup Legora closed a $50 million Series D extension led by NVentures (NVIDIA) and Atlassian at a $5.6 billion post-money valuation, claiming over $100 million ARR and 1,000+ law firms across 50 markets. Harvey still leads at an $11 billion valuation with 100,000 lawyers and 1,300 organizations as customers. Both companies are launching celebrity ad campaigns — Harvey with Gabriel Macht, Legora with Jude Law — and expanding into each other's geographies.