Filtered by industry

37 stories about Semiconductor Manufacturing

Every published Top AI Stories item tagged with Semiconductor Manufacturing, newest first.

Jul 24, 2026Top AI Stories

AMD launches Helios, a 72-GPU rack system to challenge Nvidia

AMD used its Advancing AI event to launch Helios, its first full rack-scale system aimed squarely at Nvidia's grip on AI data centers. Each rack packs 72 of AMD's new Instinct MI455X GPUs with sixth-generation EPYC processors and Pensando networking, delivering up to 2.9 exaflops of inference performance. Microsoft will run Helios on Azure and Anthropic plans to install up to two gigawatts of the chips, with OpenAI and Meta also signed on. Systems ship in the second half of the year.

Jul 24, 2026Top AI Stories

Etched raises $300 million, more than doubling its valuation to $10.3 billion

Etched, a startup building chips hardwired specifically for the transformer models behind today's chatbots, raised $300 million, led by Sequoia, more than doubling its valuation to $10.3 billion. Andreessen Horowitz, SK Hynix, and backers including Peter Thiel and Andrej Karpathy joined the round. Founded by three Harvard dropouts, Etched says it has already booked $1 billion in orders and is shipping its first systems — a bet that purpose-built silicon can undercut Nvidia's general-purpose GPUs on inference cost.

Jul 18, 2026Top AI Stories

AI's hunger for memory chips drags the global phone market to a 13-year low

Surging AI data-center demand for memory is squeezing consumer electronics, as chip makers steer supply toward far more profitable AI servers. Counterpoint Research says mobile memory prices in the second quarter ran nearly double their late-2025 levels, helping drag global smartphone shipments down about 3 percent to their weakest second quarter in 13 years and ending nine straight quarters of growth. Budget phones are hit hardest — models priced under 150 dollars face what analysts call a near-permanent squeeze out of the market — with price-sensitive markets like India seeing the sharpest declines.

Jul 8, 2026Top AI Stories

China's DeepSeek is quietly designing its own AI inference chip

Reuters reports that DeepSeek has spent about a year secretly recruiting chip engineers and courting manufacturing partners to build an in-house processor for AI inference — the stage where a trained model answers user queries. The goal is to cut its dependence on Nvidia and Huawei as US export controls keep tightening. It follows OpenAI's Broadcom-built Jalapeno inference chip and Anthropic's own reported chip ambitions — a sign that frontier labs increasingly want to own their silicon.

Jul 8, 2026Top AI Stories

AI chip maker SambaNova raises $1 billion at an $11 billion valuation

SambaNova, which designs chips tuned to run large models efficiently, raised $1 billion in a round led by General Atlantic that values it at $11 billion — just five months after its last mega-round. Backers include Intel, BlackRock, and T. Rowe Price. The company pitches itself as an inference-focused alternative to Nvidia, and its CEO signaled that a public listing, not a sale, is the likely path.

Jul 1, 2026Top AI Stories

AI-chip startup Etched books $1 billion in orders for its inference hardware

Etched, the startup building chips specialized purely for AI inference rather than general-purpose GPUs, said it has booked $1 billion in contracts for its frontier inference clusters and carries a $5 billion valuation. Backed by Jane Street, Hudson River Trading, and angels including Andrej Karpathy and Geoffrey Hinton, the Harvard-dropout-founded company is chasing the inference bottleneck that dominates AI costs at scale — where it takes on Nvidia, Cerebras, and Groq.

Jun 30, 2026Top AI Stories

South Korea unveils an $880 billion plan to dominate AI chips and robots

President Lee Jae-myung unveiled three "mega projects" totaling roughly $880 billion over ten years — about 5 percent of the country's GDP. The package pours $518 billion into new Samsung and SK Hynix memory fabs to ease the AI-driven shortage some call "RAMageddon," funds $356 billion of AI data centers, and aims to lift South Korea's share of the humanoid-robot market from 1 percent to 20 percent, with Hyundai's Boston Dynamics mass-producing Atlas robots by 2028.

Jun 23, 2026Top AI Stories

Groq confirms a $650 million raise to rebuild after Nvidia's $20 billion talent grab

AI chipmaker Groq confirmed a $650 million funding round, led by the investment firm Disruptive, as it rebuilds after Nvidia's roughly $20 billion deal late last year to license Groq's technology and hire away its founder and much of its top engineering team. Groq is now doubling down on its "neocloud" business — renting out fast inference capacity — and restocking its executive bench. The startup last carried a valuation near $6.9 billion and did not disclose a new one.

Jun 21, 2026Top AI Stories

Qualcomm is in talks to buy AI-chip startup Tenstorrent for up to $10 billion

Qualcomm is reportedly negotiating to acquire Tenstorrent, the AI-chip startup led by legendary designer Jim Keller, in a deal valued between $8 billion and $10 billion. Tenstorrent builds RISC-V-based processors it claims beat general-purpose GPUs on specific AI workloads, so the purchase would hand Qualcomm a credible data-center play against Nvidia and a path beyond smartphone chips. The talks are unconfirmed and could still fall apart, but they show how aggressively mobile-chip makers are now chasing AI silicon.

Jun 19, 2026Top AI Stories

Amazon plans to sell its Trainium AI chips outside AWS, taking aim at Nvidia

Amazon is in talks to sell its in-house Trainium chips directly to outside data centers for the first time, breaking from years of reserving them for AWS cloud customers. AI chief Peter DeSantis confirmed the discussions in Paris but named no buyers, pointing to rising demand — especially in Europe — to keep AI compute under local control. CEO Andy Jassy has called the homegrown chip business an opportunity worth 50 billion dollars a year; pulling it off would put Amazon in direct hardware competition with Nvidia for the first time.

Jun 19, 2026Top AI Stories

The US warns ASML its most restricted chip tool may be in China; ASML denies it

Commerce Secretary Howard Lutnick told ASML executives that Washington believes one of the company's extreme ultraviolet lithography (EUV) machines — the kind banned from export to China — may have ended up there, Bloomberg reported. ASML pushed back hard, circulating a document titled "No indication of any ASML EUV system in China" and noting it has never shipped an EUV system, component, or module to the country. No evidence of an actual transfer has been made public, and the systems weigh roughly 180 metric tons, making a quiet move unlikely.

Jun 16, 2026Top AI Stories

Nvidia raises $25 billion in its first bond sale since 2021 as orders top $85 billion

Nvidia sold $25 billion of investment-grade notes across seven tranches maturing as far out as 2056 — its first corporate bond deal since a $5 billion raise in 2021. Demand reached more than $85 billion, over three times the offering, and the company upsized the sale from an initial $20 billion. For a firm sitting on a roughly $5 trillion market value and ample cash, tapping the debt market signals just how much it plans to spend building out AI data-center capacity.

Jun 15, 2026Top AI Stories

Tencent-backed chipmaker Enflame wins approval for a Shanghai AI-chip IPO

Enflame, a Shanghai designer of AI training and inference chips, has won approval for an initial public offering on the STAR Market, where it plans to raise about 6 billion yuan — roughly $888 million — to fund future chip generations. Tencent owns about 20 percent of Enflame and supplies the bulk of its revenue, making the listing a test of how far Chinese buyers will bankroll a homegrown answer to NVIDIA, which still holds around 70 percent of the domestic market. Enflame is the last of China's 'four little dragons' of AI chips to head public.

Jun 12, 2026Top AI Stories

Nvidia opens orders for its Vera AI CPU to Chinese customers

Nvidia told Chinese cloud customers that its new Vera data-center CPU is now open for orders and could ship as early as August, according to Reuters. The pitch is a workaround: after US export curbs choked Nvidia's GPU sales in China, the company is leaning on standalone CPUs — which face looser restrictions — to defend the market. Nvidia's finance chief has said Vera CPU revenue could approach $20 billion in the coming fiscal year.

Jun 11, 2026Top AI Stories

AWS launches Graviton5, an Arm chip built for agentic AI workloads

Amazon made its Graviton5 processor generally available, pitching it as purpose-built for the real-time reasoning, code generation, and multi-step orchestration that AI agents demand. AWS says the chip delivers up to 25 percent better compute performance than the prior generation, with 192 cores per processor and 33 percent lower inter-core latency. Uber and Snowflake are among the early adopters, joining more than 120,000 customers already running on Graviton. The launch underscores how the cloud giants are racing to design their own silicon and cut their dependence on Nvidia for the inference work that agents run around the clock.

Jun 11, 2026Top AI Stories

Nvidia and Hyundai expand their alliance to put physical AI on the factory floor

Jensen Huang and Hyundai chair Chung Euisun used a meeting in Seoul to widen their partnership from research prototypes toward factory-ready robotics, spanning mobility, manufacturing, and autonomous driving. Hyundai will stand up an AI supercomputer built on 50,000 Nvidia Blackwell GPUs to train models for self-driving and smart factories, and the two companies will work with the Korean government on new AI technology centers. Huang called Hyundai one of the best robotics companies in the world. The deal is one of the largest national commitments yet to physical AI — the effort to move machine intelligence off the screen and into machines that move.

Jun 9, 2026Top AI Stories

China drafts a $295 billion plan to build nationwide AI data centers and cut out Nvidia

Bloomberg reports that Beijing is preparing to spend roughly 2 trillion yuan, about $295 billion, over five years on a network of interconnected AI computing hubs run mostly by state firms like China Mobile and China Telecom. The blueprint, drafted by agencies including the National Development and Reform Commission, would source at least 80 percent of the hardware, including AI chips, from domestic suppliers such as Huawei — effectively designing Nvidia and AMD out of the country's largest buildout. It is the clearest signal yet that China intends to win the AI compute race on homegrown silicon.

Jun 2, 2026Top AI Stories

Intel details Crescent Island, a memory-heavy AI inference GPU to undercut Nvidia

At Computex, Intel detailed Crescent Island, a data-center GPU built on its Arc Xe3P architecture and aimed at AI inference. Partner cards can carry up to 480 gigabytes of LPDDR5X memory — more than Nvidia's and AMD's flagships — while skipping the scarce, expensive high-bandwidth memory those chips rely on, which Intel says makes the card cheaper to produce and run on a 350-watt air cooler. Intel is targeting a second-half 2026 launch as it tries to chip away at Nvidia's data-center lead.

May 31, 2026Top AI Stories

Xcena raises $135 million at a $570 million valuation to put AI compute inside DRAM

Korean chip startup Xcena closed a $135 million Series B at a $570 million valuation on Friday, co-led by Atinum and IMM Investment with Corstone Asia, SBI Investment, and Mirae Asset Capital joining. Its MX1 chip embeds compute directly inside DRAM modules over the CXL interconnect, aiming to cut the round-trips between GPU and memory that increasingly bottleneck inference. The founders are former Samsung and SK Hynix veterans, and Xcena says the design could let workloads that currently need 10 servers fit on one — though it has not named a launch customer.

May 30, 2026Top AI Stories

Groq raises 650 million dollars to fund inference cloud after Nvidia talent deal

Groq is raising 650 million dollars to refocus the company on its inference neocloud — the on-demand cloud platform powered by Groq's own AI chips — after a December 2025 arrangement saw Nvidia pay 20 billion dollars for senior Groq engineering talent and a hardware license. Existing investors are leading the round, with Disruptive and Infinitium committed to fill any unsubscribed shares. Adam Winter is now interim CEO and Matt Eng interim CFO. The company argues inference is a much bigger market than training right now, and the wedge for the smaller team that remains.

May 30, 2026Top AI Stories

Mistral details industrial AI partnerships with Airbus, BMW, and ASML at Paris summit

At the AI Now Summit, Mistral named Airbus (commercial aircraft, helicopters, defense, and space), BMW Group (the carmaker's Large Industry Model initiative for multimodal reasoning on engineering data), and ASML (semiconductor parts design and surrogate modeling) as production customers for its new Mistral Physics AI line. The lab also confirmed a new 10 megawatt data center in Les Ulis south of Paris, dedicated to inference and scheduled to open in the third quarter of 2026, alongside an existing 40 megawatt Paris site and a planned Swedish facility — Mistral's answer to the inference supply-chain risk that has loomed over European AI customers.

May 28, 2026Top AI Stories

Nvidia pledges $150 billion a year for Taiwan, calling it the 'epicenter' of the AI revolution

At the groundbreaking for a new Taipei headquarters, Jensen Huang announced that Nvidia will spend $150 billion a year in Taiwan, up from $10 to $15 billion annually four to five years ago. The headquarters will employ 4,000 people and target a 2030 opening, anchoring Nvidia closer to TSMC, which fabricates its chips, and Foxconn, which assembles them into server racks. The pledge lands as Washington has spent two years pushing chipmakers to build US capacity, and it cements Taiwan rather than the United States as the structural center of advanced AI manufacturing for at least the rest of the decade.

May 28, 2026Top AI Stories

Mistral acquires Emmi AI to launch Physics AI for engineering, with Airbus, ASML, Safran, and Siemens Energy on board

Mistral acquired Vienna-based Emmi AI and used the deal to launch Physics AI, a class of data-driven models that learn from physics solver outputs and predict the behavior of physical systems in seconds on a single GPU, replacing simulations that traditionally take hours or weeks. Mistral named Airbus, ASML, Safran, and Siemens Energy as launch partners, targeting aerospace, semiconductors, energy, and industrial equipment. The offering pitches three concrete use cases: exploring thousands of design variants for new products, optimizing factory tooling, and running real-time digital twins on live sensor data — engineering acceleration with named industrial deployments rather than future-tense lab promises.

May 26, 2026Top AI Stories

Ben Thompson reads Nvidia's new reporting taxonomy as a commoditization fight

In this week's Stratechery analysis, Ben Thompson argues that Nvidia's revised reporting taxonomy — splitting hyperscaler revenue from everyone else — reveals the contours of an emerging commoditization fight at the top of the stack. Nvidia is "fighting commoditization" with its largest customers, where hyperscalers increasingly design their own silicon, while it "runs the whole stack" for AI clouds, sovereigns, and enterprises. The reframe lands one week after Nvidia's record $81.6 billion quarter on May 20, when hyperscaler revenue held at roughly half of the $75.2 billion data-center total.

May 25, 2026Top AI Stories

NVIDIA CEO Jensen Huang says NVIDIA has 'largely conceded' China's AI chip market to Huawei as the company's share falls toward zero

At a press event in Taipei this week, NVIDIA CEO Jensen Huang told reporters the company has *"largely conceded"* China's AI accelerator market to Huawei, with NVIDIA's share now near zero after the US H200 export-clearance stalemate dragged into a second month. Huawei expects roughly $12 billion in 2026 revenue from its Ascend line — up from $7.5 billion in 2025 — on orders already placed by Alibaba, ByteDance, and Tencent, all of which deployed DeepSeek V4 services within hours of the model's Ascend-optimized release in April. Huang said he still expects Beijing to eventually allow H200 imports, but for now the homegrown stack is shipping while NVIDIA's clearance letters sit in customs.

May 22, 2026Top AI Stories

US Commerce takes equity in 9 quantum firms, awarding $2 billion led by IBM at $1 billion

The Department of Commerce signed letters of intent on Thursday to provide $2.013 billion in federal incentives to nine quantum companies — IBM alone receives $1 billion, GlobalFoundries gets $375 million, and seven computing firms (D-Wave, Rigetti, Quantinuum, PsiQuantum, Atom Computing, Infleqtion, and Diraq) get between $38 million and $100 million each. Unlike traditional CHIPS Act grants, the department takes a **minority, non-controlling equity stake** in each company "to enhance the return for the U.S. taxpayer," mirroring the structure of last year's Intel deal. Secretary Howard Lutnick framed the package as building "thousands of high-paying American jobs while advancing American quantum capabilities."

May 21, 2026Top AI Stories

Nvidia posts record $81.6 billion quarter and names Vera CPU a toolIds: 00 billion agentic market

Nvidia reported $81.6 billion in revenue for the quarter ending April 26, a 20% sequential jump, with data-center revenue at a record $75.2 billion. The earnings disclosure also surfaced $43 billion in non-marketable startup equity — nearly double the prior quarter — including a previously-undisclosed []0 billion commitment to OpenAI. On the call, Jensen Huang positioned the new Vera CPU as a "brand new toolIds: 00 billion total addressable market" built for autonomous-acting agents.

May 21, 2026Top AI Stories

Anthropic signs $1.25 billion-a-month, three-year compute deal with rival xAI

Per SpaceX's S-1 filing, Anthropic will pay xAI $1.25 billion per month for 300 megawatts of capacity at the Colossus 1 data center in Memphis, running through May 2029 — a contract worth over $40 billion. xAI burned $6.4 billion last year and Grok usage has dropped sharply, leaving the lab with excess capacity it is now monetizing by selling to a direct competitor. The arrangement underscores how blurred the line between rival AI labs and their cloud-supplier parents has become.

May 20, 2026Top AI Stories

Mistral AI acquires Austria's Emmi AI for industrial physics modeling

Mistral closed an acquisition of Linz-based Emmi AI, an Austrian engineering startup building physics-based simulation models for aerospace, automotive, semiconductors, and energy. Roughly 30 researchers and engineers join Mistral's team, and Linz becomes an official Mistral office. The combined pitch: real-time simulations and digital twins layered onto Mistral's platform to court industrial buyers in high-stakes manufacturing sectors. Terms were not disclosed; the deal closes in May 2026.

May 15, 2026Top AI Stories

Cerebras debuts at $66 billion as IPO pops 108%, with OpenAI named top customer

Cerebras Systems priced its IPO at $185 per share — above its $115 to $160 range — and raised roughly $5.5 billion before the stock more than doubled on debut Thursday, closing at $311 for a $66 billion market cap. The S-1 names OpenAI, Group 42, Saudi Arabia's MBZUAI, and Amazon Web Services as top customers, and Cerebras swung to profitability on $510 million of 2025 revenue. The pop validates the thesis that frontier labs will pay a premium for Nvidia alternatives if inference economics work — and gives Cerebras a public-market currency to chase Nvidia's data-center share more aggressively.

May 13, 2026Top AI Stories

Ben Thompson: AI is repeating the 1970s mainframe playbook, top-down and worker-replacing

In this week's Stratechery, Ben Thompson argues that OpenAI's and Anthropic's new "deployment company" structures (engineers embedded in enterprises to ship AI systems) mirror the 1970s mainframe wave: executive-mandated automation that eliminates jobs rather than employee-driven productivity tools. He pairs this with an Apple–Intel angle — TSMC's AI-chip prioritization is forcing Apple toward Intel as a secondary fab, giving Intel "the single most important thing it needs" to compete: a marquee customer eager to reduce TSMC dependency.

May 11, 2026Top AI Stories

Ben Thompson: AI compute is splitting into training, answer inference, and agentic inference

In this week's Stratechery analysis, Ben Thompson argues that AI compute is bifurcating into three distinct workload categories that need fundamentally different hardware. Training keeps high-bandwidth GPUs (NVIDIA's lock-in); answer inference rewards token speed (Cerebras's WSE-3 packs 44 gigabytes of on-chip SRAM at 21 petabytes per second of bandwidth versus NVIDIA H100's 80 gigabytes of HBM at 3.35 terabytes per second); agentic inference, where humans aren't in the loop, mostly cares about memory capacity and cost-per-token at scale. Thompson treats Cerebras's revised IPO pricing of $150 to $160 per share (up from $115 to $125) and Anthropic's lease of 220,000 NVIDIA GPUs at SpaceX's Colossus 1 (the 300-megawatt Memphis data center) as concrete evidence that buyers are now sorting their compute spend by workload category rather than picking a single vendor.

May 10, 2026Top AI Stories

Nvidia commits over $40 billion to equity AI deals in early 2026, led by $30 billion OpenAI bet

Nvidia has now committed more than $40 billion to equity investments in AI companies in 2026, including a single $30 billion stake in OpenAI, up to $3.2 billion in glassmaker Corning, and up to $2.1 billion in data-center operator IREN. Wedbush analyst Matthew Bryson described the pattern as "squarely circular" — money cycling between chip vendor, model customer, and infrastructure provider. The chipmaker has also closed roughly two dozen private startup rounds plus 67 venture deals across 2025, intensifying scrutiny of how concentrated the AI capital stack has become around a single supplier.

May 10, 2026Top AI Stories

NVIDIA Nemotron Elastic packs three nested reasoning models in a single checkpoint

NVIDIA Research published Nemotron Elastic, a post-training method that embeds 30, 23, and 12 billion-parameter nested reasoning models inside a single 30 billion-parameter parent — extractable via zero-shot slicing without further fine-tuning. The recipe achieves a 360-times token reduction over pretraining from scratch, and the 30 billion checkpoint compresses to 18.7 gigabytes under NVFP4 quantization. The 23-to-30 billion configuration advances the accuracy-and-latency Pareto frontier with up to 16 percent higher accuracy and 1.9-times lower latency than the default Nemotron Nano v3 budget control. All three precision variants (BF16, FP8, NVFP4) are available on Hugging Face under nvidia/NVIDIA-Nemotron-Labs-3-Elastic-30B-A3B.

May 7, 2026Top AI Stories

Zyphra releases ZAYA1-8B — 8 billion parameter mixture-of-experts model matches DeepSeek-R1 on math, trained entirely on AMD

San Francisco lab Zyphra released ZAYA1-8B, an 8.4 billion-parameter mixture-of-experts model with **only 760 million active parameters at inference. On math benchmarks the model matches DeepSeek-R1 (89.1 on AIME 2026, 71.6 on HMMT)** and stays competitive with Claude Sonnet 4.5 on reasoning, despite running roughly an order of magnitude smaller. Most notably, the model was trained on a **1,024-node AMD Instinct MI300X cluster with AMD Pensando Pollara networking** — the first frontier-quality math model trained outside the NVIDIA CUDA stack. Weights are on Hugging Face under Apache 2.0.

May 5, 2026Top AI Stories

Cerebras files largest US tech IPO of 2026 with OpenAI as top customer

Cerebras Systems filed to sell 28 million shares priced between $115 and $125 per share, targeting $3.5 billion in proceeds at a $26.6 billion valuation — the largest US tech IPO of 2026 so far. OpenAI is one of the chipmaker's largest customers under a multi-year contract worth more than $10 billion signed in January, and holds a $1 billion secured loan plus warrants for over 33 million shares, potentially making OpenAI a major shareholder post-listing. The offering puts Cerebras' Wafer-Scale Engine 3 against Nvidia in a public-market test of GPU pricing power and frontier-lab compute lock-in.

May 4, 2026Top AI Stories

Intel and AMD release joint ACE matrix-AI extension for x86 CPUs

The x86 Ecosystem Advisory Group has published the **AI Compute Extensions** whitepaper, jointly authored by AMD and Intel engineers. ACE adds two-dimensional tile registers and outer-product matrix-multiply instructions on top of AVX10, with a claimed 16-times compute-density improvement on matrix workloads. Unlike Intel's Xeon-only AMX, ACE targets both client and server silicon, and software enablement is already in flight for PyTorch, NumPy, and TensorFlow. Neither vendor has yet named which CPU generation will ship the new instructions.