Nvidia's $40 billion AI equity spree + Nemotron Elastic checkpoint ships
Nvidia commits over $40 billion to AI equity deals in 2026 led by a $30 billion OpenAI bet; NVIDIA Research ships a single Nemotron checkpoint with 3 nested reasoning-model sizes. Plus 3 more stories.
Listen to this brief
Audio & video are paid features
Plus unlocks audio streaming and PDF downloads. Pro adds offline MP3 downloads, video, certificates, and more.
- Audio streaming
- Downloadable PDFs
- All AI Playbooks
- Personalized content
- Certificates of completion
- Audio MP3 downloads
- Video lessonssoon
- & More…soon
Watch this brief
Capital flows lead today: Nvidia's $40 billion equity-investment spree highlights how concentrated the AI stack has become around a single supplier, and a fresh NVIDIA Research recipe lets one 30 billion-parameter checkpoint serve three reasoning-model sizes via zero-shot slicing. Then a quietly consequential Google upgrade to Gemini API File Search for retrieval-augmented generation workloads, a Microsoft Research paper questioning whether frontier models can be trusted to edit documents on a user's behalf, and the EU's May 7 Omnibus deal postponing the most demanding parts of the AI Act by 12 to 24 months.
- 1
Nvidia commits over $40 billion to equity AI deals in early 2026, led by $30 billion OpenAI bet
Nvidia has now committed more than $40 billion to equity investments in AI companies in 2026, including a single $30 billion stake in OpenAI, up to $3.2 billion in glassmaker Corning, and up to $2.1 billion in data-center operator IREN. Wedbush analyst Matthew Bryson described the pattern as "squarely circular" — money cycling between chip vendor, model customer, and infrastructure provider. The chipmaker has also closed roughly two dozen private startup rounds plus 67 venture deals across 2025, intensifying scrutiny of how concentrated the AI capital stack has become around a single supplier.
- 2
NVIDIA Nemotron Elastic packs three nested reasoning models in a single checkpoint
NVIDIA Research published Nemotron Elastic, a post-training method that embeds 30, 23, and 12 billion-parameter nested reasoning models inside a single 30 billion-parameter parent — extractable via zero-shot slicing without further fine-tuning. The recipe achieves a 360-times token reduction over pretraining from scratch, and the 30 billion checkpoint compresses to 18.7 gigabytes under NVFP4 quantization. The 23-to-30 billion configuration advances the accuracy-and-latency Pareto frontier with up to 16 percent higher accuracy and 1.9-times lower latency than the default Nemotron Nano v3 budget control. All three precision variants (BF16, FP8, NVFP4) are available on Hugging Face under
nvidia/NVIDIA-Nemotron-Labs-3-Elastic-30B-A3B. - 3
Google upgrades Gemini API File Search to multimodal with metadata filters and page-level citations
Google updated its Gemini API File Search tool on May 5 with three additions for retrieval-augmented generation (RAG) workflows: multimodal indexing of images and text together using the Gemini Embedding 2 model, custom key-value metadata labels (such as
department: Legal) for query-time filtering, and page-level citations that link readers to the exact source location in indexed documents. The update targets both prototypes and production-scale RAG applications, with example use cases in creative agencies, scientific research, and engineering. No pricing change was announced. - 4
Microsoft paper: frontier models corrupt 25% of document content in long delegated workflows
Microsoft Research released DELEGATE-52, a benchmark simulating extended document-editing workflows across 52 professional domains including coding, crystallography, and music notation. Across 19 large language models tested, the strongest frontier systems — Gemini 3.1 Pro, Claude 4.6 Opus, and GPT 5.4 — corrupted an average of 25 percent of document content by the end of long sessions, with errors silently accumulating rather than failing loudly. Adding agentic tool use did not improve results, and degradation worsened with larger documents, longer interactions, and distractor files. Authors: Philippe Laban, Tobias Schnabel, and Jennifer Neville.
- 5
EU postpones AI Act high-risk obligations to 2027 and 2028 in Omnibus deal
The Council of the EU and the European Parliament reached provisional agreement on May 7 to delay compliance deadlines for high-risk AI systems under Annex III to December 2, 2027, and for AI in regulated products under Annex I to August 2, 2028. The deal also bans so-called "nudifier" tools and AI-generated child sexual abuse material starting December 2026, while postponing watermarking and synthetic-media disclosure obligations to the same date. The simplification was driven by industry lobbying and a final push from Germany, France, and Italy ahead of the original August 2026 enforcement deadline.
Get Top AI Stories by email
The day's most important AI news in your inbox — free. Email delivery is launching soon; opt in now and we'll save your spot.
Sources
- 1.NVIDIA AI Releases Star Elastic: One Checkpoint that Contains 30B, 23B, and 12B Reasoning Models with Zero-Shot Slicing — MarkTechPost · May 9, 2026
- 2.Artificial intelligence: Council and Parliament agree to simplify and streamline rules — Council of the EU · May 7, 2026
- 3.LLMs Corrupt Your Documents When You Delegate — arXiv · April 17, 2026
- 4.What the EU AI Omnibus Deal Changes for the AI Act and What Lies Ahead — TechPolicy.Press · May 8, 2026
- 5.NVIDIA on Hugging Face — Nemotron-Labs-3-Elastic-30B-A3B — Hugging Face · May 9, 2026
- 6.microsoft/DELEGATE52 — code and benchmark release — GitHub
- 7.Gemini API File Search is now multimodal — Google · May 5, 2026
- 8.Nvidia has already committed $40B to equity AI deals this year — TechCrunch · May 9, 2026
This brief was published on May 10, 2026. Cited URLs above point to third-party publishers and may move, paywall, or be retired over time. If a link no longer resolves, original article titles are preserved so you can recover them via search; the canonical web edition at aiproplaybook.com/top-ai-stories/2026-05-10 may carry updated source URLs.