Nvidia's $40 billion AI equity spree + Nemotron Elastic checkpoint ships

Nvidia commits over $40 billion to AI equity deals in 2026 led by a $30 billion OpenAI bet; NVIDIA Research ships a single Nemotron checkpoint with 3 nested reasoning-model sizes. Plus 3 more stories.

Listen to this brief

Free preview · first 0:30

0:00 / 0:30

Audio & video are paid features

Plus unlocks audio streaming and PDF downloads. Pro adds offline MP3 downloads, video, certificates, and more.

Plus adds:

Audio streaming
Downloadable PDFs
All AI Playbooks
Personalized content

Pro also adds:

Certificates of completion
Audio MP3 downloads
Video lessonssoon
& More…soon

Watch this brief

AI Pro Playbook video — coming soon

Capital flows lead today: Nvidia's $40 billion equity-investment spree highlights how concentrated the AI stack has become around a single supplier, and a fresh NVIDIA Research recipe lets one 30 billion-parameter checkpoint serve three reasoning-model sizes via zero-shot slicing. Then a quietly consequential Google upgrade to Gemini API File Search for retrieval-augmented generation workloads, a Microsoft Research paper questioning whether frontier models can be trusted to edit documents on a user's behalf, and the EU's May 7 Omnibus deal postponing the most demanding parts of the AI Act by 12 to 24 months.

1
Nvidia commits over $40 billion to equity AI deals in early 2026, led by $30 billion OpenAI bet
Nvidia has now committed more than $40 billion to equity investments in AI companies in 2026, including a single $30 billion stake in OpenAI, up to $3.2 billion in glassmaker Corning, and up to $2.1 billion in data-center operator IREN. Wedbush analyst Matthew Bryson described the pattern as "squarely circular" — money cycling between chip vendor, model customer, and infrastructure provider. The chipmaker has also closed roughly two dozen private startup rounds plus 67 venture deals across 2025, intensifying scrutiny of how concentrated the AI capital stack has become around a single supplier.
NVIDIA OpenAI 🏭Semiconductor Manufacturing 🏭Cloud Infrastructure & Data Processing 📚Module 10 — Future of AI
2
NVIDIA Nemotron Elastic packs three nested reasoning models in a single checkpoint
NVIDIA Research published Nemotron Elastic, a post-training method that embeds 30, 23, and 12 billion-parameter nested reasoning models inside a single 30 billion-parameter parent — extractable via zero-shot slicing without further fine-tuning. The recipe achieves a 360-times token reduction over pretraining from scratch, and the 30 billion checkpoint compresses to 18.7 gigabytes under NVFP4 quantization. The 23-to-30 billion configuration advances the accuracy-and-latency Pareto frontier with up to 16 percent higher accuracy and 1.9-times lower latency than the default Nemotron Nano v3 budget control. All three precision variants (BF16, FP8, NVFP4) are available on Hugging Face under nvidia/NVIDIA-Nemotron-Labs-3-Elastic-30B-A3B.
NVIDIA Nemotron 🏭Semiconductor Manufacturing 📚Module 9 — AI Developer Tools
3
Google upgrades Gemini API File Search to multimodal with metadata filters and page-level citations
Google updated its Gemini API File Search tool on May 5 with three additions for retrieval-augmented generation (RAG) workflows: multimodal indexing of images and text together using the Gemini Embedding 2 model, custom key-value metadata labels (such as department: Legal) for query-time filtering, and page-level citations that link readers to the exact source location in indexed documents. The update targets both prototypes and production-scale RAG applications, with example use cases in creative agencies, scientific research, and engineering. No pricing change was announced.
Google DeepMind Gemini 📚Module 9 — AI Developer Tools
4
Microsoft paper: frontier models corrupt 25% of document content in long delegated workflows
Microsoft Research released DELEGATE-52, a benchmark simulating extended document-editing workflows across 52 professional domains including coding, crystallography, and music notation. Across 19 large language models tested, the strongest frontier systems — Gemini 3.1 Pro, Claude 4.6 Opus, and GPT 5.4 — corrupted an average of 25 percent of document content by the end of long sessions, with errors silently accumulating rather than failing loudly. Adding agentic tool use did not improve results, and degradation worsened with larger documents, longer interactions, and distractor files. Authors: Philippe Laban, Tobias Schnabel, and Jennifer Neville.
Microsoft Anthropic Google DeepMind OpenAI Claude Opus 4.6 Gemini 3.1 Pro 📚Module 8 — AI Agents 📚Module 9 — AI Developer Tools
5
EU postpones AI Act high-risk obligations to 2027 and 2028 in Omnibus deal
The Council of the EU and the European Parliament reached provisional agreement on May 7 to delay compliance deadlines for high-risk AI systems under Annex III to December 2, 2027, and for AI in regulated products under Annex I to August 2, 2028. The deal also bans so-called "nudifier" tools and AI-generated child sexual abuse material starting December 2026, while postponing watermarking and synthetic-media disclosure obligations to the same date. The simplification was driven by industry lobbying and a final push from Germany, France, and Italy ahead of the original August 2026 enforcement deadline.
🏭Government & Public Administration 📚Module 11 — Society & Education

Get Top AI Stories by email

The day's most important AI news — free, daily, unsubscribe anytime.

Spot a typo or have feedback?Share feedback

Sources

1.Artificial intelligence: Council and Parliament agree to simplify and streamline rules — Council of the EU · May 7, 2026
2.NVIDIA on Hugging Face — Nemotron-Labs-3-Elastic-30B-A3B — Hugging Face · May 9, 2026
3.NVIDIA AI Releases Star Elastic: One Checkpoint that Contains 30B, 23B, and 12B Reasoning Models with Zero-Shot Slicing — MarkTechPost · May 9, 2026
4.Gemini API File Search is now multimodal — Google · May 5, 2026
5.What the EU AI Omnibus Deal Changes for the AI Act and What Lies Ahead — TechPolicy.Press · May 8, 2026
6.LLMs Corrupt Your Documents When You Delegate — arXiv · April 17, 2026
7.Nvidia has already committed $40B to equity AI deals this year — TechCrunch · May 9, 2026
8.microsoft/DELEGATE52 — code and benchmark release — GitHub

Nvidia's $40 billion AI equity spree + Nemotron Elastic checkpoint ships

Audio & video are paid features

Nvidia commits over $40 billion to equity AI deals in early 2026, led by $30 billion OpenAI bet

NVIDIA Nemotron Elastic packs three nested reasoning models in a single checkpoint

Google upgrades Gemini API File Search to multimodal with metadata filters and page-level citations

Microsoft paper: frontier models corrupt 25% of document content in long delegated workflows

EU postpones AI Act high-risk obligations to 2027 and 2028 in Omnibus deal

Sources

Recent issues

California's AI labeling law + a DeepSeek attack campaign

OpenAI math results + EU AI Act enforcement

Anthropic's own models breached three real companies