Free to read. Sign up to save your progress and take knowledge-check quizzes.

Sign up free
5 min read·Updated May 7, 2026

Physical Intelligence pi-0

Physical Intelligence logoBy Physical Intelligence

pi-0 is Physical Intelligence's foundation model for general-purpose robotics — a single AI model that controls 7+ different robot types across 68+ manipulation tasks, from folding laundry to assembling boxes in a real factory.

Listen to this lesson

Free preview · first 0:30
0:00 / 0:30

Audio & video lessons are paid features

Plus unlocks audio streaming. Pro adds downloadable audio, video, certificates, and more.

Plus adds:
  • Audio streaming
  • Downloadable PDFs
  • All AI Playbooks
  • Personalized content
Pro also adds:
  • Certificates of completion
  • Audio MP3 downloads
  • Video lessonssoon
  • & More…soon

Watch this lesson

Video coming soon

Learning Objectives

  • Understand what pi-0 is and how a foundation model approach applies to robotics
  • Evaluate the progression from pi-0 to pi-0.5 and pi-0.6 with RECAP
  • Assess Physical Intelligence's competitive position in the emerging robotics AI market

What Is pi-0?

pi-0 (pronounced "pie zero") is a Vision-Language-Action (VLA) foundation model for robotics, built by Physical Intelligence. Unlike traditional robot software that is custom-programmed for each robot and each task, pi-0 is a single AI model that can control 7+ different robot types across 68+ manipulation tasks — a breakthrough in robot generalization.

Think of it like a foundation model for the physical world: just as GPT can generate text for any topic, pi-0 can generate robot actions for any manipulation task. Give it a camera feed, a natural language instruction like "fold the laundry," and it figures out the motor commands to make it happen — across different robot arms, grippers, and environments.

💡Key Concept

Vision-Language-Action (VLA) Model: A neural network that takes in visual input (camera images), language input (instructions like "pick up the red cup"), and outputs physical actions (robot motor commands). This is the robotics equivalent of a multimodal language model — but instead of generating text, it generates movements in the real world.

Model Evolution

VersionReleaseKey Breakthrough
pi-0October 2024First generalist VLA model; controls 7 robot types across 68+ tasks
pi-0.5April 2025Open-world generalization — performs tasks in entirely unseen environments
pi-0.6 with RECAPNovember 2025Reinforcement learning from corrections; doubled throughput; halved failure rates

pi-0.5: Open-World Generalization

The original pi-0 worked in environments similar to its training data. pi-0.5 broke this barrier — it can perform tasks in entirely unseen environments (like cleaning a kitchen in a new home not in its training data). It turns vague commands like "clean the bedroom" into multi-step action sequences: open drawers, pick up objects, place them correctly.

pi-0.6 with RECAP: Learning from Experience

The latest version uses RECAP (Reinforcement Learning with Experience and Corrections via Advantage-conditioned Policies) — a training loop where:

  1. The robot watches human demonstrations
  2. Humans correct robot mistakes in real-time
  3. The robot practices autonomously thousands of times, self-scoring its performance

Results: doubled throughput on difficult tasks and halved failure rates. Demonstrated making espresso drinks for 18 hours straight, folding 50 novel laundry items, and assembling 59 boxes in a real chocolate factory.

Real-World Demonstrations

TaskDetails
Laundry folding50 novel items the robot had never seen before
Espresso making18-hour continuous operation without human intervention
Box assembly59 boxes assembled in a real chocolate factory production line
Table bussingClearing and organizing restaurant-style table settings
Grocery baggingPacking groceries into bags from a conveyor
Object retrievalFinding and bringing specific objects from cluttered environments

Open Source

Physical Intelligence open-sourced the original pi-0 model weights and code via the openpi GitHub repository — allowing researchers and developers to build on the model. This is significant: most robotics foundation models from large companies remain proprietary.

pi-0 vs. Robotics Competitors

CompanyApproachKey Difference
Physical Intelligence (pi-0)Software-first foundation model for any robot bodyHardware-agnostic; one model controls 7+ robot types; open-sourced
Genesis AI (GENE-26.5)Full-stack robotics: foundation model + custom human-anatomy hands + sensor data-collection gloveVertically integrated; bets that fine-grained dexterity needs all three layers designed together; $105 million seed (May 2026)
Figure AI (Figure 03)Humanoid robot for manufacturingHardware + software vertical; $2.6 billion valuation; narrower manufacturing focus
Tesla OptimusHumanoid robot leveraging Tesla's manufacturing scaleGen 3 with 50 actuators per hand; factory deployment Q2-Q3 2026
Google DeepMind (RT-2/RT-X)Robotics research foundation modelsTop research lab; less product-focused than PI
1X Technologies (NEO)Humanoid robots for home and workBacked by OpenAI; narrower embodiment focus

Physical Intelligence's differentiator: Hardware-agnostic — pi-0 works across any robot platform, not just one company's hardware. This positions PI as a potential "operating system for robots" rather than a robot manufacturer. The contrast with Genesis AI is the clearest in the category: Physical Intelligence is betting the foundation model is the durable layer and that hardware will commoditize underneath; Genesis AI is betting that fine-grained dexterity requires the foundation model, custom anatomy-faithful hardware, and a sensor-laden data collection glove all designed and trained together. Both bets can be right for different deployment scenarios.

Company Details

DetailInfo
CompanyPhysical Intelligence (PI)
Founded2024
CEOKarol Hausman
HeadquartersSan Francisco, California
Employees~191
Valuation$5.6 billion (November 2025 Series B)
Total Raised~$1.1 billion (Seed $70 million + Series A $400 million + Series B $600 million)
Key InvestorsCapitalG (Alphabet); Jeff Bezos; Lux Capital; Thrive Capital; Sequoia; NVIDIA; OpenAI
Open Sourcepi-0 weights and code on GitHub (openpi)
Websitephysicalintelligence.company

Strengths

  • Generalist robot AI — one model controls 7+ robot types across 68+ tasks; the broadest robot generalization demonstrated
  • Real factory deployment — 59 boxes assembled in a real chocolate factory; 18-hour continuous espresso operation
  • Open-sourced — pi-0 weights available on GitHub for researchers and developers
  • Hardware-agnostic — works across any robot platform, not locked to one manufacturer
  • Elite investor backing — Jeff Bezos, Alphabet, Sequoia, NVIDIA, and OpenAI at $5.6 billion valuation

Limitations and Considerations

  • Pre-commercial — pi-0 is a research breakthrough, not yet a widely deployed commercial product
  • Manipulation only — focused on manipulation tasks (picking, placing, folding); does not address locomotion, navigation, or full humanoid control
  • Requires real-world training data — each new task requires human demonstrations and correction sessions
  • Hardware costs — deploying physical robots is expensive regardless of how good the software is
  • Early stage — Physical Intelligence was founded in 2024; the company and technology are very young

Key Takeaways

  • pi-0 is a foundation model for robotics — a single AI that controls 7+ robot types across 68+ manipulation tasks, from laundry folding to factory box assembly
  • pi-0.6 with RECAP (November 2025) doubled throughput and halved failure rates through reinforcement learning from human corrections
  • Hardware-agnostic approach positions Physical Intelligence as a potential "operating system for robots" — open-sourced via the openpi GitHub repository
  • $5.6 billion valuation with $1.1 billion raised from Bezos, Alphabet, Sequoia, NVIDIA, and OpenAI validates the vision of general-purpose robot AI

Save your progress & take the quiz

Sign up free to bookmark lessons, track which modules you've completed, and lock in what you learned with a quick knowledge-check quiz at the end of each lesson.

🧭Recommended for you