Name: Scale AI
Availability: InStock
Author: Scale AI

Learning Objectives

Understand what Scale AI does and why data infrastructure matters for AI development
Evaluate the impact of Meta's $14.3 billion investment on Scale AI's independence
Compare Scale AI's products (Data Engine, SEAL, GenAI Platform) and their roles in the AI ecosystem

What Is Scale AI?

Scale AI is an AI data infrastructure company that helps organizations build and deploy AI systems. Founded in 2016, Scale started as a data labeling service — providing the high-quality human annotations that AI models need for training. It has since expanded into model evaluation, enterprise AI platforms, and government defense contracts.

Scale plays a unique role in the AI ecosystem: while companies like OpenAI and Anthropic build the models, Scale provides the data infrastructure that makes those models accurate. If training data is the fuel of AI, Scale AI is one of the largest fuel refineries.

💡Key Concept

Data Annotation: The process of labeling raw data (images, text, audio, video) so AI models can learn from it. For example, labeling objects in photos for self-driving cars, or rating AI responses for RLHF (Reinforcement Learning from Human Feedback). Scale AI operates annotation at massive scale through its Remotasks and Outlier platforms with over 240,000 workers.

The Meta Deal

In June 2025, Meta invested $14.3 billion for a 49% non-voting stake in Scale AI, valuing the company at $29 billion. This triggered major changes:

Founder departure: CEO Alexandr Wang — who became the youngest self-made billionaire at age 28 — left Scale to become Meta's Chief AI Officer, leading Meta Superintelligence Labs
New leadership: Jason Droege (former Uber Eats founder) became Interim CEO
Customer concerns: OpenAI, Google, and xAI reduced or paused engagements over data confidentiality concerns tied to Meta's ownership
Layoffs: 200 full-time employees (14% of workforce) and 500 contractors were cut in July 2025

⚠️Warning

Meta's 49% ownership has made Scale AI a "conflicted" vendor for other frontier AI labs. If your organization needs vendor neutrality for data labeling, alternatives like Labelbox or SuperAnnotate may be worth evaluating.

Core Products

Scale Data Engine

The flagship data annotation platform. Supports image, video, text, LiDAR, and geospatial data labeling. Powers the RLHF (Reinforcement Learning from Human Feedback) process for frontier models — the technique that makes chatbots helpful and safe.

SEAL Leaderboards

Public AI model benchmarks that have become an industry standard. Unlike corporate benchmarks (which companies cherry-pick), SEAL provides independent evaluation across multiple dimensions:

Tool	Best For
SEAL Coding	Code generation and debugging evaluation
SEAL MultiChallenge	Multi-turn conversation quality
SEAL AudioMultiChallenge	Voice and audio model evaluation
SEAL Showdown	Blind human evaluation ranking from real conversations
EnigmaEval	Advanced reasoning benchmarks
Agentic Tool Use	AI agent capability evaluation

Scale GenAI Platform

An enterprise platform for building custom AI applications — RAG pipelines, model fine-tuning, testing, and deployment. Designed for organizations that want to build AI products without managing the full ML infrastructure stack.

Scale Labs (March 2026)

An expanded research division focused on AI capabilities research, post-training evaluation, and enterprise deployment. Recent benchmarks include SWE-Atlas (software engineering) and Voice Showdown (voice AI).

Government and Defense

Scale has significant US government contracts:

$100 million Pentagon OTA for Top Secret/SCI AI deployment
$99.5 million Army R&D contract (August 2025)
Thunderforge — a DoD AI agent program (March 2025)
5-year Qatar government AI services deal (February 2025)

Pricing

Plan	Price	Features
Self-Serve	Pay-as-you-go (1,000 free labeling units)	Research and experimentation
Enterprise	~$93,000/year average (up to ~$400,000)	Production AI teams with volume needs

Self-ServePay-as-you-go (1,000 free labeling units)

Research and experimentation

Enterprise~$93,000/year average (up to ~$400,000)

Production AI teams with volume needs

Self-serve pricing starts at approximately 2 cents per image and 6 cents per annotation. Enterprise pricing is consumption-based with annual volume commitments.

Scale AI vs. Competitors

Company	Strength	Weakness
Scale AI	Largest by revenue ($2 billion); SEAL benchmarks; government contracts	Meta ownership creates conflict of interest for some clients
Labelbox	Strong UI for in-house ML teams; vendor neutral	Smaller scale; fewer enterprise features
Appen	1 million+ contributors in 180+ languages; unmatched language diversity	In serious decline (stock down 99% from 2020 peak)
Snorkel AI	Programmatic labeling reduces human labor needed	Different approach; less suitable for complex annotation

Company Details

Detail	Info
Founded	2016
Founder	Alexandr Wang (now Meta Chief AI Officer)
Current CEO	Jason Droege (Interim; formerly founded Uber Eats)
Headquarters	San Francisco, California
Employees	~1,200 full-time (post-layoffs)
Gig Workers	240,000+ via Remotasks and Outlier platforms
Valuation	$29 billion (June 2025; Meta deal)
Revenue (2025)	~$2 billion (130% year-over-year growth)
Enterprise Customers	400+ enterprise clients
Recognition	TIME 100 Most Influential Companies (2025)
Website	scale.com

Strengths

Industry standard benchmarks — SEAL leaderboards are the most respected independent AI evaluation suite
Massive scale — 240,000+ gig workers across Remotasks and Outlier; $2 billion in revenue
Government trust — $200+ million in US defense contracts; Top Secret/SCI clearance level
Full stack — data labeling, model evaluation, enterprise deployment, and research in one company
Revenue growth — 130% year-over-year growth reaching $2 billion in 2025

Limitations and Considerations

Meta conflict — 49% ownership has caused frontier labs (OpenAI, Google, xAI) to reduce engagement over data confidentiality concerns
Leadership transition — founder departed; interim CEO in place; organizational stability uncertain
Worker treatment concerns — the Remotasks/Outlier gig worker model has drawn criticism for payment issues and sudden market exits
Enterprise pricing opacity — consumption-based pricing makes costs difficult to predict
Not a model provider — Scale AI does not build or host AI models; it provides the data and evaluation infrastructure around them

Key Takeaways

Scale AI is the largest AI data infrastructure company ($2 billion revenue, $29 billion valuation), providing data annotation, model evaluation, and enterprise AI services
The SEAL leaderboards have become the industry standard for independent AI model evaluation — used to compare frontier models across coding, reasoning, and conversation quality
Meta's $14.3 billion investment in June 2025 triggered the founder's departure to Meta and raised conflict-of-interest concerns among Scale's other frontier lab customers
Significant US government presence with over $200 million in defense contracts, including Top Secret/SCI deployments

Scale AI

Audio & video lessons are paid features