📍 San Francisco, California, United States·Est. 2023
Zyphra logo
Private Company

Zyphra

San Francisco AI lab specializing in efficient mixture-of-experts language models. Released ZAYA1-8B in May 2026, the first frontier-quality math model trained entirely on AMD Instinct MI300X GPUs.

Listen to this lesson

Free preview · first 0:30
0:00 / 0:30

Audio & video lessons are paid features

Plus unlocks audio streaming. Pro adds downloadable audio, video, certificates, and more.

Plus adds:
  • Audio streaming
  • Downloadable PDFs
  • All AI Playbooks
  • Personalized content
Pro also adds:
  • Certificates of completion
  • Audio MP3 downloads
  • Video lessonssoon
  • & More…soon

Watch this lesson

Video coming soon

Learn About Zyphra's AI Products

Create a free account to access in-depth lessons on each tool and model.

Start Learning Free

📋About Zyphra

Updated June 15, 2026

Zyphra is a San Francisco-based AI research lab focused on efficient mixture-of-experts (MoE) language models with custom attention mechanisms designed to preserve reasoning quality at small active-parameter budgets. The company's flagship release as of May 2026 is ZAYA1-8B — an open-weight MoE model with 8.4 billion total parameters and only 760 million active per inference token.

On May 6, 2026, Zyphra published ZAYA1-8B with two notable claims: first, the model matches DeepSeek-R1 on AIME 2026 (89.1) and HMMT (71.6) and stays competitive with Claude Sonnet 4.5 on broader reasoning despite the much smaller active-parameter footprint. Second, the model was trained entirely on a 1,024-node AMD Instinct MI300X cluster built with IBM, using AMD Pensando Pollara networking — making it the first public frontier-quality math model trained outside the NVIDIA CUDA stack. The training claim sits alongside Cerebras and Groq as evidence that the non-NVIDIA frontier-AI training market is maturing.

ZAYA1-8B weights ship under the Apache 2.0 license — the most permissive open-model license, with no restrictions on commercial or competitive use. Local deployment requires Zyphra's vLLM fork (the upstream vLLM does not yet have the MoE routing and specialized attention patches Zyphra needs); serverless deployment via Zyphra Cloud avoids the runtime switch.

🛠️Products & Tools (1)

ZAYA1-8BFreeFoundation Models

Open-weight 8.4 billion-parameter mixture-of-experts model from Zyphra with only 760 million active parameters at inference. Matches DeepSeek-R1 on AIME 2026 (89.1) and HMMT (71.6); competitive with Claude Sonnet 4.5 on reasoning. First frontier-quality math model trained entirely on AMD Instinct MI300X. Apache 2.0 license.