Free to read. Sign up to save your progress and take knowledge-check quizzes.

Sign up free
6 min read·Updated July 4, 2026

Leanstral 1.5 is Mistral AI's open Apache-2.0 model for formal mathematical proof in Lean 4 — a mixture-of-experts design with 119 billion total and 6 billion active parameters. It scores a perfect 100 percent on the miniF2F benchmark, solves 587 of 672 PutnamBench problems, and during testing flagged five previously unknown bugs across open-source repositories, including a critical integer-overflow flaw. It is a specialist verification tool, not a general chatbot.

Listen to this lesson

Free preview · first 0:30
0:00 / 0:30

Audio & video lessons are paid features

Plus unlocks audio streaming. Pro adds downloadable audio, video, certificates, and more.

Plus adds:
  • Audio streaming
  • Downloadable PDFs
  • All AI Playbooks
  • Personalized content
Pro also adds:
  • Certificates of completion
  • Audio MP3 downloads
  • Video lessonssoon
  • & More…soon

Watch this lesson

AI Pro Playbook video — coming soon

Learning Objectives

  • Understand what Leanstral 1.5 is and why formal theorem proving is a distinct, high-value niche within AI
  • Identify its core specs: an open Apache-2.0 mixture-of-experts model tuned for Lean 4 proof engineering
  • Evaluate when a formal-verification model like Leanstral is the right tool versus a general-purpose coding model

What Is Leanstral 1.5?

Leanstral 1.5 is an open model from Mistral AI, the Paris-based lab, built for one narrow but demanding job: writing and checking formal mathematical proofs in Lean 4, the proof-assistant language used by mathematicians and verification engineers. It is not a general chatbot — it is a specialist that turns mathematical claims and code correctness into machine-checkable proofs.

Under the hood, Leanstral 1.5 uses a mixture-of-experts (MoE) architecture with 119 billion total parameters but only 6 billion active per forward pass, which keeps inference cost low relative to its capability. Mistral trained it by combining supervised fine-tuning with reinforcement learning, and it is released under the permissive Apache 2.0 license — meaning the weights can be downloaded, self-hosted, and used commercially without restriction.

The reason formal verification matters: unlike a normal language model that produces plausible-sounding answers, a Lean proof either checks or it does not. That makes the output verifiable by construction — a rare property in AI, and the whole point of the formal-methods field.

💡Key Concept

Formal theorem proving: Writing mathematical arguments in a language a computer can mechanically verify, step by step, with no gaps. If the proof checks, the result is guaranteed correct. Tools like Lean 4 are used both to verify advanced mathematics and to prove that software behaves exactly as specified.

Tip

Try Leanstral 1.5: Available with free API access on La Plateforme, as open weights on Hugging Face, and integrated into Mistral Vibe. Best paired with a Lean 4 development environment.

Benchmark Performance

Leanstral 1.5 posts state-of-the-art results across the standard formal-math benchmarks. On the widely used miniF2F benchmark it reaches a perfect 100 percent, and it solves 587 of 672 problems on PutnamBench — a suite drawn from the Putnam mathematics competition. On graduate-level algebra it scores 87 percent on FATE-H, and on the far harder PhD-level FATE-X it reaches 34 percent. The model also shows strong test-time scaling, continuing to improve as it is given more room to reason, up to 4 million tokens.

BenchmarkScoreWhat it measures
miniF2F100 percentStandard formal-math proof benchmark
PutnamBench587 of 672 solvedPutnam-competition-level problems
FATE-H87 percentGraduate-level algebra
FATE-X34 percentPhD-level algebra

A perfect miniF2F score should be read with the usual caution — benchmarks saturate, and formal-math suites are narrower than open-ended reasoning — but the PutnamBench and FATE-X numbers show genuine strength on problems that remain hard even for specialists.

Finding Real Software Bugs

The more practical result is that formal methods are not just for pure mathematics. During testing, Leanstral 1.5 flagged five previously unknown bugs across 57 open-source repositories, including a critical integer-overflow flaw in a decoding library. Because the model reasons about code the way it reasons about a proof — asking whether a stated property actually holds — it can surface edge cases that ordinary testing and code review miss.

This points at a broader thesis for the field: as formal-verification models get cheaper and more capable, proving software correct becomes a realistic path to safer systems, not just an academic exercise reserved for aerospace and cryptography.

Strengths

  • Open and permissive: Apache 2.0 license allows self-hosting, fine-tuning, and commercial use with no restrictions
  • Efficient: Mixture-of-experts design activates only 6 billion of 119 billion parameters per forward pass
  • State-of-the-art on formal math: Perfect miniF2F, strong PutnamBench and FATE results
  • Practical bug discovery: Found real, previously unknown flaws in open-source code
  • Verifiable output: Lean proofs either check or they do not — no hallucinated correctness

Limitations & Considerations

  • Narrow by design: Leanstral is a formal-proof specialist, not a general assistant — it is the wrong tool for writing prose, answering open questions, or everyday coding
  • Requires Lean expertise: Getting value out of it assumes familiarity with Lean 4 and the formal-methods workflow
  • Benchmark saturation: A perfect miniF2F score reflects a maturing benchmark as much as raw capability — judge it on the harder FATE-X and real-world bug results
  • Emerging field: Formal-verification-with-AI is early; tooling, integrations, and best practices are still forming

Best Use Cases

TaskWhy Leanstral 1.5
Formalizing mathematics in Lean 4Purpose-built for proof engineering with state-of-the-art benchmark results
Verifying software correctnessReasons about stated properties to surface edge-case bugs testing misses
Research in automated reasoningOpen weights and Apache 2.0 license give full freedom to study and extend
Teaching formal methodsA capable, free proof assistant for coursework and self-study

When to choose alternatives:

  • General-purpose coding → a mainstream coding model or agent, not a proof specialist
  • Open-ended reasoning or writing → a frontier chat model
  • No Lean 4 in your workflow → the formal-methods entry cost may outweigh the benefit today

Getting Started

  1. Set up Lean 4 — install the Lean toolchain and an editor extension so proofs can be checked locally
  2. Get API access — sign up at console.mistral.ai for free API access, or download the open weights from Hugging Face
  3. Start small — hand the model a known theorem and confirm the generated Lean proof actually checks
  4. Try verification — point it at a small function with a stated property to see whether it can prove or break the claim
  5. Scale the reasoning budget — for harder targets, give the model more test-time tokens, where it continues to improve

Key Takeaways

  • Leanstral 1.5 is Mistral AI's open Apache-2.0 model for formal mathematical proof in Lean 4 — a specialist verification tool, not a general chatbot
  • Its mixture-of-experts design carries 119 billion total parameters but activates only 6 billion per forward pass, keeping it efficient
  • It reaches a perfect 100 percent on miniF2F, solves 587 of 672 PutnamBench problems, and scores 34 percent on PhD-level FATE-X
  • During testing it found five previously unknown bugs across 57 open-source repositories, including a critical integer-overflow flaw — evidence that formal verification is a practical route to safer software
  • Best for mathematicians, verification engineers, and researchers working in Lean 4; the wrong tool for general coding or writing

Save your progress & take the quiz

Sign up free to bookmark lessons, track which modules you've completed, and lock in what you learned with a quick knowledge-check quiz at the end of each lesson.

🧭Recommended for you