Glossary · Term

P1-30B-A3B

← all terms

Definition

An open mixture-of-experts model with 30 billion total parameters but only about 3 billion active per token.

A 30B/3B-active sparse mixture-of-experts open-weight base model used as the backbone for SU-01's olympiad-math post-training.

Mentioned in 1 episode

  1. 048
    How a 30B Open Model Reached Olympiad Gold With the Right Recipe