FrontierMath: Benchmarking AI against advanced mathematical research
FrontierMath includes both carefully crafted challenge problems and open research problems that remain unsolved by mathematicians.
FrontierMath Tiers 1–4
A benchmark of several hundred unpublished, highly challenging mathematics problems. Difficulty Tiers 1-3 cover undergraduate through early postdoc level problems, while Tier 4 is research-level mathematics.
Open Problems
A collection of unsolved mathematics problems that have resisted serious attempts by professional mathematicians. AI solutions would meaningfully advance the state of human mathematical knowledge.
Feedback
Have a question? Noticed something wrong? Let us know.
FrontierMath
FrontierMath is an AI benchmark consisting of extremely challenging math problems, including open research problems that remain unsolved by mathematicians.