Very excited to announce HorizonMath with @erikyw26 and collaborators!
How can we measure AI progress on mathematical discovery? Turns out there are several classes of problems where discovery is hard but verification is easy. We develop a benchmark with 101 such problems and test