FrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in AI