MATH Benchmark Competition mathematics. Max score: 100. ModelScore DeepSeek R1 97.3 o3 96.7 o4 mini 96.7