The second batch of “First Proof” problems is meant to evaluate AI’s usefulness for research-level math. The best model got six or seven of the ten questions right.
For all of the recent strides we’ve made in the math world, like a supercomputer finally solving the sum of three cubes problem that puzzled mathematicians for 65 years, we’re forever crunching ...
Artificial intelligence has moved from checking homework to attacking problems that professional mathematicians once treated as out of reach. Systems tuned for symbolic reasoning are now cracking long ...