Last week, OpenAI shocked the mathematical community by revealing that one of its internal artificial intelligence (AI) ...
In mid-May, OpenAI announced that an internal AI model had disproved the Erdős unit distance conjecture, a famous problem in discrete geometry that had stumped human mathematicians for the last 80 ...
Google DeepMind, Google LLC’s artificial intelligence research unit, today unveiled two new AI models capable of the advanced mathematical reasoning needed to solve complex math problems, which ...
The result is correct but challenges core norms of mathematics: checking proofs, crediting ideas and keeping research open to ...
OpenAI claims its reasoning model disproved a geometry conjecture unsolved since 1946 — and this time, the mathematicians who exposed its last embarrassing claim are backing it up.
An independent test evaluated four artificial intelligence models on 10 previously unpublished mathematical problems: the ETH Zurich model solved six problems, while other publicly available systems ...
Mathematician Will Sawin discusses his experience reviewing and refining a mathematical proof devised by OpenAI's internal ...
The GSMM Camp is a weeklong workshop that builds interdisciplinary problem-solving skills for graduate and advanced undergraduate students. Participants work in teams on mathematically rich problems ...
Researchers have introduced Light-R1-32B, a new open-source AI model optimized to solve advanced math problems. It is now available on Hugging Face under a permissive Apache 2.0 license — free for ...
The second batch of “First Proof” problems is meant to evaluate AI’s usefulness for research-level math. The best model got ...
On Friday, research organization Epoch AI released FrontierMath, a new mathematics benchmark that has been turning heads in the AI world because it contains hundreds of expert-level problems that ...
Chinese AI lab DeepSeek has quietly updated Prover, its AI model designed to solve math-related proofs and theorems. According to the South China Morning Post, DeepSeek uploaded the latest version ...