OpenAI claims it solved an 80-year-old math problem — for real this time
OpenAI claims its new AI reasoning model has solved an 80-year-old math problem, specifically disproving a conjecture in geometry posed by Paul Erdős in 1946. This marks the first time AI has autonomously solved such a prominent open problem in mathematics, with implications for various scientific fields.
OpenAI announced that its new AI reasoning model has produced an original mathematical proof. This proof disproves a famous unsolved conjecture in geometry, initially posed by Paul Erdős in 1946. This achievement follows a previous, less successful claim by OpenAI regarding solving Erdős problems.
Mathematicians such as Noga Alon, Melanie Wood, and Thomas Bloom have endorsed OpenAI's latest claim. Bloom, who maintains the Erdős Problems website, had previously criticized a premature announcement from OpenAI's former VP Kevin Weil regarding similar claims.
OpenAI stated that for nearly 80 years, mathematicians believed the best solutions to this problem resembled square grids. The AI model, however, discovered an entirely new family of constructions that performs better, thereby disproving the longstanding belief.
This marks the first time AI has autonomously solved a prominent open problem central to a field of mathematics. The proof emerged from a new general-purpose reasoning model, not a system designed specifically for math problems.
OpenAI emphasizes the significance of this development, suggesting that AI systems can now handle complex chains of reasoning and connect ideas across different fields. This capability has potential implications for advancements in biology, physics, engineering, and medicine.
Thomas Bloom noted that AI is helping to explore the vastness of mathematics, hinting at more discoveries to come.
Related articles
The AI world is getting ‘loopy’
AI models are taking a significant leap forward with the adoption of "agentic loops," where AI agents continuously prompt each other to improve code and solve complex problems. This approach, though potentially resource-intensive, promises to unlock new levels of autonomous problem-solving and efficiency in AI applications.
Codex-maxxing for long-running work
Codex is increasingly being used by organizations to support long-running projects that go beyond a single prompt. This whitepaper by Jason Liu offers practical strategies for leveraging Codex as a persistent workspace, managing complex workflows and sustaining progress.
Nobel laureate John Jumper is leaving DeepMind for rival Anthropic
Nobel laureate John Jumper is departing Google DeepMind to join its competitor, Anthropic, after dedicating nearly nine years to DeepMind, where he led the AlphaFold team. Jumper, who shared a Nobel Prize for his work on AlphaFold, expressed gratitude for his time at DeepMind while looking forward to new endeavors.
