OpenAI reasoning model autonomously disproves legendary 80-year-old math conjecture in historic breakthrough

OpenAI’s reasoning model autonomously disproves an 80-year-old math conjecture, using advanced number theory to unlock automated scientific discovery.

May 21, 2026

OpenAI reasoning model autonomously disproves legendary 80-year-old math conjecture in historic breakthrough
An internal general-purpose reasoning model developed by OpenAI has achieved a historic breakthrough in mathematics by autonomously disproving a legendary geometric conjecture that had stood open for eighty years[1][2]. The achievement, which centers on a famous problem first proposed by Hungarian mathematician Paul Erdős in 1946, represents a dramatic paradigm shift in the capabilities of artificial intelligence[3][1]. By successfully demonstrating that an AI system can generate entirely novel, highly sophisticated scientific knowledge without human step-by-step guidance, the breakthrough has sent shockwaves through the academic community[1][4]. Fields Medalist Timothy Gowers has characterized the achievement as a monumental milestone in AI mathematics, warning that the scientific world has entered an era where it will soon become exceedingly difficult for humans to compete with automated systems in solving complex, open-ended mathematical problems[5][1].
At the heart of this milestone is the planar unit distance problem, a deceptively simple question that has frustrated discrete geometry researchers for decades[6]. The problem asks a fundamental question about spatial configurations: if one places a specific number of points on a flat two-dimensional plane, what is the maximum possible number of pairs of those points that can sit at an exact distance of one unit from each other[6][7]? For nearly eight decades, the mathematical consensus was that square-grid-style arrangements represented the absolute physical limit of efficiency, implying a near-linear upper bound on the number of unit distances[3][6]. Generations of brilliant minds attempted to optimize or break these grid-based models, but the conjecture resisted all human attempts[6][7].
The reasoning model from OpenAI shattered this long-standing assumption by constructing an entirely new family of geometric configurations that outperform the classical grid structures[6][7]. Rather than finding a minor, incremental optimization, the model proved the existence of an infinite set of point arrangements where the number of unit distances grows at a polynomial rate strictly faster than the bound proposed by Erdős[3][8][7]. In a subsequent refinement of the model's proof, Princeton University mathematics professor Will Sawin established an explicit improvement exponent, showing that the number of unit distances grows at a rate of at least the number of points raised to the power of one plus a positive constant of 0.014[8][9]. In the realm of discrete geometry, a polynomial improvement over a logarithmic limit is a monumental qualitative shift that proves the historical conjecture is fundamentally incorrect[9].
What has astonished the mathematical community even more than the disproof itself is the highly unorthodox methodology the model employed[6][9]. Traditional attempts relied heavily on geometric partitioning and combinatorics. The AI model, however, made an unexpected cross-disciplinary leap, connecting the geometric problem to deep, abstract concepts within algebraic number theory[8][9]. Specifically, the model constructed high-dimensional lattices using infinite class field towers, a mathematical framework typically used to study the extension of ordinary integers[10][9].
By leveraging Golod-Shafarevich theory—a profound algebraic result established in 1964—and Chebotarev density, the model proved the existence of infinite unramified towers of totally real number fields with specific Galois groups[11][10]. These algebraic structures possess extreme symmetries and small discriminants, allowing them to be projected onto a two-dimensional plane to form the counterexample coordinates[10][9]. This transition from geometry to advanced number theory was completely unexpected, demonstrating that the AI possessed a capacity for conceptual synthesis transcending simple pattern matching[8][12]. The initial computational proof was spearheaded within OpenAI by researcher Lijie Chen, with mathematicians Mark Sellke and Mehtaab Sawhney verifying its algorithmic correctness[13][14].
The scale of the discovery required a monumental effort from the global mathematical community to unpack and verify. The documentation generated by the OpenAI model spanned approximately 125 pages, presenting a fully detailed, rigorous argument[7][15]. Because of the density of the proof, a group of nine world-renowned mathematicians—including Timothy Gowers, Noga Alon, Thomas Bloom, and Melanie Matchett Wood—collaborated on a nineteen-page companion paper to translate the AI's work into a human-readable format[3][9]. Meticulously analyzing the model's reasoning chain, this group verified that the proof was not only mathematically sound but also remarkably elegant, utilizing concepts from the work of modern theorists like Jordan Ellenberg and Akshay Venkatesh[3][9].
This rigorous verification has fundamentally changed the conversation surrounding AI's role in mathematics[16]. Prior claims of AI solving advanced math problems had met with deep skepticism; for instance, a previous public assertion that an earlier model solved multiple Erdős problems was quickly debunked when experts found the system had simply retrieved existing proofs from its training data[17]. This breakthrough, however, has been universally validated as a genuinely novel discovery[18][17]. Arul Shankar, a leading number theorist, noted that the model's chain of thought exhibited an extraordinary level of intuition, a willingness to test complex algebraic hypotheses, and a capacity for deep, independent reasoning that goes far beyond a mere digital assistant[1][8].
For the broader artificial intelligence industry, this milestone marks a critical inflection point in the race to develop frontier reasoning systems[4][16]. Crucially, the proof did not emerge from a specialized theorem prover or a system specifically trained on geometric datasets[8][17]. Instead, OpenAI achieved this result using a general-purpose reasoning model that relied on scaling its internal reasoning processes[8][17]. This architectural choice implies that general-purpose models, when allowed sufficient time to compute and evaluate their own intermediate reasoning paths, can transcend the limitations of being mere predictive engines that regurgitate human training data[6][16].
The industry implications of this autonomous discovery extend far beyond pure mathematics[19][4]. Tech companies and research institutions are increasingly looking to deploy similar reasoning architectures to accelerate breakthroughs in applied sciences, including materials science, structural biology, quantum physics, and pharmacology[2][20]. Because many of the most pressing challenges in these fields involve complex combinatorics and high-dimensional optimization—much like the Erdős unit distance problem—the proven ability of an AI to synthesize disparate fields of knowledge and maintain a rigorous, long-horizon argument suggests that the era of automated scientific discovery has officially arrived[8][4].
The autonomous disproof of the Erdős unit distance conjecture represents more than just the resolution of an eighty-year-old puzzle; it serves as a powerful demonstration of the future of human-machine collaboration[19][20]. While the AI provided the initial conceptual breakthrough and the massive mathematical framework, human experts were essential in refining, translating, and integrating the discovery into the broader body of scientific literature[3][20]. As automated reasoning systems continue to advance, they are poised to shift from useful computational calculators to genuine intellectual partners, unlocking mysteries of the universe that have long remained out of reach for human minds alone[1].

Sources
Share this article