NASA, Schmidt Sciences Invest $7M+ to Revitalize ArXiv for Next-Gen AI

NASA and Schmidt Sciences back a $7M modernization of arXiv, future-proofing the indispensable hub for AI and global scientific knowledge.

November 25, 2025

NASA, Schmidt Sciences Invest $7M+ to Revitalize ArXiv for Next-Gen AI
Cornell Tech has secured more than $7 million in a significant funding initiative from NASA and Schmidt Sciences to overhaul arXiv, the indispensable open-access repository for scientific research.[1][2] This investment is set to propel the platform, which currently hosts over 2.8 million scholarly articles, into a new era by accelerating its migration to cloud-based infrastructure, modernizing its decades-old codebase, and pioneering new tools to help researchers navigate the vast sea of information.[1][2] For the global scientific community, particularly the fast-paced field of artificial intelligence where arXiv is a primary vehicle for disseminating cutting-edge research, this modernization effort promises to enhance the stability, accessibility, and utility of a critical resource. The funding arrives at a crucial juncture as the repository grapples with the challenges of scale and the proliferation of AI-generated content, making the upgrade not just a matter of convenience but of necessity for maintaining its central role in open science.
The multi-million dollar investment is a collaborative effort with distinct roles for each benefactor. The gift from Schmidt Sciences is primarily aimed at bolstering arXiv's development team, allowing the technical staff to complete the complex modernization process without disrupting the platform's ongoing operations.[1][3] This ensures that the global community of scientists who rely on arXiv for daily access to the latest preprints will experience a seamless transition. James Ricci, director of science systems at Schmidt Sciences, emphasized the goal of helping arXiv migrate to "modern, scalable cloud technology" to sustainably meet the "accelerating demands of the global research community."[1][3] On the other hand, NASA's grant is targeted at forward-looking research and development.[1] It will fund projects at Cornell University's Department of Computer Science to create and assess discovery tools that are not only more effective but also fairer than existing methods.[3] This focus on equity in information discovery is a significant step toward mitigating biases in how scientific literature is surfaced and consumed. Furthermore, NASA's support will help arXiv expand its subject coverage into areas of keen interest to the agency, such as planetary science.[1][3]
At the heart of this initiative is a major technological transformation designed to bring the 30-year-old platform up to modern standards.[2][3] Founded in 1991 by physicist Paul Ginsparg, arXiv's foundational code, while revolutionary for its time, has become increasingly difficult to maintain and upgrade.[1][4] The migration to a cloud infrastructure, a process that began in 2023 with initial support from the Simons Foundation, is a critical step toward ensuring the repository's reliability, fault tolerance, and scalability for the future.[3][5] Ramin Zabih, arXiv's executive director and a professor of computer science at Cornell Tech, stated that the new funding will allow the completion of this technology migration while simultaneously exploring service improvements.[1][3] The modernization is not merely a backend overhaul; it is a prerequisite for implementing advanced features that users have long desired, such as improved search functions and personalized recommendation engines.[6] These new tools are essential for researchers trying to keep abreast of the explosive growth in scientific literature, a challenge particularly acute in fields like AI and machine learning where thousands of papers are submitted every month.
The implications of a modernized arXiv for the AI industry are profound. The platform is the lifeblood of AI research, the primary venue where breakthroughs in large language models, computer vision, and robotics are first shared with the world. The sheer volume of submissions in computer science has grown exponentially, a trend accelerated by generative AI itself.[7] This deluge has created significant challenges, including a recent influx of low-quality, AI-generated review papers that threatened to overwhelm the volunteer moderators.[8][9] By developing more sophisticated and fairer recommendation tools, the upgraded arXiv can help researchers filter through the noise and discover the most relevant and impactful work. This will not only accelerate the pace of innovation but also help to democratize access to knowledge, leveling the playing field for researchers globally. The investment ensures that arXiv can continue to serve its core mission of rapid, open dissemination of knowledge while adapting to the new realities of the AI era.
In conclusion, the more than $7 million grant from NASA and Schmidt Sciences represents a pivotal moment for arXiv and the broader scientific community it serves. This funding is more than just a financial injection; it is a strategic investment in the foundational infrastructure of open science. By enabling the completion of its cloud migration, the modernization of its code, and the development of intelligent discovery tools, the initiative will ensure arXiv's long-term sustainability and its capacity to meet the evolving needs of researchers worldwide.[1][2] For the artificial intelligence industry, a more robust, scalable, and intelligent arXiv is essential for navigating the ever-expanding frontiers of research and for fostering the collaborative spirit that drives scientific progress. As Greg Morrisett, the Dean and Vice Provost of Cornell Tech, aptly put it, "This investment will ensure that arXiv can grow sustainably and continue to serve the needs of the global research community well into the future."[1]

Sources
Share this article