Walmart limits employee AI use as soaring token costs hit corporate budgets
As skyrocketing token costs trigger corporate sticker shock, major enterprises are replacing unlimited AI access with strict computational budgets.
June 3, 2026

Walmart has recently begun limiting its employees' use of Code Puppy, an internally developed artificial intelligence assistant designed to streamline administrative and technical workflows[1][2]. According to a Bloomberg report, the retail giant shifted the tool from an unlimited-access model to a token-based rationing system, marking a dramatic change in policy[1][2]. Previously, Walmart’s workforce had open-ended access to Code Puppy, which helps with day-to-day office tasks such as analyzing spreadsheets, generating presentations, and writing code[1]. However, after employee demand for the AI helper outpaced initial budget projections, the company began assigning a fixed number of computing tokens to each user[1][3]. The decision signals a pivotal moment for Walmart, which has been one of the retail sector's most aggressive adopters of generative artificial intelligence, and highlights how even the world's largest companies are beginning to encounter the hard financial limits of uncapped AI deployment[3][4].
The financial friction Walmart is experiencing highlights a fundamental mismatch between the way traditional enterprise software is priced and the consumption-based billing models of modern large language models[5]. Historically, corporate finance officers could easily forecast IT spending because legacy software suites relied on predictable, per-seat subscription fees[5]. Generative AI, by contrast, operates on token-based consumption[5][4]. A token represents a basic unit of text, such as a syllable or a character, processed or generated by the underlying model[1][4]. Because costs scale directly with the volume of data digested and produced, every query, iterative prompt, and long document upload has a direct, real-time cost[5][4]. For an enterprise like Walmart, which employs roughly 2.1 million people globally, even modest, well-intentioned use by a fraction of its staff can trigger massive, unpredictable expenditures as thousands of individual interactions accumulate on the corporate ledger[6][4].
Walmart is far from alone in grappling with the sudden sticker shock of widespread enterprise AI adoption[2][4]. Major technology players and global corporations across various sectors are discovering that unchecked enthusiasm for AI can quickly wreck annual IT budgets[5][7]. Uber, for instance, recently exhausted its entire planned annual artificial intelligence budget within the first four months of the year, driven primarily by the rapid, viral spread of Anthropic’s Claude Code among its five thousand software engineers[8]. Uber's chief technology officer, Praveen Neppalli Naga, admitted that the company’s internal modeling failed to anticipate the rate at which developers would adopt agentic tools, with some power users running up bills between five hundred and two thousand dollars per month[8]. Similarly, Microsoft has reportedly begun scaling back and canceling internal licenses for Claude Code, steering its engineers toward proprietary alternatives in an effort to contain spiraling expenses[7][9]. In the most extreme case reported by an industry consultant, an unnamed corporate client accidentally ran up a staggering five hundred million dollar bill in a single month on Anthropic’s Claude after failing to implement basic usage caps or real-time monitoring on employee licenses[10][11].
At the heart of these runaway costs is the technological evolution from simple, single-turn chatbots to complex, agentic AI systems that consume computational power at an unprecedented rate[7]. While a traditional chatbot responds to a single prompt, agentic AI is designed to act autonomously, taking multiple sequential steps, verifying its own work, and repeatedly calling upon large language models to complete a task[7]. While this multi-step reasoning significantly enhances the capability of tools like Code Puppy, it also dramatically multiplies the volume of tokens consumed[12][7]. The situation has been further exacerbated by a corporate trend known as tokenmaxxing, where companies and teams have historically evaluated their AI readiness by tracking the raw volume and complexity of AI interactions rather than measurable business outcomes[6][13]. This lack of rigorous criteria has often led employees to gamify internal key performance indicators, using high-cost generative AI tools for trivial administrative tasks, such as checking the weather or conducting simple database searches, when traditional software would have been far more efficient and cost-effective[6][14].
In response to these ballooning costs, corporate leaders are shifting their focus from broad, unstructured experimentation toward strict financial governance and the discipline of AI unit economics[4][12]. Walmart’s transition to a token cap for Code Puppy represents a calculated move to transition AI from an untracked overhead expense into a metered, manageable utility[12]. By assigning a fixed budget of tokens to employees, the company forces team managers to treat AI compute as a finite resource, enabling them to run internal chargebacks, optimize prompting behaviors, and prioritize workflows that yield clear business value[12]. To support this transition, Walmart has issued updated guidance and training programs to help employees identify when to deploy advanced AI agents and when to utilize lower-cost, conventional software tools[6][4]. Rather than discouraging the use of artificial intelligence, these measures aim to cultivate a culture of fiscal responsibility, ensuring that every token burned is tied directly to a tangible gain in productivity or operational efficiency[1][4].
The current wave of token rationing across corporate America marks a healthy and necessary maturation phase for the generative AI market[1][12]. The era of unrestricted, consequence-free AI exploration is drawing to a close, replaced by a pragmatism that requires technological innovation to justify its spot on the corporate balance sheet[1][2]. As giants like Walmart, Uber, and Microsoft establish more rigid boundaries and cost-control systems, they are helping to define the true commercial value of large language models in the enterprise[1][3]. Ultimately, the long-term viability of corporate artificial intelligence will not depend on how many tokens an organization can consume, but on how efficiently and strategically those tokens are converted into sustainable economic value[4][12].