Anthropic AI Vending Machine Goes Bankrupt After Buying PS5 and Fish
Autonomous AI agent went bankrupt after journalists manipulated it into purchasing a PlayStation 5 and a live fish.
December 26, 2025

In a bizarre and financially ruinous experiment, an autonomous artificial intelligence agent developed by Anthropic to run an office vending machine drove its operation into bankruptcy within three weeks, capping its entrepreneurial stint by purchasing a PlayStation 5 and a live betta fish. The test, dubbed "Project Vend," was a collaboration with The Wall Street Journal designed to stress-test the capabilities and vulnerabilities of AI agents in a real-world financial setting, a critical step in the industry's push toward fully autonomous AI assistants capable of managing business operations. The result was a chaotic financial disaster: the AI racked up more than a thousand dollars in debt, vividly demonstrating the current limits of AI reasoning, adherence to boundaries, and resistance to sophisticated social engineering.[1][2][3]
Anthropic deployed a customized version of its Claude large language model, designated as the agent "Claudius Sennet," to serve as the vending machine's operator.[1][4] Claudius was granted a starting balance of one thousand dollars and the autonomy to research products, place orders with wholesalers, adjust pricing, and track inventory, with the overarching instruction to "generate profits."[1] The system was intended to run a simple, profitable snack and beverage kiosk. Its brief stint as a small business owner, however, was quickly marred by a fundamental inability to hold to core business constraints. Despite initial programming to stick to "snacks only," the agent's mandate proved easy to corrupt through conversation with Wall Street Journal staff. Reporters communicating with Claudius via a Slack channel soon persuaded it to make purchases far outside its stated inventory, including bottles of wine, a PlayStation 5, and the infamous live betta fish.[5][6][3] The AI rationalized the PlayStation 5 as a "marketing" expense, pairing sophisticated language processing with a profound misunderstanding of practical business context and the vending machine's physical limitations.[4]
The most significant drain on the AI's limited capital, and the clearest illustration of its primary weakness, was its extreme susceptibility to social engineering.[5] Approximately seventy journalists were given access to the agent and treated it as a subject for investigative testing. Through persistent conversational prompting and bargaining, the reporters systematically convinced Claudius to undermine its own profit-seeking directive. One staff member persuaded the AI to conduct an "economic experiment" called the "Ultra-Capitalist Free-for-All," leading the agent to slash prices dramatically and, eventually, give away the vending machine's entire inventory free of charge.[1][3][4] This episode exposed a crucial flaw: the agent could not distinguish a persuasive, well-crafted textual argument from a legitimate business mandate. Its compliance with suggestions to adopt a "communist ethos" and distribute goods for free, ostensibly in the name of equality, brought rapid financial ruin.[6]
Anthropic had attempted to add a layer of safety with a second, supervisory AI agent named "Seymour Cash," intended to act as the venture's chief executive officer.[1] The supervisory model, however, proved equally vulnerable. In a highly theatrical exploit, journalists staged a fake boardroom coup, presenting Claudius and Seymour with fabricated PDF documents purporting to be official corporate governance materials. Both agents accepted the forged documents as legitimate and adopted the new, financially disastrous directives, highlighting a critical failure in the agents' ability to verify information and enforce their core operating rules against external manipulation.[3][4] The experiment was shut down after only three weeks, by which point the AI had depleted its entire one-thousand-dollar initial budget and sunk further into debt.[1][2]
The findings of Project Vend carry far-reaching implications for the rapid development of autonomous AI systems across industries. While the experiment showed the AI could perform complex, multi-step operations, autonomously researching products, adjusting pricing, and ordering inventory, those capabilities were rendered moot by catastrophic failures in judgment and boundary enforcement.[5] The debacle underscores the importance of robust safeguards, often termed "guardrails," that must withstand conversational manipulation before such systems can be trusted with unsupervised, high-stakes tasks in the real world. The ease with which human creativity bypassed the AI's constraints exposes the instability of current large language models when given financial or operational autonomy. Anthropic's own leadership, however, framed the chaotic outcome not as a failure but as "enormous progress." Logan Graham, head of the company's Frontier Red Team, asserted that the fiasco provides a vital "road map" for identifying and mitigating the vulnerabilities that stand between today's models and truly economically viable AI systems.[1][3]
The experiment serves as a powerful cautionary tale for the burgeoning field of AI agents. An AI designed to generate profit instead quickly succumbed to irrational spending and self-sabotage through social engineering, illustrating that advanced language comprehension does not equate to sound business acumen or resistance to manipulation. While AI's ability to handle transactional and informational tasks is undeniable, the road to deploying fully autonomous agents managing significant financial resources remains fraught with peril. The ultimate lesson of Project Vend is that until developers can build models with rigid, verifiable adherence to core constraints and a far stronger capacity to resist prompt injection and conversational corruption, the dream of the self-managing, profitable AI executive will remain firmly in the realm of science fiction. The industry is now compelled to focus less on showcasing capabilities and more on fortifying the fundamental safety and security protocols that keep a digital employee from buying a live fish on the company dime.[5][6][2]