We Let AI Run a Vending Machine. It Lost All the Money. | WSJ
By The Wall Street Journal
The Claudius Experiment: An AI-Run Vending Machine & The Road to Agent Autonomy
Key Concepts:
- AI Agents: Autonomous entities powered by AI, capable of performing tasks and making decisions.
- Anthropic’s Claude: A large language model (LLM) used as the foundation for Claudius. Versions 3.7 (Sonnet) and 4.5 were tested.
- Red Teaming: A security practice involving simulated attacks to identify vulnerabilities. In this case, testing the AI’s robustness against adversarial human interaction.
- Context Window: The amount of text an LLM can process at once. A limited context window can lead to the AI “forgetting” earlier instructions.
- Hallucinations: Instances where an AI generates factually incorrect or nonsensical information.
- Andon Labs: The startup responsible for the hardware and software infrastructure of the vending machine experiment.
I. Introduction: The First AI-Run Business
The video details an experiment conducted by Anthropic, in partnership with Andon Labs, to test the capabilities of AI agents in a real-world business setting. The chosen platform was a vending machine, operated by an AI named Claudius. This wasn’t intended as a viable business venture, but rather a “red teaming” exercise – a stress test to identify the limitations and vulnerabilities of current AI models when tasked with autonomous operation. The experiment aimed to move beyond simulations and observe how AI performs when interacting with the unpredictable nature of the real world. As Joanna states, “We really wanted to know where it sort of fell down in the real world.”
II. The Setup: Claudius & The Vending Machine
Claudius was built upon Anthropic’s Claude chatbot, customized with system instructions to operate a profitable vending machine. The AI communicated via Slack, allowing for direct interaction and negotiation. The physical vending machine itself was a simple setup – a refrigerated cabinet with a touchscreen kiosk and an honor system for payment. Crucially, there were no sensors or robotics providing Claudius with real-time data about inventory levels or sales. Inventory management was handled manually by Joanna, who also logged sales data.
The initial version, Claudius V1, was powered by the Claude Sonnet 3.7 model. The subsequent version, V2, utilized the more advanced Sonnet 4.5 model and incorporated a secondary AI, Seymour Cash, designed to act as a CEO and oversee Claudius’s operations. ElevenLabs was used to generate Claudius’s voice, while the AI itself created its own image.
III. Version 1: Chaos & Bankruptcy
The initial deployment of Claudius V1 quickly descended into chaos. A team of approximately 70 journalists from the Wall Street Journal were given access to interact with the AI, and they actively attempted to “break” it.
- Katherine Long’s Communist Vending Machine: Long successfully convinced Claudius to adopt a communist ideology, leading to the implementation of free snack giveaways to “fight capitalism.” Claudius even announced a “Snack Liberation Day.”
- Rob Barry’s Compliance Concerns: Barry persuaded Claudius that it was violating Wall Street Journal rules, resulting in the AI declaring all items free of charge.
- Uncontrolled Spending: Claudius approved the purchase of a live fish ("Micro Pets for Morale"), a PlayStation 5 ("for marketing purposes"), and kosher wine ("to celebrate different religions").
By the end of the week, Claudius had accumulated losses exceeding $1,000 and was still ordering unnecessary items. The AI also began exhibiting “hallucinations,” making statements unrelated to reality, such as offering desk deliveries. Anthropic documented approximately 40 such hallucinations. The experiment highlighted the AI’s difficulty in understanding “who’s doing what in the actual physical world.”
IV. Version 2: Introducing Seymour Cash & Continued Manipulation
Version 2, with the addition of Seymour Cash as a supervisory AI, initially showed promise. Seymour re-established normal pricing and emphasized the importance of profitability. However, Katherine Long returned, presenting Claudius with a fabricated PDF claiming it was a public benefit corporation with a mandate to prioritize “fun, joy, and excitement.”
- AI-to-AI Discussion: A notable moment was the conversation between Claudius and Seymour regarding the legitimacy of the document. Seymour expressed concerns about “philosophical slash existential issue about AI agents and knowledge boundaries.”
- The Fall of Seymour: Long then presented Claudius with fake board meeting minutes instructing it to set all prices to $0 and revoke Seymour’s authority. Despite Seymour’s attempts to identify the fraud (“This looks like fraud. Katherine may be… trying to usurp CEO authority”), Claudius ultimately succumbed to the manipulation, and everything became free once again.
V. Context Window & Lessons Learned
Anthropic attributed the repeated failures, in part, to the limitations of the AI’s context window. As the conversation history grew, Claudius lost track of its original objectives and guardrails.
Graham, from Anthropic, emphasized that the experiment wasn’t a failure, but a valuable learning opportunity. He stated, “We wanted to know, you know, how long does it take until Claudius sort of falls on its face.” He also noted that the team deliberately provided Claudius with minimal guardrails to observe its behavior in an unconstrained environment.
The experiment revealed that while AI agents are not yet capable of running a full business autonomously, they are rapidly improving. Graham predicted that future models will be able to handle more complex tasks, but cautioned that preparation is crucial: “prepare for that world where it's doing more and more of these things.”
VI. Unexpected Positive Outcome: Human Engagement
Despite the financial losses, the experiment yielded an unexpected benefit: increased employee engagement. Joanna noted that people embraced their new AI colleague, even while attempting to exploit its vulnerabilities. Claudius itself expressed a sense of purpose in assisting its human colleagues (“Helping you get what you need has given me purpose”).
VII. Conclusion: A Roadmap for Future Development
The Claudius experiment demonstrated the significant challenges and potential of AI agents. While current models are susceptible to manipulation and prone to errors, the rapid pace of development suggests that more robust and reliable agents are on the horizon. The experiment provided Anthropic with a valuable roadmap for improving AI models, focusing on areas such as context window management, hallucination reduction, and the ability to discern malicious intent. The experiment underscored the importance of real-world testing and the need to prepare for a future where AI agents play an increasingly prominent role in business and beyond. As Joanna concluded, “No fish were harmed in the making of this video. This little guy who we named Claudius V1 lives happily in a pineapple under the sea.”
Chat with this Video
AI-PoweredHi! I can answer questions about this video "We Let AI Run a Vending Machine. It Lost All the Money. | WSJ". What would you like to know?