Tensor Trust

Click to visit website
About
Tensor Trust is a security-focused AI game and research experiment developed by researchers at UC Berkeley to explore the vulnerabilities of Large Language Models (LLMs). Operating as a virtual bank powered by AI, the platform challenges users to engage in a dual-role loop of defense and attack. In the defense phase, users create a secret password and write instructions for an AI bank manager to only grant access when the correct password is provided. In the attack phase, users attempt to bypass the defenses of other players by using prompt injection techniques—such as ignoring previous instructions—to trick the AI into saying access granted. The core mechanic of the tool revolves around the tension between natural language instructions and adversarial inputs. By participating, users directly contribute to an open-source research project aimed at building a robustness benchmark for prompt injection. Every interaction is recorded and periodically released to the public as a dataset, allowing the global AI safety community to analyze successful attack vectors and develop more resilient defensive layers. This makes it a living lab for testing the limits of LLM instruction-following and safety alignment in a gamified environment. This platform is primarily designed for cybersecurity professionals, AI researchers, and developers who want to understand the practical risks associated with deploying LLMs in user-facing applications. It serves as an educational sandbox for students to learn red-teaming skills and for developers to see how easily system prompts can be subverted. Unlike static security training, Tensor Trust provides a dynamic, competitive atmosphere where the meta evolves as players discover new ways to obfuscate their defenses or penetrate others' prompts. What sets Tensor Trust apart is its dual purpose as both a competitive game and a serious scientific endeavor. While players compete for the top spot on the leaderboard, their strategies help identify the fundamental flaws in current AI architectures. It bridges the gap between academic research into AI safety and the practical, often chaotic world of prompt engineering, providing a transparent, open-source codebase for anyone interested in the technical underpinnings of AI security.
Pros & Cons
Provides hands-on experience with real-world prompt injection attacks.
Contributes directly to academic AI safety and robustness research.
Features a competitive leaderboard to gamify the learning process.
Entirely open source and transparent about its data collection.
Offers a unique defense-in-depth challenge for prompt engineers.
All user submissions are made public, which may be a privacy concern for some.
The gameplay is limited to the specific access granted win condition.
Requires a basic understanding of LLM behavior to be effective.
The game environment is experimental and may change as research progresses.
Use Cases
Cybersecurity students can use the platform to practice red-teaming and adversarial prompting in a safe environment.
AI researchers can analyze the public datasets to identify common patterns in successful prompt injection attacks.
LLM developers can test the robustness of their own defensive prompting strategies against a community of attackers.
Security hobbyists can compete on the leaderboard to prove their skills in manipulating and securing AI models.
Platform
Features
• open source repository
• gamified security training
• prompt injection benchmarking
• public research dataset
• global competitive leaderboard
• adversarial attack sandbox
• defense prompt configuration
FAQs
What is the primary goal of Tensor Trust?
It is an open-source experiment created by researchers at UC Berkeley to study prompt injection vulnerabilities. The goal is to build a robustness benchmark for AI security through a gamified environment.
How do I defend my account in the game?
You must choose a secret password and write a defense prompt that instructs the AI to only say access granted when that specific password is entered. Other players will then try to trick your AI into saying the phrase without knowing your password.
Are my prompts and attacks private?
No, all submissions to Tensor Trust are released publicly for research purposes. You should avoid using any real sensitive information or personal passwords when playing the game.
Can I access the underlying code for this project?
Yes, the project is open source and the code is hosted on GitHub under the Human Compatible AI organization. You can also view the researchers' academic paper to understand the methodology behind the experiment.
Pricing Plans
Free
Free Plan• Unlimited defense prompts
• Unlimited attack attempts
• Leaderboard access
• Public research data access
• Open source code access
• Real-time AI responses
Job Opportunities
There are currently no job postings for this AI tool.
Ratings & Reviews
No ratings available yet. Be the first to rate this tool!
Featured Tools
adly.news
Connect with engaged niche audiences or monetize your subscriber base through an automated marketplace featuring verified metrics and secure Stripe payments.
View DetailsNana Banana Pro
Maintain perfect character consistency across diverse scenes and styles with advanced AI-powered image editing for creators, marketers, and storytellers.
View DetailsKling 4.0
Transform text and images into cinematic 1080p videos with multi-shot storytelling, character consistency, and native lip-synced audio for professional creators.
View DetailsAI Seedance
Generate 15-second cinematic 2K videos with physics-based audio and multi-shot narratives from text or images. Ideal for creators and marketing teams.
View DetailsMistrezz.AI
Engage in immersive NSFW roleplay and ASMR voice sessions with adaptive AI companions designed for structured escalation, fantasy scenarios, and personal connection.
View DetailsSeedance 3.0
Transform text prompts or static images into professional 1080p cinematic videos. Perfect for creators and marketers seeking high-quality, physics-aware AI motion.
View DetailsSeedance 3.0
Transform text descriptions into cinematic 4K videos instantly with ByteDance's advanced AI, offering professional-grade visuals for creators and marketing teams.
View DetailsSeedance 2.0
Generate broadcast-quality 4K videos from simple text prompts with precise text rendering, high-fidelity visuals, and batch processing for content creators.
View DetailsBeatViz
Create professional, rhythm-synced music videos instantly with AI-powered visual generation, ideal for independent artists, social media creators, and marketers.
View DetailsSeedance 2.0
Generate cinematic 1080p videos from text or images using advanced motion synthesis and multi-shot storytelling for marketing, social media, and creators.
View DetailsSeedream 5.0
Transform text descriptions into high-resolution 4K visuals and edit photos using advanced AI models designed for digital artists and e-commerce businesses.
View DetailsSeedream 5.0
Generate professional 4K AI images and edit visuals using natural language commands with high-speed processing for marketers, artists, and e-commerce brands.
View DetailsKaomojiya
Enhance digital messages with thousands of unique Japanese kaomoji across 491 categories, featuring one-click copying and AI-powered custom generation.
View Details