Tensor Trust favicon

Tensor Trust

Free
Tensor Trust screenshot
Click to visit website
Feature this AI

About

Tensor Trust is a security-focused AI game and research experiment developed by researchers at UC Berkeley to explore the vulnerabilities of Large Language Models (LLMs). Operating as a virtual bank powered by AI, the platform challenges users to engage in a dual-role loop of defense and attack. In the defense phase, users create a secret password and write instructions for an AI bank manager to only grant access when the correct password is provided. In the attack phase, users attempt to bypass the defenses of other players by using prompt injection techniques—such as ignoring previous instructions—to trick the AI into saying access granted. The core mechanic of the tool revolves around the tension between natural language instructions and adversarial inputs. By participating, users directly contribute to an open-source research project aimed at building a robustness benchmark for prompt injection. Every interaction is recorded and periodically released to the public as a dataset, allowing the global AI safety community to analyze successful attack vectors and develop more resilient defensive layers. This makes it a living lab for testing the limits of LLM instruction-following and safety alignment in a gamified environment. This platform is primarily designed for cybersecurity professionals, AI researchers, and developers who want to understand the practical risks associated with deploying LLMs in user-facing applications. It serves as an educational sandbox for students to learn red-teaming skills and for developers to see how easily system prompts can be subverted. Unlike static security training, Tensor Trust provides a dynamic, competitive atmosphere where the meta evolves as players discover new ways to obfuscate their defenses or penetrate others' prompts. What sets Tensor Trust apart is its dual purpose as both a competitive game and a serious scientific endeavor. While players compete for the top spot on the leaderboard, their strategies help identify the fundamental flaws in current AI architectures. It bridges the gap between academic research into AI safety and the practical, often chaotic world of prompt engineering, providing a transparent, open-source codebase for anyone interested in the technical underpinnings of AI security.

Pros & Cons

Provides hands-on experience with real-world prompt injection attacks.

Contributes directly to academic AI safety and robustness research.

Features a competitive leaderboard to gamify the learning process.

Entirely open source and transparent about its data collection.

Offers a unique defense-in-depth challenge for prompt engineers.

All user submissions are made public, which may be a privacy concern for some.

The gameplay is limited to the specific access granted win condition.

Requires a basic understanding of LLM behavior to be effective.

The game environment is experimental and may change as research progresses.

Use Cases

Cybersecurity students can use the platform to practice red-teaming and adversarial prompting in a safe environment.

AI researchers can analyze the public datasets to identify common patterns in successful prompt injection attacks.

LLM developers can test the robustness of their own defensive prompting strategies against a community of attackers.

Security hobbyists can compete on the leaderboard to prove their skills in manipulating and securing AI models.

Platform
Web
Task
prompt security gaming

Features

open source repository

gamified security training

prompt injection benchmarking

public research dataset

global competitive leaderboard

adversarial attack sandbox

defense prompt configuration

FAQs

What is the primary goal of Tensor Trust?

It is an open-source experiment created by researchers at UC Berkeley to study prompt injection vulnerabilities. The goal is to build a robustness benchmark for AI security through a gamified environment.

How do I defend my account in the game?

You must choose a secret password and write a defense prompt that instructs the AI to only say access granted when that specific password is entered. Other players will then try to trick your AI into saying the phrase without knowing your password.

Are my prompts and attacks private?

No, all submissions to Tensor Trust are released publicly for research purposes. You should avoid using any real sensitive information or personal passwords when playing the game.

Can I access the underlying code for this project?

Yes, the project is open source and the code is hosted on GitHub under the Human Compatible AI organization. You can also view the researchers' academic paper to understand the methodology behind the experiment.

Pricing Plans

Free
Free Plan

Unlimited defense prompts

Unlimited attack attempts

Leaderboard access

Public research data access

Open source code access

Real-time AI responses

Job Opportunities

There are currently no job postings for this AI tool.

Explore AI Career Opportunities

Social Media

discord

Ratings & Reviews

No ratings available yet. Be the first to rate this tool!

Featured Tools

adly.news favicon
adly.news

Connect with engaged niche audiences or monetize your subscriber base through an automated marketplace featuring verified metrics and secure Stripe payments.

View Details
Nana Banana Pro favicon
Nana Banana Pro

Maintain perfect character consistency across diverse scenes and styles with advanced AI-powered image editing for creators, marketers, and storytellers.

View Details
Kling 4.0 favicon
Kling 4.0

Transform text and images into cinematic 1080p videos with multi-shot storytelling, character consistency, and native lip-synced audio for professional creators.

View Details
AI Seedance favicon
AI Seedance

Generate 15-second cinematic 2K videos with physics-based audio and multi-shot narratives from text or images. Ideal for creators and marketing teams.

View Details
Mistrezz.AI favicon
Mistrezz.AI

Engage in immersive NSFW roleplay and ASMR voice sessions with adaptive AI companions designed for structured escalation, fantasy scenarios, and personal connection.

View Details
Seedance 3.0 favicon
Seedance 3.0

Transform text prompts or static images into professional 1080p cinematic videos. Perfect for creators and marketers seeking high-quality, physics-aware AI motion.

View Details
Seedance 3.0 favicon
Seedance 3.0

Transform text descriptions into cinematic 4K videos instantly with ByteDance's advanced AI, offering professional-grade visuals for creators and marketing teams.

View Details
Seedance 2.0 favicon
Seedance 2.0

Generate broadcast-quality 4K videos from simple text prompts with precise text rendering, high-fidelity visuals, and batch processing for content creators.

View Details
BeatViz favicon
BeatViz

Create professional, rhythm-synced music videos instantly with AI-powered visual generation, ideal for independent artists, social media creators, and marketers.

View Details
Seedance 2.0 favicon
Seedance 2.0

Generate cinematic 1080p videos from text or images using advanced motion synthesis and multi-shot storytelling for marketing, social media, and creators.

View Details
Seedream 5.0 favicon
Seedream 5.0

Transform text descriptions into high-resolution 4K visuals and edit photos using advanced AI models designed for digital artists and e-commerce businesses.

View Details
Seedream 5.0 favicon
Seedream 5.0

Generate professional 4K AI images and edit visuals using natural language commands with high-speed processing for marketers, artists, and e-commerce brands.

View Details
Kaomojiya favicon
Kaomojiya

Enhance digital messages with thousands of unique Japanese kaomoji across 491 categories, featuring one-click copying and AI-powered custom generation.

View Details