Excavating AI / ImageNet Roulette favicon

Excavating AI / ImageNet Roulette

Free
Excavating AI / ImageNet Roulette screenshot
Click to visit website
Feature this AI

About

Excavating AI is an investigative project and digital tool designed to uncover the social and political implications of image-based machine learning training sets. Developed by Kate Crawford and Trevor Paglen, the platform centers on the "archeology of datasets," specifically scrutinizing ImageNet, one of the most influential benchmarks in artificial intelligence. The project includes an experimental software tool called ImageNet Roulette, which allows users to see how they would be classified by a deep-learning model trained exclusively on ImageNet's "Person" categories. By making these hidden classifications visible, the tool illustrates the shaky and often offensive assumptions that underpin how AI systems recognize and interpret human identity. Technically, the ImageNet Roulette tool utilizes an open-source Caffe deep-learning framework. When a user uploads a photograph, the application runs a face detector to locate subjects and then assigns them a label based on the thousands of categories found in the original ImageNet "Person" hierarchy. These categories include labels ranging from professions like "welders" to derogatory terms and nonsensical classifications such as "snob," "failure," or "closet queen." This process highlights the problematic nature of "supervised learning," where human laborers—often Amazon Mechanical Turk workers—are tasked with flattening complex human experiences into static, often biased, noun-based categories. This classification process is revealed to be a subjective and often ideological act rather than a purely scientific one. The project is primarily intended for researchers, AI ethicists, artists, and students of media theory who seek to understand the non-neutral nature of data. It serves as a powerful case study for those investigating the resurgence of "physiognomic" AI—the pseudoscientific attempt to derive character or morality from facial features. Unlike standard AI diagnostic tools that aim to "fix" bias through more data, Excavating AI argues that the very act of classification is a political intervention. It provides a rare forensic look at the historical and sociological layers of datasets that have already been integrated into critical social infrastructures like hiring, education, and law enforcement protocols. What makes Excavating AI unique is its critical, investigative approach to the "black box" of training data. Rather than offering a commercial service, it acts as a provocateur, revealing the "treachery of images" where labels and referents are arbitrarily linked. By highlighting datasets like ImageNet, JAFFE, and IBM’s Diversity in Faces, the project demonstrates how technical systems can replicate 19th-century phrenology and race science. It stands out as an essential resource for anyone questioning the power dynamics of who gets to decide what an image means and how those representations affect real-world rights, liberties, and forms of self-determination.

Pros & Cons

Provides a rare forensic look into the hidden labels used to train major AI systems.

Uses open-source frameworks for transparency in its experimental results.

Highlights critical ethical issues like phrenology and race science in modern AI.

Does not store user-uploaded images, respecting immediate privacy during the experiment.

Documented by world-class researchers from the AI Now Institute at NYU.

The 'Person' categories in the original dataset are currently down for maintenance or removed from public servers.

Produces offensive and derogatory labels by design to illustrate dataset bias, which may be distressing.

Limited to noun-based classifications derived from the historical WordNet hierarchy.

Functionality may be limited as several major source datasets have been taken offline by their respective institutions.

Use Cases

Ethical AI researchers can use the tool to demonstrate the real-world impact of biased training data to policymakers.

Media theory students can perform an 'archeology of datasets' to trace how historical prejudices are replicated in machine vision.

Artists and activists can visualize the 'physiognomic' assumptions of AI to create critical commentary on surveillance.

Computer science educators can use the project as a case study to teach students about the non-neutrality of supervised learning.

Platform
Web
Task
person classifying

Features

ethical ai research insights

open-source software framework

classification visualization

imagenet 'person' category analysis

historical dataset archeology

deep-learning caffe framework

automated image labeling

face detection and bounding boxes

FAQs

What is the primary purpose of ImageNet Roulette?

It is an experimental tool created to show how technical systems are trained using problematic data. It classifies users based on the 'Person' category in ImageNet to reveal the biases inherent in such datasets.

Does the tool store the photos I upload?

No, the creators state that ImageNet Roulette does not store the photos people upload to the application. The tool is designed for research and demonstration purposes rather than data collection.

Why are some of the labels returned by the tool offensive or nonsensical?

The labels are drawn directly from the ImageNet 'Person' category, which contains many problematic, misogynistic, and racist terms assigned by crowdsourced workers. The tool's purpose is to make these hidden classifications visible.

What happened to the 'Person' category images on the ImageNet website?

In early 2019, approximately 1.2 million photos in the 'Person' category were taken down or moved 'under maintenance' following research into their ethical and privacy violations. The tool uses a backup of these categories.

How does the tool determine a label if no face is detected?

If the face detector fails to locate a face, the application sends the entire uploaded scene to the Caffe model. It then returns a label in the upper left corner of the processed image based on the model's classification of the background.

Pricing Plans

Open Access
Free Plan

Image classification visualization

Face detection output

ImageNet 'Person' labels

Archeological analysis findings

Open-source Caffe framework usage

Job Opportunities

There are currently no job postings for this AI tool.

Explore AI Career Opportunities

Ratings & Reviews

No ratings available yet. Be the first to rate this tool!

Featured Tools

adly.news favicon
adly.news

Connect with engaged niche audiences or monetize your subscriber base through an automated marketplace featuring verified metrics and secure Stripe payments.

View Details
Nana Banana Pro favicon
Nana Banana Pro

Maintain perfect character consistency across diverse scenes and styles with advanced AI-powered image editing for creators, marketers, and storytellers.

View Details
Kling 4.0 favicon
Kling 4.0

Transform text and images into cinematic 1080p videos with multi-shot storytelling, character consistency, and native lip-synced audio for professional creators.

View Details
AI Seedance favicon
AI Seedance

Generate 15-second cinematic 2K videos with physics-based audio and multi-shot narratives from text or images. Ideal for creators and marketing teams.

View Details
Mistrezz.AI favicon
Mistrezz.AI

Engage in immersive NSFW roleplay and ASMR voice sessions with adaptive AI companions designed for structured escalation, fantasy scenarios, and personal connection.

View Details
Seedance 3.0 favicon
Seedance 3.0

Transform text prompts or static images into professional 1080p cinematic videos. Perfect for creators and marketers seeking high-quality, physics-aware AI motion.

View Details
Seedance 3.0 favicon
Seedance 3.0

Transform text descriptions into cinematic 4K videos instantly with ByteDance's advanced AI, offering professional-grade visuals for creators and marketing teams.

View Details
Seedance 2.0 favicon
Seedance 2.0

Generate broadcast-quality 4K videos from simple text prompts with precise text rendering, high-fidelity visuals, and batch processing for content creators.

View Details
BeatViz favicon
BeatViz

Create professional, rhythm-synced music videos instantly with AI-powered visual generation, ideal for independent artists, social media creators, and marketers.

View Details
Seedance 2.0 favicon
Seedance 2.0

Generate cinematic 1080p videos from text or images using advanced motion synthesis and multi-shot storytelling for marketing, social media, and creators.

View Details