Segment Anything Model (SAM)

Free

About

Segment Anything Model (SAM) is a new AI model from Meta AI that can "cut out" any object in any image with a single click. SAM is a promptable segmentation system: it accepts a variety of input prompts, which enables flexible integration with other systems, and it produces extensible outputs. Because SAM has learned a general notion of what objects are, it generalizes zero-shot to unfamiliar objects and images without additional training. The model was trained on the SA-1B dataset, which consists of 11M images and 1B+ masks.

Platform
Web
Keywords
computer vision, AI models, image segmentation, segmentation masks
Task
image segmenting

Features

zero-shot generalization

flexible integration with other systems

extensible outputs

multiple valid masks for ambiguous prompts

automatic segmentation

interactive point and box prompts

promptable segmentation system

FAQs

What type of prompts are supported?

Foreground/background points, Bounding box, Mask
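
As a rough illustration of how these prompt types can be passed in practice, the sketch below uses the `SamPredictor` class from the official `segment-anything` package. The checkpoint path, the input image array, and the helper names `make_point_prompt` and `segment_with_points` are placeholders introduced here for illustration, so treat this as a sketch under those assumptions and not as a reference implementation.

```python
def make_point_prompt(foreground, background=()):
    """Build (coords, labels) lists: label 1 = foreground click, 0 = background click."""
    coords = [list(p) for p in foreground] + [list(p) for p in background]
    labels = [1] * len(foreground) + [0] * len(background)
    return coords, labels


def segment_with_points(image_rgb, foreground=(), background=(), box=None,
                        checkpoint="path/to/sam_vit_h_checkpoint.pth"):
    """Run SAM on an HxWx3 uint8 RGB array with point and/or box prompts.

    Needs numpy, torch, and the segment-anything package. They are imported
    lazily so that make_point_prompt() stays usable without them.
    """
    import numpy as np
    from segment_anything import SamPredictor, sam_model_registry

    # The checkpoint path is a placeholder; point it at a downloaded ViT-H file.
    sam = sam_model_registry["vit_h"](checkpoint=checkpoint)
    predictor = SamPredictor(sam)
    predictor.set_image(image_rgb)  # computes the image embedding once

    coords, labels = make_point_prompt(foreground, background)
    masks, scores, low_res_logits = predictor.predict(
        point_coords=np.array(coords, dtype=np.float32) if coords else None,
        point_labels=np.array(labels) if coords else None,
        box=None if box is None else np.array(box, dtype=np.float32),  # x0, y0, x1, y1
        multimask_output=True,  # return several candidate masks for ambiguous prompts
    )
    return masks, scores, low_res_logits
```

The third return value holds low-resolution mask logits. One of them can be fed back through the `mask_input` argument of `predict` on a later call, which is how the mask prompt type is typically supplied.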

What is the structure of the model?

The model has three components: a ViT-H image encoder that runs once per image and outputs an image embedding; a prompt encoder that embeds input prompts such as clicks or boxes; and a lightweight transformer-based mask decoder that predicts object masks from the image embedding and the prompt embeddings.
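
The practical consequence of this split is that the expensive encoder runs once and every later prompt reuses its output. The toy class below mimics only that control flow. Its "encoder" and "decoder" are trivial stand-ins invented for illustration, not SAM's networks, and the class name is hypothetical.

```python
class ToyPromptableSegmenter:
    """Stand-in for the three-part design; all components here are trivial stubs."""

    def __init__(self):
        self.encoder_calls = 0
        self._embedding = None

    def set_image(self, image):
        # Heavy step (the ViT-H encoder in SAM): run once per image, cache the result.
        self.encoder_calls += 1
        self._embedding = {"height": len(image), "width": len(image[0])}

    def predict(self, point):
        # Light step: embed the prompt and decode a mask from the cached embedding.
        if self._embedding is None:
            raise RuntimeError("call set_image() before predict()")
        x, y = point  # in this toy, the prompt "embedding" is just the raw click
        h, w = self._embedding["height"], self._embedding["width"]
        # Toy "mask": mark only the clicked pixel.
        return [[(r == y and c == x) for c in range(w)] for r in range(h)]


segmenter = ToyPromptableSegmenter()
segmenter.set_image([[0] * 4 for _ in range(3)])  # 3 rows x 4 columns
masks = [segmenter.predict(p) for p in [(0, 0), (3, 2), (1, 1)]]
print(segmenter.encoder_calls)  # 1: three prompts reused one embedding
```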

What platforms does the model use?

The image encoder is implemented in PyTorch and requires a GPU for efficient inference. The prompt encoder and mask decoder can run directly with PyTorch, or they can be converted to ONNX and run efficiently on CPU or GPU across a variety of platforms that support ONNX Runtime.

How big is the model?

The image encoder has 632M parameters. The prompt encoder and mask decoder have 4M parameters.
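
For a sense of scale, the parameter counts can be turned into a rough estimate of weight storage. The bytes-per-parameter values (4 for fp32, 2 for fp16) are assumptions about the storage format, and the estimate covers weights only, not activations or runtime overhead.

```python
ENCODER_PARAMS = 632_000_000  # image encoder, from the FAQ
DECODER_PARAMS = 4_000_000    # prompt encoder + mask decoder, from the FAQ


def weight_size_gb(num_params, bytes_per_param):
    """Approximate size of the weights alone, in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bytes_per_param / 1e9


total = ENCODER_PARAMS + DECODER_PARAMS
fp32_gb = weight_size_gb(total, 4)  # assumes 4 bytes per parameter
fp16_gb = weight_size_gb(total, 2)  # assumes 2 bytes per parameter
encoder_share = ENCODER_PARAMS / total

print(f"{total / 1e6:.0f}M params, fp32 ~{fp32_gb:.2f} GB, fp16 ~{fp16_gb:.2f} GB")
print(f"image encoder share of parameters: {encoder_share:.1%}")
```

Under these assumptions, nearly all of the parameters sit in the image encoder, which is consistent with the FAQ's note that only the encoder needs a GPU for efficient inference.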

How long does inference take?

The image encoder takes ~0.15 seconds on an NVIDIA A100 GPU. The prompt encoder and mask decoder take ~50ms on CPU in the browser using multithreaded SIMD execution.
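
These two figures can be combined into a back-of-the-envelope latency for an interactive session on one image. The sketch assumes the FAQ timings hold as stated and ignores image upload, embedding transfer, and rendering time. The function names are illustrative.

```python
ENCODER_SECONDS = 0.15  # once per image, on an NVIDIA A100 (FAQ figure)
DECODER_SECONDS = 0.05  # once per prompt, on CPU in the browser (FAQ figure)


def session_seconds(num_prompts):
    """Total compute time for one image and num_prompts prompts."""
    return ENCODER_SECONDS + num_prompts * DECODER_SECONDS


def amortized_seconds_per_prompt(num_prompts):
    """Average compute time per prompt once the encoder cost is spread out."""
    return session_seconds(num_prompts) / num_prompts


for n in (1, 10, 100):
    print(n, round(session_seconds(n), 3), round(amortized_seconds_per_prompt(n), 4))
```

As the number of prompts grows, the per-prompt cost approaches the 50 ms decoder time, which is what makes click-by-click interaction feel responsive.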

What data was the model trained on?

The model was trained on Meta AI's SA-1B dataset, which can be browsed in the accompanying dataset viewer.

How long does it take to train the model?

The model was trained for 3-5 days on 256 A100 GPUs.
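
Expressed as GPU-hours, that range works out as follows. This is simple arithmetic on the FAQ figures and assumes all 256 GPUs were in use for the full duration.

```python
NUM_GPUS = 256  # A100 GPUs, from the FAQ


def gpu_hours(days, num_gpus=NUM_GPUS):
    """GPU-hours for a run of the given length, assuming full utilization."""
    return days * 24 * num_gpus


low, high = gpu_hours(3), gpu_hours(5)
print(f"{low:,} to {high:,} A100 GPU-hours")  # 18,432 to 30,720 A100 GPU-hours
```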

Does the model produce mask labels?

No, the model predicts object masks only and does not generate labels.

Does the model work on videos?

Currently the model only supports images or individual frames from videos.

Where can I find the code?

The code is available on [GitHub](https://github.com/facebookresearch/segment-anything).

Pricing Plans

Free
Free Plan
