AI Tech Suite

Segment Anything Model (SAM)

About

Segment Anything Model (SAM) is a new AI model from Meta AI that can "cut out" any object in any image with a single click. SAM is a promptable segmentation system that has learned a general notion of what objects are, which enables zero-shot generalization to unfamiliar objects and images without additional training. It accepts several kinds of input prompts, allowing flexible integration with other systems, and produces extensible outputs. The model was trained on the SA-1B dataset, which consists of 11M images and 1B+ masks.

Features

• zero-shot generalization

• flexible integration with other systems

• extensible outputs

• multiple valid masks for ambiguous prompts

• automatic segmentation

• interactive point and box prompts

• promptable segmentation system

FAQs

What type of prompts are supported?

Foreground and background points, bounding boxes, and masks.
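As a rough illustration of how point and box prompts could be represented in code, the sketch below packs them into a plain dictionary. The key names `point_coords`, `point_labels`, and `box` mirror arguments of `SamPredictor.predict` in the linked GitHub repository, and the label convention (1 for foreground, 0 for background) follows that repository as well. The helper `build_prompt` is hypothetical and not part of SAM; it only validates and arranges values, and it leaves out the mask prompt for brevity.

```python
from typing import Dict, Optional, Sequence, Tuple

Point = Tuple[float, float]               # (x, y) in pixel coordinates
Box = Tuple[float, float, float, float]   # (x_min, y_min, x_max, y_max)

FOREGROUND, BACKGROUND = 1, 0


def build_prompt(
    foreground: Sequence[Point] = (),
    background: Sequence[Point] = (),
    box: Optional[Box] = None,
) -> Dict[str, object]:
    """Hypothetical helper: arrange point and box prompts into one dict."""
    if not foreground and not background and box is None:
        raise ValueError("at least one point or a box is required")
    if box is not None:
        x0, y0, x1, y1 = box
        if x1 <= x0 or y1 <= y0:
            raise ValueError("box must satisfy x_min < x_max and y_min < y_max")
    # Foreground points come first, followed by background points.
    coords = [list(p) for p in foreground] + [list(p) for p in background]
    labels = [FOREGROUND] * len(foreground) + [BACKGROUND] * len(background)
    return {
        "point_coords": coords or None,
        "point_labels": labels or None,
        "box": list(box) if box is not None else None,
    }


prompt = build_prompt(
    foreground=[(120, 80)],
    background=[(10, 10)],
    box=(50, 20, 300, 240),
)
print(prompt)
```

One click on the object plus one click on the background is a common way to resolve an ambiguous selection; the box narrows the region further.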

What is the structure of the model?

The model has three components:

• A ViT-H image encoder that runs once per image and outputs an image embedding.

• A prompt encoder that embeds input prompts such as clicks or boxes.

• A lightweight transformer-based mask decoder that predicts object masks from the image embedding and prompt embeddings.
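The practical consequence of this structure is that the expensive image encoding can be computed once and reused for many prompts. The toy sketch below shows only that data flow. Every class and function in it is a hypothetical stand-in with dummy logic, not SAM's actual code.

```python
from typing import Dict, List, Tuple


class ToyImageEncoder:
    """Stand-in for the heavy image encoder; counts how often it runs."""

    def __init__(self) -> None:
        self.calls = 0

    def __call__(self, image: List[List[int]]) -> Tuple[int, int]:
        self.calls += 1
        # A real encoder returns a dense embedding; here only the image size.
        return (len(image), len(image[0]))


def toy_prompt_encoder(click: Tuple[int, int]) -> Tuple[int, int]:
    """Stand-in for the prompt encoder: passes the click through unchanged."""
    return click


def toy_mask_decoder(embedding, prompt_embedding) -> Dict[str, object]:
    """Stand-in for the mask decoder: combines both embeddings."""
    return {"image_size": embedding, "prompt": prompt_embedding}


encoder = ToyImageEncoder()
image = [[0] * 4 for _ in range(3)]   # 3 rows x 4 columns of dummy pixels

embedding = encoder(image)            # expensive step, done once per image
results = [
    toy_mask_decoder(embedding, toy_prompt_encoder(click))
    for click in [(0, 0), (1, 2), (3, 1)]   # cheap step, repeated per prompt
]

print(encoder.calls, len(results))
```

Three prompts are decoded here, yet the encoder runs a single time, which is what makes interactive clicking feasible.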

What platforms does the model use?

The image encoder is implemented in PyTorch and requires a GPU for efficient inference. The prompt encoder and mask decoder can run directly with PyTorch or be converted to ONNX and run efficiently on CPU or GPU across a variety of platforms that support ONNX Runtime.

How big is the model?

The image encoder has 632M parameters. The prompt encoder and mask decoder have 4M parameters.
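To put these parameter counts in terms of storage, the short calculation below estimates the size of the weights alone. It assumes 4 bytes per parameter for 32-bit floats and 2 bytes for 16-bit floats, and uses decimal megabytes; activations and runtime overhead are not included, so actual memory use will be higher.

```python
IMAGE_ENCODER_PARAMS = 632_000_000   # image encoder, from the answer above
LIGHT_PARAMS = 4_000_000             # prompt encoder + mask decoder


def weight_megabytes(params: int, bytes_per_param: int) -> float:
    """Weights-only size in decimal megabytes (assumed precision)."""
    return params * bytes_per_param / 1_000_000


total_params = IMAGE_ENCODER_PARAMS + LIGHT_PARAMS
fp32_total = weight_megabytes(total_params, 4)   # assumed 32-bit floats
fp16_total = weight_megabytes(total_params, 2)   # assumed 16-bit floats
light_fp32 = weight_megabytes(LIGHT_PARAMS, 4)

print(f"all weights, fp32: {fp32_total:.0f} MB")
print(f"all weights, fp16: {fp16_total:.0f} MB")
print(f"prompt encoder + mask decoder, fp32: {light_fp32:.0f} MB")
```

Under these assumptions the prompt encoder and mask decoder account for well under 1% of the weights, which is consistent with running them on a CPU.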

How long does inference take?

The image encoder takes ~0.15 seconds on an NVIDIA A100 GPU. The prompt encoder and mask decoder take ~50ms on CPU in the browser using multithreaded SIMD execution.
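Because the image encoder runs once per image, its cost is amortized over all prompts on that image. The sketch below works through the arithmetic with the approximate figures quoted above. Note that the two timings were measured on different hardware (an A100 GPU and a browser CPU), so adding them gives only an indicative total.

```python
ENCODE_MS = 150   # ~0.15 s image encoder, once per image
DECODE_MS = 50    # ~50 ms prompt encoder + mask decoder, once per prompt


def total_ms_with_reuse(num_prompts: int) -> int:
    """Encode the image once, then decode every prompt."""
    return ENCODE_MS + num_prompts * DECODE_MS


def total_ms_without_reuse(num_prompts: int) -> int:
    """Hypothetical worst case: re-encode the image for every prompt."""
    return num_prompts * (ENCODE_MS + DECODE_MS)


for n in (1, 10):
    print(n, total_ms_with_reuse(n), total_ms_without_reuse(n))
```

For ten prompts on one image, reusing the embedding takes roughly 650 ms in total, compared with about 2000 ms if the image were re-encoded each time.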

What data was the model trained on?

The model was trained on the SA-1B dataset (11M images and 1B+ masks).

How long does it take to train the model?

The model was trained for 3-5 days on 256 A100 GPUs.
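The training budget can be restated in GPU-hours with a quick calculation. It assumes all 256 GPUs were in use for the full duration, which the answer above does not state explicitly.

```python
NUM_GPUS = 256
HOURS_PER_DAY = 24


def gpu_hours(days: int) -> int:
    """GPU-hours, assuming every GPU is busy for the whole period."""
    return days * HOURS_PER_DAY * NUM_GPUS


low, high = gpu_hours(3), gpu_hours(5)
print(f"{low:,} to {high:,} A100 GPU-hours")
```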

Does the model produce mask labels?

No, the model predicts object masks only and does not generate labels.
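In practice this means a mask carries geometry but no category name. The sketch below derives the area and bounding box from a small hand-written binary mask and leaves the label empty, to be filled in by some separate classifier. The nested-list mask format and all names here are illustrative assumptions, not SAM's output format.

```python
from typing import List, Optional, Tuple

Mask = List[List[int]]   # 1 = object pixel, 0 = background (illustrative)


def mask_area(mask: Mask) -> int:
    """Number of object pixels in the mask."""
    return sum(sum(row) for row in mask)


def mask_bbox(mask: Mask) -> Optional[Tuple[int, int, int, int]]:
    """Tight box (x_min, y_min, x_max, y_max), or None for an empty mask."""
    ys = [y for y, row in enumerate(mask) if any(row)]
    xs = [x for row in mask for x, value in enumerate(row) if value]
    if not ys:
        return None
    return (min(xs), min(ys), max(xs), max(ys))


mask = [
    [0, 0, 0, 0],
    [0, 1, 1, 0],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
]

record = {
    "area": mask_area(mask),
    "bbox": mask_bbox(mask),
    "label": None,   # not produced by the mask; must come from elsewhere
}
print(record)
```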

Does the model work on videos?

Currently the model only supports images or individual frames from videos.
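One way to apply an image-only model to a video is to treat each sampled frame as an independent image. The sketch below shows that loop in generic form. The `segment_frame` callable is a hypothetical placeholder for whatever per-image segmentation is used, and the strings stand in for decoded frames. Each frame is handled in isolation, so nothing here keeps masks consistent from one frame to the next.

```python
from typing import Callable, Iterable, Iterator, Tuple, TypeVar

Frame = TypeVar("Frame")
Result = TypeVar("Result")


def segment_video_frames(
    frames: Iterable[Frame],
    segment_frame: Callable[[Frame], Result],
    stride: int = 1,
) -> Iterator[Tuple[int, Result]]:
    """Yield (frame_index, result) for every `stride`-th frame."""
    if stride < 1:
        raise ValueError("stride must be at least 1")
    for index, frame in enumerate(frames):
        if index % stride == 0:
            yield index, segment_frame(frame)


# Dummy frames and a dummy segmenter, for illustration only.
frames = ["f0", "f1", "f2", "f3", "f4"]
results = list(segment_video_frames(frames, lambda f: f"mask({f})", stride=2))
print(results)
```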

Where can I find the code?

The code is available on [GitHub](https://github.com/facebookresearch/segment-anything).

Pricing Plans

Free
