Segment Anything Model (SAM)

Click to visit website
About
Segment Anything Model (SAM) is a new AI model from Meta AI that can "cut out" any object, in any image, with a single click. SAM is a promptable segmentation system with zero-shot generalization to unfamiliar objects and images, without the need for additional training. It uses various input prompts, enabling flexible integration with other systems and extensible outputs. SAM has learned a general notion of what objects are, enabling zero-shot generalization to unfamiliar objects and images without requiring additional training. The model was trained on the SA-1B dataset consisting of 11M images and 1B+ masks.
Platform
Task
Features
• zero-shot generalization
• flexible integration with other systems
• extensible outputs
• multiple valid masks for ambiguous prompts
• automatic segmentation
• interactive point and box prompts
• promptable segmentation system
FAQs
What type of prompts are supported?
Foreground/background points, Bounding box, Mask
What is the structure of the model?
A ViT-H image encoder that runs once per image and outputs an image embedding. A prompt encoder that embeds input prompts such as clicks or boxes. A lightweight transformer based mask decoder that predicts object masks from the image embedding and prompt embeddings
What platforms does the model use?
The image encoder is implemented in PyTorch and requires a GPU for efficient inference. The prompt encoder and mask decoder can run directly with PyTroch or converted to ONNX and run efficiently on CPU or GPU across a variety of platforms that support ONNX runtime.
How big is the model?
The image encoder has 632M parameters. The prompt encoder and mask decoder have 4M parameters.
How long does inference take?
The image encoder takes ~0.15 seconds on an NVIDIA A100 GPU. The prompt encoder and mask decoder take ~50ms on CPU in the browser using multithreaded SIMD execution.
What data was the model trained on?
The model was trained on our SA-1B dataset. See our dataset viewer.
How long does it take to train the model?
The model was trained for 3-5 days on 256 A100 GPUs.
Does the model produce mask labels?
No, the model predicts object masks only and does not generate labels.
Does the model work on videos?
Currently the model only supports images or individual frames from videos.
Where can I find the code?
Code is available on [GitHub](https://github.com/facebookresearch/segment-anything)
Pricing Plans
Free
Free PlanJob Opportunities
There are currently no job postings for this AI tool.
Ratings & Reviews
No ratings available yet. Be the first to rate this tool!
Featured Tools
Songmeaning
Songmeaning uses AI to reveal the stories and meanings behind song lyrics. It offers lyric translation and AI music generation.
View DetailsWhisper Notes
Offline AI speech-to-text transcription app using Whisper AI. Supports 80+ languages, audio file import, and offers lifetime access with a one-time purchase. Available for iOS and macOS.
View DetailsGitGab
Connects Github repos and local files to AI models (ChatGPT, Claude, Gemini) for coding tasks like implementing features, finding bugs, writing docs, and optimization.
View Details
nuptials.ai
nuptials.ai is an AI wedding planning partner, offering timeline planning, budget optimization, vendor matching, and a 24/7 planning assistant to help plan your perfect day.
View DetailsMake-A-Craft
Make-A-Craft helps you discover craft ideas tailored to your child's age and interests, using materials you already have at home.
View Details
Pixelfox AI
Free online AI photo editor with comprehensive tools for image, face/body, and text. Features include background/object removal, upscaling, face swap, and AI image generation. No sign-up needed, unlimited use for free, fast results.
View Details
Smart Cookie Trivia
Smart Cookie Trivia is a platform offering a wide variety of trivia questions across numerous categories to help users play trivia, explore different topics, and expand their knowledge.
View Details
Code2Docs
AI-powered code documentation generator. Integrates with GitHub. Automates creation of usage guides, API docs, and testing instructions.
View Details