
Picovoice

Click to visit website
About
Picovoice offers on-device voice AI and local LLMs for building speech analytics tools and applications. Their platform is designed for developers, prioritizing compliance, reliability, and scalability. They provide a modular platform with various components like speech-to-text, speaker recognition, wake word detection, and more. Picovoice also offers a local LLM platform (picoLLM) for inference and compression, along with various SDKs and support options ranging from a forever-free plan to enterprise-level plans with dedicated support. Their focus is on enabling developers to build AI-powered products without sacrificing data privacy or control, processing data locally to ensure compliance.
Platform
Task
Features
• speaker diarization
• text-to-speech
• speech-to-text
• speaker recognition
• streaming speech-to-text
• noise suppression
• speech-to-index
• wake word detection (porcupine model), voice activity detection (cobra model), speech-to-intent
FAQs
Can I use the Forever-Free plan for developing commercial voice products?
Yes. Picovoice technology is available under each plan at different usage limits. As long as you’re within the Forever-Free Plan limits, you can use Picovoice to train voice models and deploy them, even commercially, for free.
Can I create custom wake words and voice commands or fine-tune speech-to-text models even with the Forever-Free plan?
Yes. You can use pre-trained AI models or customize and fine-tune them on the Picovoice Console. Picovoice Console is a no-code platform that enables developers to train and fine-tune voice AI models instantly.
What type of support does Picovoice offer?
Picovoice offers several types of support options: Consulting: Ideal for Enterprise Plan users with project or application-specific needs and requirements. Dedicated Support: Ideal for Developer and Enterprise Plan users with integration and implementation-related questions. Enterprise Support Add-on: Ideal for Forever-Free Plan users who need direct access to the Picovoice team to get dedicated support. Jumpstart: Ideal for Forever-Free Plan users who need a head start with the Picovoice platform, capabilities, licensing, and more; or, are interested in expert-guided explorations, rather than self-guided. GitHub Issues: Ideal for Forever-Free Plan users who want to report bugs and issues or interact with the community.
How does Picovoice use end-user data?
Picovoice is private by design. Picovoice technology processes voice data locally on the platform of your choice without sending them to a 3rd party cloud, meaning no tracking, collecting, or storing end-user data.
Why do I need internet connectivity if you process voice data offline?
Picovoice uses AccessKey, hence internet connectivity, to be able to offer its services according to your plan limits. Picovoice engines call home servers to validate the AccessKey and check your plan limits.
What are the Picovoice Forever-Free Plan limits?
Picovoice Forever-Free Plan offers unlimited usage for the engine(s) below: picoLLM Inference. Picovoice Forever-Free Plan offers up to a total of 5 hours/month for the engines below: Leopard Speech-to-Text, Cheetah Speech-to-Text, Koala Noise Suppression, Eagle Speaker Recognition, Falcon Speaker Diarization, Octopus Speech-to-Index. Picovoice Forever-Free Plan offers up to 3 users/month (with no hourly limit) for the engines below: Porcupine Wake Word, Rhino Speech-to-Intent, Cobra Voice Activity Detection. Picovoice Forever-Free Plan offers up to 10M characters/month for the engines below: Orca Streaming Text-to-Speech. Please note that Picovoice tracks “things” that activate its engines, not individuals or their credentials.
How can I check my plan limit and usage?
Go to the Picovoice Console dashboard or profile page to check your limits and usage.
How can I increase my plan limits?
The Developer Plan is the next logical step when your needs surpass the Forever-Free Plan limits. The Developer Plan enables you to: increase your limits to build and test within your team, develop your product with direct support from the Picovoice team, purchase it online directly, without going through a lengthy enterprise procurement process
What are the Picovoice Developer Plan limits?
Picovoice Developer Plan offers up to a total of 1000 hours/month for the engines below: Leopard Speech-to-Text, Cheetah Speech-to-Text, Koala Noise Suppression, Eagle Speaker Recognition, Falcon Speaker Diarization, Octopus Speech-to-Index. Picovoice Developer Plan offers up to 100 users/month (with no hourly limit) for the engines below: Porcupine Wake Word, Rhino Speech-to-Intent, Cobra Voice Activity Detection. Picovoice Forever-Free Plan offers up to 50M characters/month for the engines below: Orca Streaming Text-to-Speech. Please note that Picovoice tracks “things” that activate its engines, not individuals or their credentials.
Do you offer volume discounts?
Yes! The higher the volume, the more discounts we offer. If you have a live product and changing a vendor, contact our Enterprise Sales team. If you’re at the early stages of the software development cycle or not ready to commit, start with the Developer Plan.
Pricing Plans
Forever-Free
Free Plan• picoLLM Inference
• Leopard Speech-to-Text (5 hours/month)
• Cheetah Speech-to-Text (5 hours/month)
• Koala Noise Suppression (5 hours/month)
• Eagle Speaker Recognition (5 hours/month)
• Falcon Speaker Diarization (5 hours/month)
• Octopus Speech-to-Index (5 hours/month)
• Porcupine Wake Word (3 users/month)
• Rhino Speech-to-Intent (3 users/month)
• Cobra Voice Activity Detection (3 users/month), Orca Streaming Text-to-Speech (10M characters/month)
Job Opportunities
Applied Speech Scientist
Picovoice provides on-device voice AI and local LLMs for developers, focusing on privacy, compliance, and scalability. Offers various plans from free to enterprise.
Experience Requirements:
Deep expertise in speech-to-text, text-to-speech, and speaker recognition
Solid understanding of theory, applications, and limitations of deep learning
Experience with at least one mainstream deep learning framework
Practical knowledge of data structures and algorithms
Experience building complex and extensible software
Other Requirements:
Hands-on experience with Python
Hands-on experience with C
Hands-on experience with PyTorch
Hands-on experience with CUDA
Knowledge of graph theory
Knowledge of probability theory
Show more details
Deep Learning Researcher
Picovoice provides on-device voice AI and local LLMs for developers, focusing on privacy, compliance, and scalability. Offers various plans from free to enterprise.
Experience Requirements:
Solid understanding of theory, applications, and limitations of deep learning
Experience with at least one mainstream deep learning framework
Practical knowledge of data structures and algorithms
Experience building complex and extensible software
Other Requirements:
Hands-on experience with Python
Hands-on experience with C
Hands-on experience with PyTorch
Hands-on experience with CUDA
Knowledge of graph theory
Knowledge of probability theory
Show more details
Deep Learning Researcher Intern
Picovoice provides on-device voice AI and local LLMs for developers, focusing on privacy, compliance, and scalability. Offers various plans from free to enterprise.
Experience Requirements:
Solid understanding of theory, applications, and limitations of deep learning
Experience with at least one mainstream deep learning framework
Practical knowledge of data structures and algorithms
Experience building complex and extensible software
Other Requirements:
Hands-on experience with Python
Hands-on experience with C
Hands-on experience with PyTorch
Hands-on experience with CUDA
Knowledge of graph theory
Knowledge of probability theory
Show more details
Ratings & Reviews
No ratings available yet. Be the first to rate this tool!
Alternatives
Retell AI
Retell AI builds production-ready AI voice agents at scale, offering features for building, testing, deployment, and monitoring.
View Details
Cognitiev
Cognitiev provides customizable multilingual AI voice agents with emotional recognition for various business and personal applications.
View DetailsFeatured Tools
Songmeaning
Songmeaning uses AI to reveal the stories and meanings behind song lyrics. It offers lyric translation and AI music generation.
View DetailsWhisper Notes
Offline AI speech-to-text transcription app using Whisper AI. Supports 80+ languages, audio file import, and offers lifetime access with a one-time purchase. Available for iOS and macOS.
View DetailsGitGab
Connects Github repos and local files to AI models (ChatGPT, Claude, Gemini) for coding tasks like implementing features, finding bugs, writing docs, and optimization.
View Details
nuptials.ai
nuptials.ai is an AI wedding planning partner, offering timeline planning, budget optimization, vendor matching, and a 24/7 planning assistant to help plan your perfect day.
View DetailsMake-A-Craft
Make-A-Craft helps you discover craft ideas tailored to your child's age and interests, using materials you already have at home.
View Details
Pixelfox AI
Free online AI photo editor with comprehensive tools for image, face/body, and text. Features include background/object removal, upscaling, face swap, and AI image generation. No sign-up needed, unlimited use for free, fast results.
View Details
Smart Cookie Trivia
Smart Cookie Trivia is a platform offering a wide variety of trivia questions across numerous categories to help users play trivia, explore different topics, and expand their knowledge.
View Details
Code2Docs
AI-powered code documentation generator. Integrates with GitHub. Automates creation of usage guides, API docs, and testing instructions.
View Details