
VidVoice

Click to visit website
About
VidVoice is a real-time video translation service that allows users to understand and speak any language without interpreters, subtitles, or robotic voiceovers. It uses AI to achieve native-sounding voiceovers and lip synchronization. The service is platform-agnostic, working with video conferencing tools like Skype, Zoom, Microsoft Teams, and Google Meet. Beyond video conferencing, VidVoice can dub videos, clean up audio by removing unwanted sounds, and translate videos into multiple languages. The technology behind VidVoice is based on a real-time Native Lip Sync algorithm that makes users look and sound like native speakers. Yepic AI developed VidVoice with support from the University of Surrey's Centre for Vision, Speech and Signal Processing (CVSSP).
Platform
Features
• secure and private
• video dubbing
• real-time lip sync
• simultaneous video translation
• clean videos - remove "urghs" and "ahhs"
• platform agnostic (skype, zoom, microsoft teams, google meet)
• native-sounding voiceovers
FAQs
Is it really real time?
Yes, but the translation element is not word for word so there are slight delays, but lip rendering based on translated audio is real time. Our technology reconnects the verbal and non verbal elements of translation assuring nothing is lost in translation.
When can I sign up?
We are currently piloting the tool with large corporates, but anyone can apply to Join The Beta
Will you account for pauses?
Yes, our algorithm remaps facial expressions and translated mouth movements.
Why is nobody else doing this?
We are ahead of SOTA research coming from universities. The major technical challenges lie in: 1) Identifying landmarks and overlaying a face mesh in real time 2) Achieving high quality point tracking of a typical live video feed with continuous unpredictable movement 3) Successfully translating and generating dubbed video in real-time 4) Maintaining expressions, pauses and appearance consistently. Yepic AI has a patent pending which achieves all the above, the next phase of development will focus on reducing the amount of GPU required and exploring newer transcription and translation architectures like like Speech2Speech.
Why are you doing this?
Short Answer: We love mad challenges and have a very personal pain that's inspired the company genesis. Long Answer: Monolingualism in a globalised world is the illiteracy of the 21st century. It costs Britain £48B annually. In the wake of COVID-19, Brexit Britain must play to its strengths and unlock the huge potential of service exports. The UK is already a global leader in service (legal, medical and consulting) exports but only 10% of UK SMEs are exporting(British Business Bank) and the numbers are falling. The rise of telehealth, global remote teams and "Zoom services" are creating an ocean of untapped export opportunities for SMEs, but… 89% of SMEs report language barriers as the number one obstacle to export (DIT). The IMF reports that by 2030 43% of global GDP will originate from BRIC nations. In Brazil, Russia and China >3% speak English fluently, making on-demand interpretation essential to access emerging markets. In the UK, less than 30% of the population can read/write another language, compared to 80% in Europe, putting us at a massive disadvantage, especially post Brexit. Additionally, the UK Gov just awarded a £360m contract year on interpretation/translation services. Translator availability is a growing problem. The lucky few global UK superstar brands that do export spend £billions on multilingual intermediaries, especially when entering emerging markets. An SME hiring a Portuguese translator (to enter the Brazilian market) costs £300 per hour or £1500 per day (Interprefy). To engage a client multiple teams must meet multiple stakeholders (with an interpreter) overall number of months and translate many documents at a cost of £10,000+ travel/Legal per lead, £50-100,000 per customer (1 to 5/10 win ratio). During the pandemic, many UK businesses tried to overcome language barriers through online simultaneous interpretation via Zoom (expensive) or SOTA machine translation ML (inexpensive but distracting). Most video conferencing tools now offer real-time transcription and translation subtitles. Through third-party apps voice over can be generated using Text-to-Speech. However, today's solutions distract from participants' non-verbal cues; body language and are very distracting. In the UK and Globally Telehealth is being widely adopted but the experience for non-natives is often an afterthought. Financially VidVoice is incentivised to pursue opportunities in the healthcare market subject to gaining relevant approvals. Our go-to-market begins with internal communications at MNCs, then External coms like sales meetings before specialising in niche medical and research applications.
Job Opportunities
Senior Sales Consultant - AI Learning Solutions
VidVoice provides real-time video translation with native lip sync and voiceovers for video conferencing and video dubbing.
Benefits:
Competitive Compensation and Equity
Education Requirements:
Bachelor’s degree in Business, Marketing, Education Technology, or a related field
Experience Requirements:
Minimum of 5 years of professional sales experience, with a proven track record in the education technology or corporate training sector
Strong capability in consultative sales, strategic account management, and client relationship building
Deep understanding of the L&D sector, including key drivers, challenges, and technology trends
Comfort with modern sales software, CRM systems, and an understanding of AI and its applications in learning
Excellent communication and presentation skills, capable of effectively articulating complex solutions in a clear and persuasive manner
Responsibilities:
Client Acquisition and Growth
Solution-Oriented Sales
Relationship Management
Market Insights
Sales Targets (monthly and quarterly sales targets, contributing to the overall growth objectives of Yepic AI.)Collaboration and Feedback
Show more details
Sales Development Representative (SDR)
VidVoice provides real-time video translation with native lip sync and voiceovers for video conferencing and video dubbing.
Benefits:
Competitive Compensation
Comprehensive Health Benefits
Flexible Work Environment
Career Development
Inclusive and Dynamic Culture
Education Requirements:
A Bachelor’s degree in Business, Marketing, or a related field
Experience Requirements:
3-4 years of experience in a sales or business development role, preferably within L&D SaaS or HR SaaS
Strong communication and interpersonal skills, with the ability to build rapport with diverse clients
Proven ability to meet and exceed sales targets
Self-motivated and goal-oriented with a proactive approach to problem-solving
Experience with CRM software (e.g., Salesforce) and proficiency in using sales tools and platforms
Other Requirements:
An understanding of AI and technology trends is a plus
Responsibilities:
Lead Qualification
Outbound Prospecting
Appointment Setting
Initial Outreach
Follow-Up (Maintain communication with prospects to ensure they progress through the sales funnel and address any questions or concerns they may have.)CRM Management (Ensure all interactions and lead data are accurately recorded in the CRM system. Track and report on key performance metrics to ensure targets are met.)Collaboration (Work closely with the sales and marketing teams to develop strategies for nurturing leads and improving conversion rates. Participate in team meetings and provide feedback to enhance the overall sales process.)
Show more details
Research Engineer, Computer Vision
VidVoice provides real-time video translation with native lip sync and voiceovers for video conferencing and video dubbing.
Benefits:
Competitive Compensation
Comprehensive Health Benefits
Flexible Work Environment
Career Development
Inclusive and Dynamic Culture
Education Requirements:
A Master’s or PhD in Computer Science, Engineering, or a related field
Experience Requirements:
3+ years of experience in machine learning and computer vision, with practical application of generative models
Proficiency in Python and experience with deep learning frameworks such as PyTorch
Familiarity with cloud deployment tools such as Docker and Azure
Experience with CI/CD processes and tools
Strong problem-solving skills and the ability to communicate effectively in a diverse team environment
Other Requirements:
Passion for software development and writing clean, maintainable code
Experience with C++ is a plus
Responsibilities:
Generative Avatar Performance
High-Quality Code Development
Algorithm and Model Development
End-to-End ML Lifecycle
Research and Innovation (Explore novel methods to solve challenges in the generative AI space. Stay up-to-date with the latest advancements and incorporate them into our development processes.)Collaboration and Code Reviews (Engage in code reviews and collaborate with team members to ensure best practices are followed. Contribute to a culture of continuous improvement and knowledge sharing.)Cloud Deployment (Work on deploying models and services on cloud infrastructure. Utilize Docker, Azure, and other cloud tools to ensure robust and scalable deployments.)
Show more details
Ratings & Reviews
No ratings available yet. Be the first to rate this tool!
Alternatives

AiLuvio
AiLuvio uses AI for real-time video call dubbing and translation in over 30 languages. Secure, ad-free, and offers various subscription plans.
View Details
VideoTranslator.AI
AI-powered video translation and teleconferencing tool supporting 120+ languages.
View Details
Dubly.AI
AI-powered video translation tool that translates videos into 28+ languages, preserving the original voice.
View Details
Lipdub
AI-powered video translation app that changes your voice and lip movements to match the translated language.
View Details
SignStudio
AI-powered platform providing accurate and seamless BSL and ASL sign language translation for videos, websites, and transport.
View DetailsFeatured Tools
Songmeaning
Songmeaning uses AI to reveal the stories and meanings behind song lyrics. It offers lyric translation and AI music generation.
View DetailsWhisper Notes
Offline AI speech-to-text transcription app using Whisper AI. Supports 80+ languages, audio file import, and offers lifetime access with a one-time purchase. Available for iOS and macOS.
View DetailsGitGab
Connects Github repos and local files to AI models (ChatGPT, Claude, Gemini) for coding tasks like implementing features, finding bugs, writing docs, and optimization.
View Details
nuptials.ai
nuptials.ai is an AI wedding planning partner, offering timeline planning, budget optimization, vendor matching, and a 24/7 planning assistant to help plan your perfect day.
View DetailsMake-A-Craft
Make-A-Craft helps you discover craft ideas tailored to your child's age and interests, using materials you already have at home.
View Details
Pixelfox AI
Free online AI photo editor with comprehensive tools for image, face/body, and text. Features include background/object removal, upscaling, face swap, and AI image generation. No sign-up needed, unlimited use for free, fast results.
View Details
Smart Cookie Trivia
Smart Cookie Trivia is a platform offering a wide variety of trivia questions across numerous categories to help users play trivia, explore different topics, and expand their knowledge.
View Details
Code2Docs
AI-powered code documentation generator. Integrates with GitHub. Automates creation of usage guides, API docs, and testing instructions.
View Details