Defined.ai

Click to visit website
About
Defined.ai is a company that provides high-quality, ethically sourced AI training data. They offer a large marketplace with diverse datasets for various applications, including spontaneous speech, scripted monologues, interactive voice response (IVR), and more. They also provide custom data services, quality control, and support. The company is focused on ethical AI development and maintains transparency in their data collection and handling processes.
Platform
Task
Features
• transcription
• expert support
• data collection
• data annotation
• ethical data sourcing
• high-quality data
• custom data services
• large selection of datasets
FAQs
How and from where were the participants in these datasets recruited?
Contributors are recruited using various methods, including organic and paid acquisition strategies, across self-owned channels, third-party platforms, and partnerships. Targeting is based on demographics, skills, experience, language, device, interests, and real-time context.
How do we inform the dataset participants about how the data collected will be used?
Contributors consent to our Terms of Use, Privacy Policy, and Cookies Policy before using the platform. The Privacy Policy details information collection and usage. Contributors can delete their accounts at any time, leading to anonymization of their data. We are GDPR compliant and ISO 27001 certified.
How do you determine pay rates for your participants in various locales?
Our pay policy ensures at least minimum wage, and in some cases, living wages. Rates depend on factors such as skill set and ability to attract contributors. Higher skills (e.g., medical collections) necessitate higher pay.
What are the terms of the Data License?
Defined.ai datasets are covered by a standard license agreement (link provided in the FAQ). The license is perpetual and allows commercialization of models built using the data.
What is Spontaneous IVR data and how it is gathered?
Spontaneous IVR data is gathered by having a human respond to an IVR system, following real-life scenarios. The human repeats their query in different ways. The speech is transcribed. The recording is done via telephony (8khz 16 bit per channel).
What is Spontaneous Dialog Data and how it is gathered?
Spontaneous Dialog data involves crowd members following pre-studied scenarios and recording conversations. One plays the agent, the other a customer with spontaneous content. Recording is done via telephony (8khz 16 bit per channel) and transcribed.
What is Scripted Monologue data and how it is gathered?
Scripted Monologue data involves speakers reading aloud from a given prompt. Clients receive the audio, prompt, and speaker information. Audio is recorded on-device (typically 16khz 16 bit). Device information is also provided.
If I buy 200h of data, does it mean I will get 200h of pure speech?
Audio duration is measured. Scripted speech includes pre- and post-reading silence. Dialogue speech generally has little silence except for natural breaks. For IVR, human speech segments comprise about 50% of the audio duration.
Can I get a sample of a dataset?
Free samples are available for download on the website.
Can you package subsets of data for me according to specific requirements of age, gender and accent?
Yes, custom datasets can be packaged based on specific requirements such as age, gender, and accent.
I need data that is not listed on the marketplace. Can you help me with my request?
We can help by either creating a custom collection or by informing about datasets planned for the future that may fulfill the requirements.
What are the payment options?
USD via ACH bank transfer. Purchase orders, SOWs, and other documentation are available upon request.
When will my purchased assets be delivered?
Datasets are delivered after payment is received. ACH transfers require cleared funds (2-3 business days). Custom orders may take longer.
Are there specific terms for Academia?
Yes, datasets are offered with significant discounts or even for free to Academia after a due diligence process.
Do you offer discounts?
Yes, discounts are available based on data volume. Contact us for a quotation.
Job Opportunities
AI/ML Sales Executive (US)
Defined.ai offers a large marketplace for high-quality, ethically sourced AI training data, providing diverse datasets and custom data services.
Benefits:
Flexible working schedule and hybrid model
Excellent career development opportunities
Culture of feedback and continuous improvement
International and diverse team
Continuous training opportunities
Education Requirements:
Bachelor's degree or equivalent
Experience Requirements:
6+ years of proven experience working as a Sales Executive selling Professional Services / Data / Customized Projects / Consultative Sales into Enterprise accounts (B2B)
Other Requirements:
Proficient with Salesforce / CRM and MS Office
Ability to communicate, present and influence all levels of the organization, including executives
Strong ability to handle directly and close complex deals above $1M
Knowledge in AI/ML
Technical Sales experience will be a plus
Responsibilities:
Hunting for new logos in the assigned Enterprise verticals
Expanding the company’s footprint in existing enterprise or strategic accounts
Managing enterprise and or strategic customers with significant deal sizes $500k-$5M
Creating organic revenue streams working with the solutions and customer success teams within assigned territories/regions
Supporting and collaborating with internal partners to build successful proof of concepts, use cases and RFPs etc
Show more details
B2B Technical Writer
Defined.ai offers a large marketplace for high-quality, ethically sourced AI training data, providing diverse datasets and custom data services.
Benefits:
Flexible working schedule and hybrid model
Excellent career development opportunities
Culture of feedback and continuous improvement
International and diverse team
Continuous training opportunities
Experience Requirements:
5+ years of B2B technical writing experience
Other Requirements:
Strong understanding of AI concepts
Exceptional writing skills
Ability to work effectively with cross-functional teams
Knowledge of SEO best practices
Responsibilities:
Write AI-focused B2B content
Collaborate with product and engineering teams
Support marketing team by developing content
Ensure content is relevant and localized
Implement SEO best practices
Show more details
Backend Engineer
Defined.ai offers a large marketplace for high-quality, ethically sourced AI training data, providing diverse datasets and custom data services.
Benefits:
Flexible working schedule and hybrid model
Excellent career development opportunities
Culture of feedback and continuous improvement
International and diverse team
Continuous training opportunities
Education Requirements:
BSc or MSc in Computer Science or similar background
Experience Requirements:
Mid to senior-level of .Net C# and software quality best practices
Other Requirements:
Experience with working with Agile software development methodologies
Worked with Azure services such as DevOps, Kubernetes and Blob Storage
Deep understanding of a fully automated software development lifecycle via CI/CD pipelines
Comfortable with applying software design and architectural patterns/principles
Accustomed to working with microservices in .Net C#, MS SQL Server and RabbitMQ
Knowledge of RESTful APIs
Familiarity with shell scripting
Proficient in both written and spoken English
Responsibilities:
Work on the back-end side of our platform by developing tools to automate workloads for data collection and processing of AI training datasets
Develop and evolve a microservice- and event-driven architecture based mainly on .Net C#, SQL Server, and RabbitMQ
Own the entire lifecycle (from conception to release and maintenance) of the services and applications your team owns
Be working in a multidisciplinary (QA, Back- and Front-end Engineers, Product Managers, etc.) and multicultural Agile team
Collaborate with the Product, Architecture, Infrastructure, and DevOps teams as well
Show more details
Ratings & Reviews
No ratings available yet. Be the first to rate this tool!
Featured Tools
Songmeaning
AI tool uncovering stories and meaning behind song lyrics. Offers lyric translation and AI music generation.
View DetailsWhisper Notes
Offline AI speech-to-text transcription app using Whisper AI. Supports 80+ languages, audio file import, and offers lifetime access with a one-time purchase. Available for iOS and macOS.
View DetailsGitGab
GitGab connects your Github repos to ChatGPT, Claude, and Gemini, contextualizing AI models with your code to implement features and find bugs.
View Details
Fully Booked AI
Fully Booked AI is an all-in-one solution designed specifically for salons and med spas, offering AI-powered marketing automation, lead generation, and streamlined communication.
View DetailsMake-A-Craft
Make-A-Craft helps you discover craft ideas tailored to your child's age and interests, using materials you already have at home.
View Details
GIF Face Swap
Free online tool to swap faces in GIFs. Upload your GIF and a target face to create fun, shareable images. No registration or limits.
View DetailsUnAI My Text
UnAI My Text transforms AI content into natural, human-like text, bypassing AI detection. It's easy to use, fast, and free, offering unlimited usage and multi-language support to make AI-generated text sound more human.
View Details
Pixelfox AI
Free online AI photo editor with comprehensive tools for image, face/body, and text. Features include background/object removal, upscaling, face swap, and AI image generation. No sign-up needed, unlimited use for free, fast results.
View Details
Smart Cookie Trivia
A trivia website with questions in multiple categories. Play now and expand your knowledge!
View Details
1Template
1Template makes professional resume creation simple and powerful. It offers a single, modifiable template that provides expert guidance every step of the way. It uses AI to help you craft a better resume.
View Details
TheLibrarian.io
WhatsApp AI Assistant designed to help Master Your Inbox, Control Your Schedule, and Find Anything You Need — so you can focus on what truly matters.
View DetailsVerisquad
Verisquad is an AI-powered multi-agent system for comprehensive claim verification, leveraging coordinated AI agents and evidence-based fact-checking to provide accurate veracity ratings.
View Details
GetLeads
AI-powered lead generation tool for finding relevant companies and decision maker contacts, with features like prospecting, lookalike leads, AI message generation, and automated email outreach.
View Details
Werd.ai
Werd.ai is an AI writing tool for creators. It streamlines content creation with research, SEO keyword targeting, and trend analysis, using AI to automate tasks and enhance workflow.
View Details