Defined.ai favicon

Defined.ai

Hiring
Defined.ai screenshot
Click to visit website
Feature this AI

About

Defined.ai is a company that provides high-quality, ethically sourced AI training data. They offer a large marketplace with diverse datasets for various applications, including spontaneous speech, scripted monologues, interactive voice response (IVR), and more. They also provide custom data services, quality control, and support. The company is focused on ethical AI development and maintains transparency in their data collection and handling processes.

Platform
Web
Task
data provision

Features

transcription

expert support

data collection

data annotation

ethical data sourcing

high-quality data

custom data services

large selection of datasets

FAQs

How and from where were the participants in these datasets recruited?

Contributors are recruited using various methods, including organic and paid acquisition strategies, across self-owned channels, third-party platforms, and partnerships. Targeting is based on demographics, skills, experience, language, device, interests, and real-time context.

How do we inform the dataset participants about how the data collected will be used?

Contributors consent to our Terms of Use, Privacy Policy, and Cookies Policy before using the platform. The Privacy Policy details information collection and usage. Contributors can delete their accounts at any time, leading to anonymization of their data. We are GDPR compliant and ISO 27001 certified.

How do you determine pay rates for your participants in various locales?

Our pay policy ensures at least minimum wage, and in some cases, living wages. Rates depend on factors such as skill set and ability to attract contributors. Higher skills (e.g., medical collections) necessitate higher pay.

What are the terms of the Data License?

Defined.ai datasets are covered by a standard license agreement (link provided in the FAQ). The license is perpetual and allows commercialization of models built using the data.

What is Spontaneous IVR data and how it is gathered?

Spontaneous IVR data is gathered by having a human respond to an IVR system, following real-life scenarios. The human repeats their query in different ways. The speech is transcribed. The recording is done via telephony (8khz 16 bit per channel).

What is Spontaneous Dialog Data and how it is gathered?

Spontaneous Dialog data involves crowd members following pre-studied scenarios and recording conversations. One plays the agent, the other a customer with spontaneous content. Recording is done via telephony (8khz 16 bit per channel) and transcribed.

What is Scripted Monologue data and how it is gathered?

Scripted Monologue data involves speakers reading aloud from a given prompt. Clients receive the audio, prompt, and speaker information. Audio is recorded on-device (typically 16khz 16 bit). Device information is also provided.

If I buy 200h of data, does it mean I will get 200h of pure speech?

Audio duration is measured. Scripted speech includes pre- and post-reading silence. Dialogue speech generally has little silence except for natural breaks. For IVR, human speech segments comprise about 50% of the audio duration.

Can I get a sample of a dataset?

Free samples are available for download on the website.

Can you package subsets of data for me according to specific requirements of age, gender and accent?

Yes, custom datasets can be packaged based on specific requirements such as age, gender, and accent.

I need data that is not listed on the marketplace. Can you help me with my request?

We can help by either creating a custom collection or by informing about datasets planned for the future that may fulfill the requirements.

What are the payment options?

USD via ACH bank transfer. Purchase orders, SOWs, and other documentation are available upon request.

When will my purchased assets be delivered?

Datasets are delivered after payment is received. ACH transfers require cleared funds (2-3 business days). Custom orders may take longer.

Are there specific terms for Academia?

Yes, datasets are offered with significant discounts or even for free to Academia after a due diligence process.

Do you offer discounts?

Yes, discounts are available based on data volume. Contact us for a quotation.

Job Opportunities

Defined.ai favicon
Defined.ai

AI/ML Sales Executive (US)

Defined.ai offers a large marketplace for high-quality, ethically sourced AI training data, providing diverse datasets and custom data services.

salesremotefull-time

Benefits:

  • Flexible working schedule and hybrid model

  • Excellent career development opportunities

  • Culture of feedback and continuous improvement

  • International and diverse team

  • Continuous training opportunities

Education Requirements:

  • Bachelor's degree or equivalent

Experience Requirements:

  • 6+ years of proven experience working as a Sales Executive selling Professional Services / Data / Customized Projects / Consultative Sales into Enterprise accounts (B2B)

Other Requirements:

  • Proficient with Salesforce / CRM and MS Office

  • Ability to communicate, present and influence all levels of the organization, including executives

  • Strong ability to handle directly and close complex deals above $1M

  • Knowledge in AI/ML

  • Technical Sales experience will be a plus

Responsibilities:

  • Hunting for new logos in the assigned Enterprise verticals

  • Expanding the company’s footprint in existing enterprise or strategic accounts

  • Managing enterprise and or strategic customers with significant deal sizes $500k-$5M

  • Creating organic revenue streams working with the solutions and customer success teams within assigned territories/regions

  • Supporting and collaborating with internal partners to build successful proof of concepts, use cases and RFPs etc

Show more details

B2B Technical Writer

Defined.ai offers a large marketplace for high-quality, ethically sourced AI training data, providing diverse datasets and custom data services.

Benefits:

  • Flexible working schedule and hybrid model

  • Excellent career development opportunities

  • Culture of feedback and continuous improvement

  • International and diverse team

  • Continuous training opportunities

Experience Requirements:

  • 5+ years of B2B technical writing experience

Other Requirements:

  • Strong understanding of AI concepts

  • Exceptional writing skills

  • Ability to work effectively with cross-functional teams

  • Knowledge of SEO best practices

Responsibilities:

  • Write AI-focused B2B content

  • Collaborate with product and engineering teams

  • Support marketing team by developing content

  • Ensure content is relevant and localized

  • Implement SEO best practices

Show more details

Backend Engineer

Defined.ai offers a large marketplace for high-quality, ethically sourced AI training data, providing diverse datasets and custom data services.

engineeringonsiteLisbon, PTfull-time

Benefits:

  • Flexible working schedule and hybrid model

  • Excellent career development opportunities

  • Culture of feedback and continuous improvement

  • International and diverse team

  • Continuous training opportunities

Education Requirements:

  • BSc or MSc in Computer Science or similar background

Experience Requirements:

  • Mid to senior-level of .Net C# and software quality best practices

Other Requirements:

  • Experience with working with Agile software development methodologies

  • Worked with Azure services such as DevOps, Kubernetes and Blob Storage

  • Deep understanding of a fully automated software development lifecycle via CI/CD pipelines

  • Comfortable with applying software design and architectural patterns/principles

  • Accustomed to working with microservices in .Net C#, MS SQL Server and RabbitMQ

  • Knowledge of RESTful APIs

  • Familiarity with shell scripting

  • Proficient in both written and spoken English

Responsibilities:

  • Work on the back-end side of our platform by developing tools to automate workloads for data collection and processing of AI training datasets

  • Develop and evolve a microservice- and event-driven architecture based mainly on .Net C#, SQL Server, and RabbitMQ

  • Own the entire lifecycle (from conception to release and maintenance) of the services and applications your team owns

  • Be working in a multidisciplinary (QA, Back- and Front-end Engineers, Product Managers, etc.) and multicultural Agile team

  • Collaborate with the Product, Architecture, Infrastructure, and DevOps teams as well

Show more details

Explore AI Career Opportunities

Social Media

Ratings & Reviews

No ratings available yet. Be the first to rate this tool!

Alternatives

Gradient Health favicon
Gradient Health

Gradient Health is a medical technology company that simplifies access to large, unbiased medical imaging datasets to accelerate AI development in healthcare.

View Details
Crustdata favicon
Crustdata

Crustdata is a real-time company and people data provider that fuels commercial, internal, and sales platforms, specifically designed to power AI intelligence layers.

View Details
Aleno favicon
Aleno

Aleno is a real-time on-chain market data provider for any chain, protocol, or token, offering unmatched accuracy and reliability for market insights.

View Details

Featured Tools

Songmeaning favicon
Songmeaning

Songmeaning is an AI-powered tool that helps users uncover the hidden stories and meanings behind song lyrics, enhancing their musical understanding.

View Details
PropLytics favicon
PropLytics

PropLytics is an AI-powered platform for real estate investors, providing data-backed ROI insights to help make smarter, faster investment decisions.

View Details
GitGab favicon
GitGab

GitGab is an AI tool that contextualizes top AI models like ChatGPT, Claude, and Gemini with your GitHub repositories and local code for enhanced development.

View Details
nuptials.ai favicon
nuptials.ai

nuptials.ai is an AI wedding planning partner, offering timeline planning, budget optimization, vendor matching, and a 24/7 planning assistant to help plan your perfect day.

View Details
Fastbreak AI favicon
Fastbreak AI

Fastbreak AI is an ultimate AI-powered sports operations engine, offering intelligent software for sports league scheduling, tournament management, and brand sponsorship.

View Details
BestFaceSwap favicon
BestFaceSwap

BestFaceSwap is an AI-powered online tool that enables users to easily change faces in videos and photos with high-quality and realistic results.

View Details
Healing Grace Alternative Healing favicon
Healing Grace Alternative Healing

Healing Grace Alternative Healing is a center offering personalized care through organic bath and body products, natural remedies, and spiritual healing practices.

View Details
Smart Cookie Trivia favicon
Smart Cookie Trivia

Smart Cookie Trivia is a platform offering a wide variety of trivia questions across numerous categories to help users play trivia, explore different topics, and expand their knowledge.

View Details

Latest AI News

View All News
Scientists Embed Hidden AI Prompts to Manipulate Peer Review
Scientists Embed Hidden AI Prompts to Manipulate Peer Review

Invisible AI prompts in academic papers expose a cunning new tactic to manipulate peer review and undermine scientific integrity.

Jul 5, 2025
Read More →
US Extends AI Chip Controls to Malaysia, Thailand to Block China Smuggling
US Extends AI Chip Controls to Malaysia, Thailand to Block China Smuggling

US tightens AI chip export controls on Malaysia and Thailand, trapping key semiconductor hubs in the US-China tech war.

Jul 5, 2025
Read More →
AI's Fatal Flaw: Simple Cat Facts Shatter Advanced Reasoning
AI's Fatal Flaw: Simple Cat Facts Shatter Advanced Reasoning

Irrelevant inputs, like cat facts, cripple advanced AI's reasoning, highlighting a dire need for context engineering.

Jul 5, 2025
Read More →