Defined.ai favicon

Defined.ai

Hiring
Defined.ai screenshot
Click to visit website
Feature this AI

About

Defined.ai is a company that provides high-quality, ethically sourced AI training data. They offer a large marketplace with diverse datasets for various applications, including spontaneous speech, scripted monologues, interactive voice response (IVR), and more. They also provide custom data services, quality control, and support. The company is focused on ethical AI development and maintains transparency in their data collection and handling processes.

Platform
Web
Keywords
machine learningaidatatraining datadatasets marketplace
Task
data provision

Features

transcription

expert support

data collection

data annotation

ethical data sourcing

high-quality data

custom data services

large selection of datasets

FAQs

How and from where were the participants in these datasets recruited?

Contributors are recruited using various methods, including organic and paid acquisition strategies, across self-owned channels, third-party platforms, and partnerships. Targeting is based on demographics, skills, experience, language, device, interests, and real-time context.

How do we inform the dataset participants about how the data collected will be used?

Contributors consent to our Terms of Use, Privacy Policy, and Cookies Policy before using the platform. The Privacy Policy details information collection and usage. Contributors can delete their accounts at any time, leading to anonymization of their data. We are GDPR compliant and ISO 27001 certified.

How do you determine pay rates for your participants in various locales?

Our pay policy ensures at least minimum wage, and in some cases, living wages. Rates depend on factors such as skill set and ability to attract contributors. Higher skills (e.g., medical collections) necessitate higher pay.

What are the terms of the Data License?

Defined.ai datasets are covered by a standard license agreement (link provided in the FAQ). The license is perpetual and allows commercialization of models built using the data.

What is Spontaneous IVR data and how it is gathered?

Spontaneous IVR data is gathered by having a human respond to an IVR system, following real-life scenarios. The human repeats their query in different ways. The speech is transcribed. The recording is done via telephony (8khz 16 bit per channel).

What is Spontaneous Dialog Data and how it is gathered?

Spontaneous Dialog data involves crowd members following pre-studied scenarios and recording conversations. One plays the agent, the other a customer with spontaneous content. Recording is done via telephony (8khz 16 bit per channel) and transcribed.

What is Scripted Monologue data and how it is gathered?

Scripted Monologue data involves speakers reading aloud from a given prompt. Clients receive the audio, prompt, and speaker information. Audio is recorded on-device (typically 16khz 16 bit). Device information is also provided.

If I buy 200h of data, does it mean I will get 200h of pure speech?

Audio duration is measured. Scripted speech includes pre- and post-reading silence. Dialogue speech generally has little silence except for natural breaks. For IVR, human speech segments comprise about 50% of the audio duration.

Can I get a sample of a dataset?

Free samples are available for download on the website.

Can you package subsets of data for me according to specific requirements of age, gender and accent?

Yes, custom datasets can be packaged based on specific requirements such as age, gender, and accent.

I need data that is not listed on the marketplace. Can you help me with my request?

We can help by either creating a custom collection or by informing about datasets planned for the future that may fulfill the requirements.

What are the payment options?

USD via ACH bank transfer. Purchase orders, SOWs, and other documentation are available upon request.

When will my purchased assets be delivered?

Datasets are delivered after payment is received. ACH transfers require cleared funds (2-3 business days). Custom orders may take longer.

Are there specific terms for Academia?

Yes, datasets are offered with significant discounts or even for free to Academia after a due diligence process.

Do you offer discounts?

Yes, discounts are available based on data volume. Contact us for a quotation.

Job Opportunities

Defined.ai favicon
Defined.ai

AI/ML Sales Executive (US)

Defined.ai offers a large marketplace for high-quality, ethically sourced AI training data, providing diverse datasets and custom data services.

salesremotefull-time

Benefits:

  • Flexible working schedule and hybrid model

  • Excellent career development opportunities

  • Culture of feedback and continuous improvement

  • International and diverse team

  • Continuous training opportunities

Education Requirements:

  • Bachelor's degree or equivalent

Experience Requirements:

  • 6+ years of proven experience working as a Sales Executive selling Professional Services / Data / Customized Projects / Consultative Sales into Enterprise accounts (B2B)

Other Requirements:

  • Proficient with Salesforce / CRM and MS Office

  • Ability to communicate, present and influence all levels of the organization, including executives

  • Strong ability to handle directly and close complex deals above $1M

  • Knowledge in AI/ML

  • Technical Sales experience will be a plus

Responsibilities:

  • Hunting for new logos in the assigned Enterprise verticals

  • Expanding the company’s footprint in existing enterprise or strategic accounts

  • Managing enterprise and or strategic customers with significant deal sizes $500k-$5M

  • Creating organic revenue streams working with the solutions and customer success teams within assigned territories/regions

  • Supporting and collaborating with internal partners to build successful proof of concepts, use cases and RFPs etc

Show more details

B2B Technical Writer

Defined.ai offers a large marketplace for high-quality, ethically sourced AI training data, providing diverse datasets and custom data services.

Benefits:

  • Flexible working schedule and hybrid model

  • Excellent career development opportunities

  • Culture of feedback and continuous improvement

  • International and diverse team

  • Continuous training opportunities

Experience Requirements:

  • 5+ years of B2B technical writing experience

Other Requirements:

  • Strong understanding of AI concepts

  • Exceptional writing skills

  • Ability to work effectively with cross-functional teams

  • Knowledge of SEO best practices

Responsibilities:

  • Write AI-focused B2B content

  • Collaborate with product and engineering teams

  • Support marketing team by developing content

  • Ensure content is relevant and localized

  • Implement SEO best practices

Show more details

Backend Engineer

Defined.ai offers a large marketplace for high-quality, ethically sourced AI training data, providing diverse datasets and custom data services.

engineeringonsiteLisbon, PTfull-time

Benefits:

  • Flexible working schedule and hybrid model

  • Excellent career development opportunities

  • Culture of feedback and continuous improvement

  • International and diverse team

  • Continuous training opportunities

Education Requirements:

  • BSc or MSc in Computer Science or similar background

Experience Requirements:

  • Mid to senior-level of .Net C# and software quality best practices

Other Requirements:

  • Experience with working with Agile software development methodologies

  • Worked with Azure services such as DevOps, Kubernetes and Blob Storage

  • Deep understanding of a fully automated software development lifecycle via CI/CD pipelines

  • Comfortable with applying software design and architectural patterns/principles

  • Accustomed to working with microservices in .Net C#, MS SQL Server and RabbitMQ

  • Knowledge of RESTful APIs

  • Familiarity with shell scripting

  • Proficient in both written and spoken English

Responsibilities:

  • Work on the back-end side of our platform by developing tools to automate workloads for data collection and processing of AI training datasets

  • Develop and evolve a microservice- and event-driven architecture based mainly on .Net C#, SQL Server, and RabbitMQ

  • Own the entire lifecycle (from conception to release and maintenance) of the services and applications your team owns

  • Be working in a multidisciplinary (QA, Back- and Front-end Engineers, Product Managers, etc.) and multicultural Agile team

  • Collaborate with the Product, Architecture, Infrastructure, and DevOps teams as well

Show more details

Explore AI Career Opportunities

Social Media

Ratings & Reviews

No ratings available yet. Be the first to rate this tool!

Featured Tools

Songmeaning favicon
Songmeaning

Songmeaning uses AI to reveal the stories and meanings behind song lyrics. It offers lyric translation and AI music generation.

View Details
Whisper Notes favicon
Whisper Notes

Offline AI speech-to-text transcription app using Whisper AI. Supports 80+ languages, audio file import, and offers lifetime access with a one-time purchase. Available for iOS and macOS.

View Details
GitGab favicon
GitGab

Connects Github repos and local files to AI models (ChatGPT, Claude, Gemini) for coding tasks like implementing features, finding bugs, writing docs, and optimization.

View Details
nuptials.ai favicon
nuptials.ai

nuptials.ai is an AI wedding planning partner, offering timeline planning, budget optimization, vendor matching, and a 24/7 planning assistant to help plan your perfect day.

View Details
Make-A-Craft favicon
Make-A-Craft

Make-A-Craft helps you discover craft ideas tailored to your child's age and interests, using materials you already have at home.

View Details
Pixelfox AI favicon
Pixelfox AI

Free online AI photo editor with comprehensive tools for image, face/body, and text. Features include background/object removal, upscaling, face swap, and AI image generation. No sign-up needed, unlimited use for free, fast results.

View Details
Smart Cookie Trivia favicon
Smart Cookie Trivia

Smart Cookie Trivia is a platform offering a wide variety of trivia questions across numerous categories to help users play trivia, explore different topics, and expand their knowledge.

View Details