Case Study: Global AI Training Initiative for Major Voice Technology Provider

Voices delivered over 18,000 hours of high-quality, multilingual conversational audio.

Large-Scale Conversational Data Collection:

  • 50,000 Hours of Audio Data (potential project scope)
  • 18,000 Hours of Conversational Data (delivered to date)
  • 9 Languages

Overview:

Voices partnered with a major voice technology provider to support the development of an AI model capable of generating more natural and human-like speech.

This case study highlights Voices’ expertise in large-scale, multilingual voice data collection for advanced AI model training.

Challenge:

A leading voice technology provider needed a vast and diverse dataset of conversational audio to train an AI model designed to better understand conversational cues and generate more natural-sounding speech. The project demanded significant scale: a wide range of languages and regional accents, as well as both scripted and spontaneous interactions.

Solution:

Voices undertook this complex project, leveraging its extensive network and operational capabilities to deliver high-quality, ethically sourced voice data. The solution involved:

  • Massive Data Sourcing: Initiating a project with a potential scope of up to 50,000 hours of audio data, and successfully delivering over 18,000 hours of conversational data to date.
  • Diverse Data Collection: Including both scripted interactions (e.g., customer service scenarios, targeted discussion topics, guided dialogues) and unscripted, spontaneous conversations to ensure a realistic and varied dataset.
  • Multilingual and Multi-accent Support: Sourcing data across 9 languages (English, Spanish, German, French, Italian, Japanese, Arabic, Portuguese, and Dutch), including multiple regional accents. This highlights Voices’ ability to attract global talent and manage language complexities.
  • Operational Excellence: Coordinating a vast global network of contributors and ensuring consistent data quality across varied speech styles and regional accents. This logistical and linguistic achievement demonstrated Voices’ precision and adaptability.

Results:

This ongoing project showcases Voices’ ability to manage complex, multilingual data collection efforts at scale, consistently meeting aggressive delivery timelines without compromising quality. Key outcomes include:

  • Significant Data Delivery: Over 18,000 hours of conversational audio data delivered so far.
  • Rapid Scalability: Delivery scaled to more than 1,000 hours of conversational data per week.
  • Broad Language Coverage: Successful sourcing across 9 different languages, demonstrating versatility in capturing authentic, usable voice data for a wide range of AI training needs.
  • Enhanced AI Model Development: The project directly contributes to the development of AI models that produce more natural-sounding, context-aware speech, aligning with the client’s goal of improving conversational AI.

This project demonstrates how Voices combines scale, speed, and multilingual expertise to help clients train more natural-sounding, context-aware AI models, reinforcing our commitment to powering responsible AI development.
