Shaip is offering a 50% discount on its ready-to-use audio/voice datasets for training conversational AI models
Ready-to-use audio/speech datasets in over 45 languages to start your speech recognition models.
Louisville, Kentucky, USA – February 3, 2022: Shaip, a global leader and innovator in training data collection and annotation in conversational AI, offers ready-to-use audio/voice datasets in more than 45 languages with a 50% discount for a limited period. The conversational AI dataset is used to train machine learning models that support a variety of use cases i.e. ASR, virtual/digital assistant, chatbot, conversational AI, speech analysis, TTS, language modeling, etc.
We currently offer over 50,000 hours of audio/voice data collected by a dedicated team of doctors, data engineers, ML engineers and human annotators from around the world. The data is divided into:
Call Center Conversations (8khz): Synthetic unscripted telephone conversation: “agent” and “customer”
Generic conversations (8khz): Unscripted telephone conversation between 2 people
Media and podcasts (16 khz): public domain audio/video interviews, podcasts, etc. between 1 and 5 people or more.
Scripted talk/monologue (16 khz): prompt-based recording
Vatsal Ghiya – CEO, Shaip said finding the right reference datasets has always been a daunting task to get ML initiatives off the ground. We specialize in serving AI organizations to create high quality custom audio datasets. We offer an exclusive catalog of “out-of-the-box” audio/voice datasets of 45 languages in multiple dialects for a variety of AI use cases.
He further adds that we have made available through the website the 50,000 hours of ready-to-use voice/audio data sets. These datasets are of very high quality and offer a fast and cost effective alternative to collecting and annotating data from scratch.
Shaip can also help get various conversational data in more than 150 languages around the world on the parameters below:
Languages, regional dialects and accents
Goal-oriented conversations across all areas of the industry
Spontaneous and scripted conversations
Monologue, 2-person conversations, call center conversations, wake-up words
Conversations about emotion, feeling, intention
Contact us today at [email protected]