Shaip is offering a 50% discount on its ready-to-use audio/voice datasets for training conversational AI models

Ready-to-use audio/speech datasets in over 45 languages ​​to start your speech recognition models.

Louisville, Kentucky, USA – February 3, 2022: Shaip, a global leader and innovator in training data collection and annotation in conversational AI, offers ready-to-use audio/voice datasets in more than 45 languages ​​with a 50% discount for a limited period. The conversational AI dataset is used to train machine learning models that support a variety of use cases i.e. ASR, virtual/digital assistant, chatbot, conversational AI, speech analysis, TTS, language modeling, etc.

We currently offer over 50,000 hours of audio/voice data collected by a dedicated team of doctors, data engineers, ML engineers and human annotators from around the world. The data is divided into:

Call Center Conversations (8khz): Synthetic unscripted telephone conversation: “agent” and “customer”

Generic conversations (8khz): Unscripted telephone conversation between 2 people

Media and podcasts (16 khz): public domain audio/video interviews, podcasts, etc. between 1 and 5 people or more.

Scripted talk/monologue (16 khz): prompt-based recording

Vatsal Ghiya – CEO, Shaip said finding the right reference datasets has always been a daunting task to get ML initiatives off the ground. We specialize in serving AI organizations to create high quality custom audio datasets. We offer an exclusive catalog of “out-of-the-box” audio/voice datasets of 45 languages ​​in multiple dialects for a variety of AI use cases.

He further adds that we have made available through the website the 50,000 hours of ready-to-use voice/audio data sets. These datasets are of very high quality and offer a fast and cost effective alternative to collecting and annotating data from scratch.

Shaip can also help get various conversational data in more than 150 languages ​​around the world on the parameters below:

Languages, regional dialects and accents

Goal-oriented conversations across all areas of the industry

Spontaneous and scripted conversations

Monologue, 2-person conversations, call center conversations, wake-up words

Conversations about emotion, feeling, intention

Contact us today at [email protected]

Source link

Comments are closed.