Harveen Singh Chadha

20 Mar, 7 tweets, 8 min read

Open Source Alert: Very excited to announce that we are open-sourcing Vakyansh, a speech recognition framework to democratize speech recognition in Indic languages.

Some key features:

1. End-to-end training and experimentation platform built on top of @facebookai Wav2Vec 2.0.

2. State-of-the-art pretrained and finetuned models in 8 Indic languages, including some low-resource languages.

(Hindi, Indian English, Kannada, Marathi, Odia, Tamil, Telugu and Gujarati)

3. KenLM-based language models, including text data, for all the above languages.

4. Intelligent data pipelines to generate training data for any end-to-end speech recognition framework (recipes include language identification, speaker clustering and gender identification).

5. Inference service to host Wav2Vec 2.0 models in real-time and batch modes.

Link to Github: github.com/Open-Speech-Ek…

Documentation: open-speech-ekstep.github.io

#speechrecognition #IndicLanguages #DeepLearning #DataScience

Please help me spread the word. @svpino @jeremyphoward @JiliJeanlouis @ylecun @philipvollet @tttthomasssss @MLWhiz @Al_Grigor @haltakov @bhutanisanyam1 @JFPuget @AndLukyane @_rockt @riedelcastro @an_open_mind @chipro @suzatweet @Tim_Dettmers @rctatman

Please help me spread the word. @SanhEstPasMoi @abhi1thakur @Thom_Wolf @rasbt @chrmanning @emilymbender @lmoroney @PyTorch @paperswithcode
