Vakyansh Open Source Models

  1. Pretrained ASR Models
  2. Finetuned ASR Models
  3. Language Models
  4. Punctuation Models
  5. TTS Models
  6. Gender Classification Model
  7. Language Identification Models
  8. Interspeech 2021 ASR Models

<a name="pam"></a>

Pretrained ASR Models

wav2vec2-code | nemo-code

| Pretrained Model | Description | Architecture | Hours |
| --- | --- | --- | --- |
| Vakyansh-Conformer-SSL | This model was pre-trained using the NeMo toolkit on 34,000 hours of unlabeled audio in 39 Indian languages. This includes 15,000 hours of news recordings available on the internet, and 10,000 hours of YouTube audio and other audio data. In addition, 9,000 hours of Indian English audio data were taken from NPTEL lectures open sourced by AI4Bharat. <br> This model was trained in collaboration with NVIDIA (NVIDIA Graphics Pvt Ltd). We thank NVIDIA for providing the compute resources to train this model. | Conformer-Large | 34,000 |
| CLSRIL-23 | Cross Lingual Speech Representations for Indic Languages. Contains 10,000 hours of training data from 23 Indic languages. <br> Citation: https://arxiv.org/abs/2107.07402 | wav2vec2-Base | 10,000 |
| hindi_pretrained_4kh | Trained on 4,200 hours of Hindi data | wav2vec2-Base | 4,200 |
| kannada_pretrained_1400h | Trained on 1,400 hours of Kannada data | wav2vec2-XLSR | 1,400 |

<br><br>

<a name="fam"></a>

Finetuned ASR Models

Conformer based models

Repo

| Language | Pretrained Model | Finetuned Model | Finetuned Hours | Arch |
| --- | --- | --- | --- | --- |
| Hindi | Vakyansh Conformer SSL | hindi_large_ssl_2500 | 2,500 h | Large |
| Indian English | Vakyansh Conformer SSL | indian_en_large_ssl_700 | 700 h | Large |
| Kannada | Vakyansh Conformer SSL | kannada_large_ssl_1000 | 1,000 h | Large |
| Punjabi | Vakyansh Conformer SSL | punjabi_large_ssl_500 | 500 h | Large |
| Tamil | Vakyansh Conformer SSL | tamil_large_ssl_900 | 900 h | Large |

<br><hr>

wav2vec2 based models

Repo

Citation: https://arxiv.org/abs/2203.16512

| Language | Pretrained Model | Finetuned Model | Dictionary | Single Model for Inference | Finetuned Hours | TS model |
| --- | --- | --- | --- | --- | --- | --- |
| Hindi | CLSRIL-23 | him_4200 | dict | hindi_infer | 4200 h | hindi_ts |
| Indian English | CLSRIL-23 | enm_700 | dict | english_infer | 700 h | english_ts |
| Kannada | CLSRIL-23 | knm_560 | dict | kannada_infer | 560 h | kannada_ts |
| Tamil | CLSRIL-23 | tam_250 | dict | tamil_infer | 250 h | tamil_ts |
| Bengali | CLSRIL-23 | bnm_200 | dict | bengali_infer | 200 h | bengali_ts |
| Nepali | CLSRIL-23 | nem_130 | dict | nepali_infer | 130 h | nepali_ts |
| Telugu | CLSRIL-23 | tem_100 | dict | telugu_infer | 100 h | telugu_ts |
| Gujarati | CLSRIL-23 | gum_100 | dict | gujarati_infer | 100 h | gujarati_ts |
| Marathi | CLSRIL-23 | mrm_100 | dict | marathi_infer | 100 h | marathi_ts |
| Odia | CLSRIL-23 | orm_100 | dict | odia_infer | 100 h | odia_ts |
| Sanskrit | CLSRIL-23 | sam_60 | dict | sanskrit_infer | 60 h | sanskrit_ts |
| Maithili | CLSRIL-23 | maim_50 | dict | maithili_infer | 50 h | maithili_ts |
| Urdu | CLSRIL-23 | urm_60h | dict | urdu_infer | 60 h | urdu_ts |
| Punjabi | CLSRIL-23 | pam_10h | dict | punjabi_infer | 10 h | punjabi_ts |
| Dogri | CLSRIL-23 | doi_55h | dict | dogri_infer | 55 h | dogri_ts |
| Malayalam | CLSRIL-23 | mlm_8h | dict | malayalam_infer | 8 h | malayalam_ts |
| Bhojpuri | CLSRIL-23 | bhom_60h | dict | bhojpuri_infer | 60 h | bhojpuri_ts |
| Assamese | CLSRIL-23 | asm_8h | dict | assamese_infer | 8 h | assamese_ts |

<br><br>

<a name="lm"></a>

Language Models

Repo

The language models below are meant to be used together with the finetuned ASR models.

| Language | Type | Lexicon | LM | Text Corpus |
| --- | --- | --- | --- | --- |
| Hindi | kenlm 5-gram | hindi_lexicon | hindi_lm | hindi_text |
| Indian English | kenlm 5-gram | english_lexicon | english_lm | english_text |
| Kannada | kenlm 5-gram | kannada_lexicon | kannada_lm | kannada_text |
| Tamil | kenlm 5-gram | tamil_lexicon | tamil_lm | tamil_text |
| Bengali | kenlm 5-gram | bengali_lexicon | bengali_lm | bengali_text |
| Nepali | kenlm 5-gram | nepali_lexicon | nepali_lm | nepali_text |
| Telugu | kenlm 5-gram | telugu_lexicon | telugu_lm | telugu_text |
| Gujarati | kenlm 5-gram | gujarati_lexicon | gujarati_lm | gujarati_text |
| Marathi | kenlm 5-gram | marathi_lexicon | marathi_lm | marathi_text |
| Odia | kenlm 5-gram | odia_lexicon | odia_lm | odia_lm |
| Sanskrit | kenlm 5-gram | sanskrit_lexicon | sanskrit_lm | sanskrit_text |
| Maithili | kenlm 5-gram | maithili_lexicon | maithili_lm | maithili_text |
| Urdu | kenlm 5-gram | urdu_lexicon | urdu_lm | urdu_text |
| Punjabi | kenlm 5-gram | punjabi_lexicon | punjabi_lm | punjabi_text |
| Dogri | kenlm 5-gram | dogri_lexicon | dogri_lm | dogri_text |
| Malayalam | kenlm 5-gram | malayalam_lexicon | malayalam_lm | malayalam_text |
| Bhojpuri | kenlm 5-gram | bhojpuri_lexicon | bhojpuri_lm | bhojpuri_text |
| Rajasthani | kenlm 5-gram | rajasthani_lexicon | rajasthani_lm | rajasthani_text |
| Assamese | kenlm 5-gram | assamese_lexicon | assamese_lm | assamese_text |
| Hinglish | kenlm 5-gram | hinglish_lexicon | hinglish_lm | hinglish_text |

Dataset Credits: We thank AI4Bharat for open sourcing the Indic-Corp dataset. Link. We modified the original data by tokenizing and removing duplicates.
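The preprocessing mentioned in the credits (tokenizing and removing duplicates) can be pictured roughly as below. This is only a minimal sketch, not the script actually used to prepare the corpora: the function names `tokenize` and `tokenize_and_deduplicate`, the punctuation set, and the choice to keep first-seen order are all assumptions made for illustration.

```python
import re
import unicodedata


def tokenize(line: str) -> str:
    """Normalize a line and separate punctuation from words with spaces."""
    line = unicodedata.normalize("NFC", line).strip()
    # Put spaces around common punctuation marks (including the Devanagari danda).
    # Assumption: this small punctuation set is illustrative, not exhaustive.
    line = re.sub(r"([.,!?;:।])", r" \1 ", line)
    # Collapse runs of whitespace into a single space.
    return re.sub(r"\s+", " ", line).strip()


def tokenize_and_deduplicate(lines):
    """Return tokenized lines in first-seen order, dropping empties and repeats."""
    seen = set()
    result = []
    for raw in lines:
        tokenized = tokenize(raw)
        if tokenized and tokenized not in seen:
            seen.add(tokenized)
            result.append(tokenized)
    return result


if __name__ == "__main__":
    corpus = ["Hello, world!", "  Hello,  world! ", "", "a b"]
    for line in tokenize_and_deduplicate(corpus):
        print(line)
```

Deduplicating after tokenization means lines that differ only in spacing are treated as the same sentence. A real pipeline would likely need language-specific tokenization (for example, this sketch also splits decimal numbers at the period).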

Domain Specific Language Models

| Language | Type | Domain | Lexicon | LM | Text Corpus |
| --- | --- | --- | --- | --- | --- |
| English | kenlm 5-gram | Biomedical | bio_lexicon | bio_lm | bio_lm_eng_text |
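Each language model above ships with a lexicon and a text corpus. To illustrate how a lexicon relates to a text corpus, here is a hedged sketch that builds one from tokenized lines. It assumes the letter-lexicon convention used in the fairseq wav2vec 2.0 examples (a word, a tab, then its characters separated by spaces and terminated by `|`). The lexicon files linked above may use a different layout, and `build_lexicon` is a hypothetical helper, not part of the Vakyansh code.

```python
def build_lexicon(lines):
    """Map every unique word to its space-separated characters, ending in '|'.

    Assumption: one entry per word, sorted, in the form "word<TAB>w o r d |".
    """
    words = sorted({word for line in lines for word in line.split()})
    return [f"{word}\t{' '.join(word)} |" for word in words]


if __name__ == "__main__":
    for entry in build_lexicon(["the cat", "the dog"]):
        print(entry)
```

Splitting with `' '.join(word)` works per Unicode code point, so in Indic scripts dependent vowel signs become separate tokens. Whether that matches a given model's dictionary should be checked against its `dict` file.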

<br><br>

<a name="pm"></a>

Punctuation Models

Training Repo

Inference Repo

| Language | Model | Data |
| --- | --- | --- |
| Hindi | hi.zip | hindi_data |
| Assamese | as.zip | assamese_data |
| Bengali | bn.zip | bengali_data |
| Gujarati | gu.zip | gujarati_data |
| Kannada | kn.zip | kannada_data |
| Malayalam | ml.zip | malayalam_data |
| Marathi | mr.zip | marathi_data |
| Odia | or.zip | odia_data |
| Punjabi | pa.zip | punjabi_data |
| Tamil | ta.zip | tamil_data |
| Telugu | te.zip | telugu_data |

Dataset Credits: We thank AI4Bharat for open sourcing the Indic-Corp dataset. Link. We modified the original data by tokenizing and removing duplicates.

<br><br>

<a name="tts"></a>

TTS Models

The models below are trained using a combination of Glow-TTS and HiFi-GAN.

Repo

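| Language | Language Code | Gender | glow ckpt | hifi-gan ckpt |
| --- | --- | --- | --- | --- |
| Hindi | hi | Female | hi_0_glow | hi_0_hifi |
| Hindi | hi | Male | hi_1_glow | hi_1_hifi |
| Kannada | kn | Female | kn_0_glow | kn_0_1_hifi |
| Kannada | kn | Male | kn_1_glow | kn_0_1_hifi |
| Tamil | ta | Female | ta_0_glow | ta_0_1_hifi |
| Tamil | ta | Male | ta_1_glow | ta_0_1_hifi |
| Telugu | te | Female | te_0_glow | te_0_1_hifi |
| Telugu | te | Male | te_1_glow | te_0_1_hifi |
| Odia | or | Female | or_0_glow | or_0_1_hifi |
| Odia | or | Male | or_1_glow | or_0_1_hifi |
| Malayalam | ml | Female | ml_0_glow | ml_0_hifi |
| Malayalam | ml | Male | ml_1_glow | ml_1_hifi |
| Marathi | mr | Female | mr_0_glow | mr_1_hifi |
| Gujarati | gu | Male | gu_0_glow | gu_0_hifi |
| Bengali | bn | Female | bn_0_glow | bn_0_1_hifi |
| Bengali | bn | Male | bn_1_glow | bn_0_1_hifi |
| English | en | Female | en_0_glow | en_0_hifi |
| English | en | Male | en_1_glow | en_1_hifi |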

Dataset Credits: We thank IITM for open sourcing the Indic-TTS dataset. Link

<br><br>

<a name="gcm"></a>

Gender Classification Model

Repo

| Type | Model Type | Model |
| --- | --- | --- |
| Gender Classification | SVC | Model |

<br><br>

<a name="lim"></a>

Language Identification Models

Repo

| Type | Model |
| --- | --- |
| Hindi_vs_Others | Model |
| Tamil_vs_Others | Model |

<br><br>

<a name="iam"></a>

Interspeech 2021 ASR Models

Competition Link

| Language | Pretrained Model | Finetuned Model | Dictionary | Single Model for Inference |
| --- | --- | --- | --- | --- |
| Telugu | CLSRIL-23 | te_40h_interspeech | dict | telugu_infer_interspeech |
| Tamil | CLSRIL-23 | ta_40h_interspeech | dict | tamil_infer_interspeech |
| Gujarati | CLSRIL-23 | gu_40h_interspeech | dict | gujarati_infer_interspeech |
| Hinglish | CLSRIL-23 | hinglish_interspeech | dict | hinglish_infer_interspeech |

<br><br>

Citation

If you use any of our resources, please cite the following article:

@misc{chadha2022vakyansh,
    title={Vakyansh: ASR Toolkit for Low Resource Indic languages},
    author={Harveen Singh Chadha and Anirudh Gupta and Priyanshi Shah and Neeraj Chhimwal and Ankur Dhuriya and Rishabh Gaur and Vivek Raghavan},
    year={2022},
    eprint={2203.16512},
    archivePrefix={arXiv},
    primaryClass={cs.CL}
}

If you use the pretrained model (CLSRIL-23) please cite the following article:

@misc{gupta2021clsril23,
      title={CLSRIL-23: Cross Lingual Speech Representations for Indic Languages}, 
      author={Anirudh Gupta and Harveen Singh Chadha and Priyanshi Shah and Neeraj Chhimwal and Ankur Dhuriya and Rishabh Gaur and Vivek Raghavan},
      year={2021},
      eprint={2107.07402},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}
<hr>