Marathi Data Description

The Marathi speech data is collected from three different user groups: College students, Rural low income workers, Urban low income workers. The dataset is split into train and test, with 93.89 hours and 5 hours of audio, respectively. There are 2543 and 200 unique sentences in the train and test sets, respectively, and the utterances belong to the same set of 31 speakers in both train and test sets, with 100% speaker overlap. The text transcriptions of train and test sets are disjoint. The audio files are sampled at 8kHz, 16-bit encoding. The total vocabulary size of the train and test set is 3395.