| domain | ldcil.org |
| summary | The Raw Speech Corpus Project is a multi-faceted initiative to create extensive speech resources. It develops several kinds of corpora, including aligned speech, text-to-speech parallel corpora, and monolingual text corpora, covering 270 mother tongues. Activities include speech data creation, annotation, and validation; voice building through TTS; digitization; and parts-of-speech annotation and validation, ultimately contributing to classical language corpus creation. |
| title | Home | Official Website of Linguistic Data Consortium for Indian Languages |
| description | Established in 2007, the Linguistic Data Consortium for Indian Languages (LDC-IL) is a scheme of the Department of Higher Education, Ministry of Human Resource and Development, Government of India implemented by and housed inside the Central Institute of |
| keywords | corpus, speech, text, gold, standard, sentence, language, bengali, indian, kannada, data, project, english, telugu, hindi, assamese, gujarati |
| upstreams |
|
| downstreams |
|
| nslookup | A 203.129.240.173 |
| created | 2026-02-14 |
| updated | 2026-02-14 |
| summarized | 2026-02-15 |
|
|