NLTM logo
bhashini logo
Universal Language Contribution API
About ULCA

ULCA is a standard API and open scalable data platform (supporting various types of datasets) for Indian language datasets and models.

About ULCA

Application Programming Interfaces

Data Sets  Language datasets
  • Parallel text corpus in two or more languages
  • Monolingual text corpus
  • Automatic Speech Recognition (ASR) corpus
  • Text to Speech (TTS) corpus
  • Optical Character Recognition (OCR) corpus
  • Transliteration corpus
  • Natural Language Understanding (NLU) datasets
  • Glossary datasets
Models  Language specific tasks
  • Machine Translation (MT)
  • Automatic Speech Recognition (ASR)
  • Text to Speech (TTS)
  • Optical Character Recognition (OCR)
  • Speech To Speech (STS)
  • Transliteration
Benchmarks  Open benchmarking
  • Large, diverse and task specific benchmarks
  • Research community approved metric system