UnReaL-TecE AI/NLP Services
Buy AI/NLP models or Datasets for building AI/NLP models for research and commercial use.
Wide Range of Services
Explore our AI/NLP models tailored for processing any Indian language. From summarization and topic modeling to sentiment and hate speech analysis, we've got you covered.
Support for Various Modalities
We build datasets across all modalities including speech, text, and audio-visual data.
Support for any Indian Language
Develop language resources across all Indian languages, including scheduled, non-scheduled, under-resourced, endangered, and unwritten languages.
Custom Solutions
We deliver linguistic resources and language technology you need including the most unconventional and customised requirements.
Featured Datasets
Educational/Research
Datasets are freely available under CC BY-NC-SA 4.0 for educational and research institutions.
Commercial Use
Commercial organizations need a separate, commercial license for use in products or research.
SpeeD-IA
Speech datasets and models for Indian Languages - Indo Aryan
Recordings, inter-linear glossing, and translations into English.
Available for Indo-Aryan languages like Awadhi, Bhojpuri, Magahi, etc.
Loading...

ComMA Project
Communal & Mysoginestic Aggression
Social Media Comments
60,000+ comments from various platforms.
Multilingual
Meitei (Manipuri), Bangla, Hindi, and English.
Multimodal
Text, Memes and Audio/Video Dataset
Aggression and Bias
Annotated with levels of aggression and bias (gender, caste, religion, etc.).
Aggression in Hindi and English Speech
Dataset
Dataset annotated with different levels of aggression in spoken Hindi and Indian English.
Models
Models for automatic identification of aggression in Hindi and Indian English speech.
Politeness in Text
Datasets designed to analyze politeness levels in written Hindi texts.
Propaganda in Hindi on Social Media
Datasets designed to detect propaganda in Hindi on Social Media in newspapers and periodicals.
© 2022-25 UnReaL-TecE LLP. All rights reserved.