    Repositories list

    • stopwords

      Public
      This repository contains stopword lists for various Indian languages. The lists are expected to be useful to researchers working in fields including AI/NLP, Linguistics, and Digital Humanities.
      Creative Commons Zero v1.0 Universal
      Updated Feb 15, 2025
    • 🤗 AutoTrain Advanced for integration with LiFE App Training Module
      Python
      Apache License 2.0
      Updated Feb 11, 2025
    • life

      Public
      Linguistic Field Data Management and Analysis System [LiFE]
      Python
      GNU Affero General Public License v3.0
      Updated Oct 27, 2024
    • Repository of data and scripts of the UGC-UKIERI Project on "Automatic Detection of Verbal Threat in Hindi and English Aggressive Speech"
      Praat
      Other
      Updated Jun 9, 2024
    • harmpot

      Public
      This repository contains the dataset, models and other details about the HarmPot (Measuring Harm Potential of Social Media Content in India) Project.
      GNU Affero General Public License v3.0
      Updated May 22, 2024
    • A repository of the social media dataset in Hindi, annotated with politeness levels
      Other
      Updated Jan 10, 2024
    • Repository of the data and tools for propaganda identification in Hindi
      Jupyter Notebook
      Other
      Updated Jan 10, 2024
    • ComMA

      Public
      Dataset of 20,000 datapoints in Meitei, Bangla and Hindi, richly annotated with different levels of aggression and bias for the ComMA Project.
      Other
      Updated Jan 10, 2024
    • SpeeD-IA

      Public
      Repository for different Speech Datasets and Models for Indo-Aryan languages.
      Other
      Updated Nov 27, 2023
    • SpeeD-IL

      Public
      Central Repository for the Speech Datasets and Models in Indian Languages (SpeeD-IL) project. Each language family has a separate, dedicated repository linked to this central repository.
      GNU Affero General Public License v3.0
      Updated Aug 17, 2023
    • crawlers

      Public
      Crawlers for automatically collecting data from different sources
      Python
      Apache License 2.0
      Updated Apr 4, 2023
    • Punctuation Models for 12 Indian Languages
      Python
      MIT License
      Updated Dec 15, 2022
    • Read, write, and manipulate Praat TextGrid files with Python
      Python
      GNU General Public License v3.0
      Updated Nov 4, 2022
    • mscrabble

      Public
      Repository for the Multilingual Scrabble Generator and Games, aimed especially at endangered languages
      JavaScript
      MIT License
      Updated Dec 16, 2021