Assignment 2 for CS 11-731 Machine Translation course.
Updated Nov 6, 2019 - TypeScript
A 16M LLM for POS tagging in African languages
Auto-generated stopwords for South African Bantu Languages
A repository containing the details of a natural language inference dataset in Hindi
Dataset for the paper "A Neural Approach to Multilingual Sentiment Analysis in Low Resource Languages", submitted to Elsevier Expert Systems with Applications
FilWordNet web portal, a language resource for Filipino and Philippine English built from text analysis, network science, and natural language processing
Italian hate speech detection using a transformer.
A web application to test sentence-similarity models for the top 10 Indian languages
GlotSparse: Building Corpora in Under-Resourced Languages
IsiZulu News (articles and headlines) and Siswati News (headlines) Corpora - za-isizulu-siswati-news-2022
Open-source ASR for the Quechua language that can run in real time using the HTK toolkit.
LLMs for Low Resource Languages in Multilingual, Multimodal and Dialectal Settings
The Ede Python library automates the generation of instruction fine-tuning datasets in low-resource languages.
Repository for my personal page.
AAAI Knowledge NLP Submission
An overview of the possibilities of using TARS models for low-resource languages
The following repository contains data and data preparation tools for a Polish-Kashubian translator.
ULMFiT model that classifies Swahili news articles
This repository highlights the reasoning capabilities of LLMs ✨ Mistral / LLaMA-3 / Phi-3 / Gemma / Flan-T5 / GPT-4o ✨ in Targeted Sentiment Analysis of Russian mass-media texts, in the original and translated to English 📊