Tools for splitting, normalizing, text-shaping Arabic script
-
Updated
Jun 23, 2024 - TypeScript
Tools for splitting, normalizing, text-shaping Arabic script
Research project on the state of the field of Multilingual Digital Humanities, with an initial focus on Arabic
Code and models for "The Interplay of Variant, Size, and Task Type in Arabic Pre-trained Language Models". EACL 2021, WANLP.
مستودع الأوراق المسحية في معالجة اللغة العربية (أسبر) A Repository for survey and review papers in Arabic Natural Language processing (ANLP).
Exploring the impact of contextual attention on Arabic text classification: This study examines how contextual attention, such as that implemented in transformers, influences the performance of generative models for Arabic text classification, by analyzing attention mechanisms and their usefulness.
Python library used for Arabic NLP to process, prepare and clean the Arabic text
Many countries speak Arabic; however, each country has its own dialect, the aim of this project is to build a model that predicts the dialect given the text.
This repo contains my NLP Module Labs
Repo for Kareem's Professional Website
Arabic text regression with various models, GPT-2 text generation, and BERT-based text classification.
Emotion Prediction in Arabic Text
The codebase for the "ALDi: Quantifying the Arabic Level of Dialectness of Text" paper accepted to EMNLP 2023.
AraT5: Text-to-Text Transformers for Arabic Language Understanding
Preprocesses and summarizes Arabic texts using BERT based model.
A comprehensive dataset for training a Text-to-Speech system focused on the Iraqi dialect. Contains custom-recorded audio samples, phonetic annotations, and text to support TTS model development and synthesis for Iraqi Arabic.
a bilingual glossary, to provide accurate translations and NLP specific definitions so that the arabic reader can see more clearly
A list of Moroccan Darija Datasets grouped by name, data source, region and size.
A Python implementation of Farasa toolkit
This notebook explores the application of Regex and embedding techniques in Arabic Natural Language Processing (NLP). It covers the use of regular expressions for text parsing tasks and delves into various word embedding methods, including Word2Vec and FastText, for semantic analysis and representation of Arabic text data.
Maha is a text processing library specially developed to deal with Arabic text.
Add a description, image, and links to the arabic-nlp topic page so that developers can more easily learn about it.
To associate your repository with the arabic-nlp topic, visit your repo's landing page and select "manage topics."