Gets text and extracts sentences in a language from text using that language's lexicon.
-
Updated
Sep 26, 2021 - Python
Gets text and extracts sentences in a language from text using that language's lexicon.
Utilities for Processing the Dialogue State Tracking Challenge 3 Corpus
Universitat de Barcelona - Ioculator seu Mimus - Eclipse-based engine for annotation of the MiMus corpus
Utilities for Processing the bAbi Tasks Corpus
Collection of tools for building diachronic/historical word vectors
This project delves into the preprocessing and exploratory data analysis of a corpus, where initial phase involves constructing into individual articles using journalistic approach.
Corpus processing library
Tareas de Procesamiento del lenguaje natural
Corpus Processing Library
Utilities for Processing the Saarbrücken Corpus of Spoken English
The DEWmodel-Climatechange contains code to preprocess corpus and build DWE model. This work is part of the FRGS/1/2020/SSI0/UKM/02/1 project. Copyright @sabrinatiun2022
Source code to evaluate the semantic severity (vertical expansion) of concepts.
Paper that Lena Baunaz and I are working on as part of my SNSF-funded 'Focus in diachrony' research project at the University of Cambridge, UK.
Heuristics and cognitive biases in public discourse on climate changes - lingustic data analysis
Simple utility to filter out text corpus according to frequencies of words consisting sentences in it
Companion website for "Corpus Approaches to Language in Social Media" - source and build versions
Diarization A to Z - Kaldi to Gecko to Kaldi and corpus and back
Split-corpus package that provide dividing text corpora into the meaningful parts as close to specified size as possible.
Termoteca - multilingual terminological database
Corpus analysis of plain text and providing Type-Token Ratio as well as some other statistics.
Add a description, image, and links to the corpus-processing topic page so that developers can more easily learn about it.
To associate your repository with the corpus-processing topic, visit your repo's landing page and select "manage topics."