Persian Datasets including: Wikipedia, Twitter, Hamshahri, Hellokish, NSURL'19, Peyma, Text_mining.ir
Updated Oct 6, 2023
This repository applies the WavLM model to speaker verification on both high-quality and poor-quality data, and uses the PyCM library for evaluation.
This repo shows how to fine-tune the wav2vec 2.0 model, along with its prerequisites.
The first intelligent Persian reverse dictionary
A simple script to crawl data from Persian news agencies, including Fars and Mehr.
The first dataset for Farsi fact extraction and verification
Official GitHub repository for Persis: a Persian font recognition pipeline using convolutional neural networks.
An Image Dataset of Printed Farsi Text for OCR Research
A Persian-to-Finglish dataset with voice recordings of every sentence, used as a TTS dataset to train Tacotron 2
A collection of Farsi (Persian) datasets
CLIPfa: Connecting Farsi Text and Images
Persian/Farsi text-to-speech (TTS) training using Coqui TTS