This repo shows how to finetune the wav2vec2.0 model along with its prerequisites.
-
Updated
May 12, 2024 - Jupyter Notebook
This repo shows how to finetune the wav2vec2.0 model along with its prerequisites.
Persian to Finglish dataset with all the sentences voice for TTS dataset used to train tacotron2
Persian/Farsi text to speech(TTS) training using coqui tts
Official github repository, Persis: A persian font recognition pipeline using convolutional neural networks.
Persian Datasets including: Wikipedia, Twitter, Hamshahri, Hellokish, NSURL'19, Peyma, Text_mining.ir
Simple Script To Crawl Data From Persian News Agencies Including Fars, Mehr.
In this repository, the wavLM model is used for quality and poor quality data for speaker verification task, and the PyCM library is used for evaluation.
CLIPfa: Connecting Farsi Text and Images
The first intelligent Persian reverse dictionary
An Image Dataset of Printed Farsi Text for OCR Research
The first dataset for Farsi fact extraction and verification
A collection of Farsi (Persian) datasets
Add a description, image, and links to the farsi-datasets topic page so that developers can more easily learn about it.
To associate your repository with the farsi-datasets topic, visit your repo's landing page and select "manage topics."