Skip to content

Mateusz-Dera/whisperspeech-webui

Repository files navigation

WhisperSpeech web UI

Web UI for WhisperSpeech (https://github.com/collabora/WhisperSpeech)

Preview

Info

Version

Test platform:

Name Info
CPU AMD Ryzen 7900X3D (iGPU disabled in BIOS)
GPU AMD Radeon 7900XTX
RAM 64GB DDR5 6600MHz
Motherboard ASRock B650E PG Riptide WiFi (2.10)
OS Ubuntu 22.04
Kernel 6.5.0-28-generic
ROCm 6.1

Instalation:

  1. Install Python 3.11

  2. Clone repository

  3. Mount the repository directory.

  4. Create and activate venv

  5. For ROCm set HSA_OVERRIDE_GFX_VERSION.

  • For the Radeon 7900XTX:
export HSA_OVERRIDE_GFX_VERSION=11.0.0
  1. Install requirements
  • ROCm 5.7:
pip install -r requirements_rocm_5.7.txt
pip install git+https://github.com/ROCmSoftwarePlatform/flash-attention.git@ae7928c5aed53cf6e75cc792baa9126b2abfcf1a
  • ROCm 6.0:
pip install -r requirements_rocm_6.0.txt
pip install git+https://github.com/ROCmSoftwarePlatform/flash-attention.git@2554f490101742ccdc56620a938f847f61754be6
  • CUDA 11.8 (Tested on Ubuntu 23.10):
pip install -r requrements_cuda_11.8.txt
  • CUDA 12.1 (Tested on Ubuntu 23.10):
pip install -r requrements_cuda_12.1.txt
  1. Run:
python webui.py
  • With -h or --help for help:
python webui.py -h