Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Transcription WorkBench #6303

Draft
wants to merge 24 commits into
base: develop
Choose a base branch
from

Conversation

lutangar
Copy link
Contributor

@lutangar lutangar commented Mar 29, 2024

Description

┬───────────────────────┬──────────────────────┬──────────┬────────┬───────────────────────┐
│          WER          │         CER          │ duration │ model  │        engine         │
┼───────────────────────┼──────────────────────┼──────────┼────────┼───────────────────────┤
│ '30.373831775700932%''9.824046920821115%''31s''tiny''openai-whisper'    │
│ '25.233644859813083%' │      undefined       │  '12s''tiny''whisper-ctranslate2' │
│ '35.046728971962615%' │      undefined       │  '18s''tiny''whisper-timestamped' │
┴───────────────────────┴──────────────────────┴──────────┴────────┴───────────────────────┘
┬───────────────────────┬──────────────────────┬──────────┬─────────┬───────────────────────┐
│          WER          │         CER          │ duration │  model  │        engine         │
┼───────────────────────┼──────────────────────┼──────────┼─────────┼───────────────────────┤
│ '15.887850467289718%' │      undefined       │ '2m 51s''small''openai-whisper'    │
│ '16.355140186915886%''18.10850439882698%''42s''small''whisper-ctranslate2' │
│ '16.822429906542055%' │      undefined       │ '1m 5s''small''whisper-timestamped' │
┴───────────────────────┴──────────────────────┴──────────┴─────────┴───────────────────────┘

💻 i7-7500U
🎥 🇫🇷 1m28s
https://podeduc.apps.education.fr/numerique-educatif/video/21893-communiquer-lors-dune-classe-transplantee/

Related issues

Has this been tested?

  • 👍 yes, I added tests to the test suite

see specific transcriptions tests https://github.com/larriereguichet/PeerTube/actions/workflows/transcription.yml

  • 💭 no, because this PR is a draft and still needs work
  • 🙅 no, because this PR does not update server code
  • 🙋 no, because I need help

Todo

Screenshots

@lutangar lutangar marked this pull request as draft March 29, 2024 09:37
@lutangar lutangar changed the title WIP Transcription WorkBench Mar 29, 2024
Copy link

socket-security bot commented Apr 16, 2024

👍 Dependency issues cleared. Learn more about Socket for GitHub ↗︎

This PR previously contained dependency changes with security issues that have been resolved, removed, or ignored.

View full report↗︎

@lutangar lutangar force-pushed the transcription-backend-workbench branch from 9dd15b8 to 1d27252 Compare April 23, 2024 08:26
Copy link

socket-security bot commented Apr 29, 2024

New and removed dependencies detected. Learn more about Socket for GitHub ↗︎

Package New capabilities Transitives Size Publisher
npm/[email protected] None 0 55.4 kB vitaly
npm/[email protected] eval 0 636 kB bnjmnt4n
npm/[email protected] None 0 1.41 MB bnjmnt4n
npm/[email protected] None +4 307 kB dabh
npm/[email protected] None 0 458 kB isaacs
npm/[email protected] None 0 14.5 kB webtorrent-bot
npm/[email protected] None 0 344 kB vitaly
npm/[email protected] None +1 794 kB vitaly
npm/[email protected] filesystem, network 0 22.2 kB runk
npm/[email protected] None +3 125 kB medikoo
npm/[email protected] None 0 8.9 kB zensh
npm/[email protected] network 0 5.29 kB dougwilson
npm/[email protected] None +1 224 kB dougwilson
npm/[email protected] environment, filesystem 0 51.7 kB broofa
npm/[email protected] None 0 54.5 kB ljharb
npm/[email protected] environment, eval, filesystem +10 2.84 MB joshuakgoldberg
npm/[email protected] None 0 32.7 kB tjvr
npm/[email protected] eval Transitive: unsafe +3 57.2 kB dougwilson
npm/[email protected] None 0 6.84 kB styfle
npm/[email protected] filesystem +10 235 kB linusu
npm/[email protected] None 0 7.65 kB medikoo
npm/[email protected] filesystem, network, shell Transitive: environment +1 2.23 MB illuspas
npm/[email protected] environment, filesystem, network, shell 0 500 kB andris
npm/[email protected] None 0 9.22 kB jonschlinkert
npm/[email protected] None 0 5.49 kB sindresorhus
npm/[email protected] None 0 19.1 kB indutny
npm/[email protected] None 0 38.5 kB deanm
npm/[email protected] None +1 7.01 kB isaacs
npm/[email protected] None 0 124 kB aspecto-release-bot
npm/[email protected] None 0 505 kB hectormolinero
npm/[email protected] None +1 109 kB sindresorhus
npm/[email protected] None 0 788 kB vitaly
npm/[email protected] filesystem, network 0 27.4 kB webtorrent-bot
npm/[email protected] None 0 12.9 kB bermi
npm/[email protected] None 0 4.55 kB sindresorhus
npm/[email protected] filesystem 0 9.1 kB brianc
npm/[email protected] None 0 4.17 kB charmander
npm/[email protected] environment, network +3 121 kB brianc
npm/[email protected] None 0 90 kB mrmlnc
npm/[email protected] unsafe 0 295 kB metcoder95
npm/[email protected] None +1 656 kB mourner
npm/[email protected] None 0 650 kB lukeapage
npm/[email protected] None 0 4.9 kB bendrucker
npm/[email protected] None 0 3.06 kB bendrucker
npm/[email protected] None 0 5.92 kB bendrucker
npm/[email protected] None 0 6.73 kB bendrucker
npm/[email protected] None 0 10.1 kB martianboy
npm/[email protected] Transitive: environment +3 1.05 MB caub
npm/[email protected] None +2 63.4 kB dougwilson
npm/[email protected] filesystem, network 0 51.5 kB tootallnate
npm/[email protected] environment, eval, filesystem 0 59.7 kB pug-bot
npm/[email protected] None 0 33.5 kB google-wombot
npm/[email protected] None 0 8.37 kB feross
npm/[email protected] None 0 3.03 kB mafintosh
npm/[email protected] None 0 6.36 kB cwmma
npm/[email protected] None 0 8.46 kB dougwilson
npm/[email protected] environment 0 124 kB matteo.collina
npm/[email protected] filesystem 0 20.5 kB paulmillr
npm/[email protected] None 0 8.85 kB bridgear
npm/[email protected] None 0 241 kB rbuckton
npm/[email protected] Transitive: environment, filesystem +7 63.8 kB phated
npm/[email protected] environment, filesystem +2 159 kB ljharb
npm/[email protected] None 0 6.56 kB feross
npm/[email protected] None 0 31.7 kB feross
npm/[email protected] None 0 42.3 kB chalker
npm/[email protected] None +2 196 kB boutell
npm/[email protected] None +1 109 kB npm-cli-ops
npm/[email protected] None 0 216 kB robinbuschmann
npm/[email protected] filesystem +5 7.42 MB sdepold
npm/[email protected] None +2 193 kB oculus42
npm/[email protected] filesystem, shell +2 1.87 MB derduher
npm/[email protected] None +1 1.37 MB darrachequesne
npm/[email protected] filesystem, network +1 1.38 MB darrachequesne
npm/[email protected] None +1 2.93 MB nene
npm/[email protected] None +1 24.3 kB mafintosh
npm/[email protected] None 0 49.3 kB mafintosh
npm/[email protected] None 0 6.96 kB sindresorhus
npm/[email protected] network 0 24 kB titanism
npm/[email protected] None 0 10.9 kB jamesmessinger
npm/[email protected] None 0 12.5 kB dominictarr
npm/[email protected] shell +9 152 kB gilamran
npm/[email protected] None 0 84 kB typescript-bot
npm/[email protected] None 0 394 kB hirokiosame
npm/[email protected] None 0 174 kB dchest
npm/[email protected] None +1 57.9 kB gkz
npm/[email protected] None +1 29.6 kB dougwilson
npm/[email protected] None 0 40.6 MB typescript-bot
npm/[email protected] None 0 4.31 kB dougwilson
npm/[email protected] filesystem, unsafe +2 274 kB 3rdeden
npm/[email protected] None 0 5.48 kB tootallnate
npm/[email protected] None 0 751 kB profnandaa
npm/[email protected] None 0 8.75 kB dougwilson
npm/[email protected] network 0 46.4 kB silverbucket
npm/[email protected] filesystem, network +2 5.78 MB webtorrent-bot
npm/[email protected] filesystem, network Transitive: environment +10 469 kB dabh
npm/[email protected] environment, network 0 141 kB lpinca
npm/[email protected] None 0 6.46 kB raynos
npm/[email protected] filesystem +1 92.9 kB thejoshwolfe

🚮 Removed packages: npm/@babel/[email protected], npm/@babel/[email protected], npm/@babel/[email protected], npm/@jridgewell/[email protected], npm/@types/[email protected], npm/[email protected], npm/[email protected], npm/[email protected], npm/[email protected], npm/[email protected], npm/[email protected], npm/[email protected], npm/[email protected], npm/[email protected], npm/[email protected], npm/[email protected], npm/[email protected], npm/[email protected], npm/[email protected], npm/[email protected], npm/[email protected], npm/[email protected], npm/[email protected], npm/[email protected], npm/[email protected], npm/[email protected], npm/[email protected], npm/[email protected], npm/[email protected], npm/[email protected], npm/[email protected]

View full report↗︎

@lutangar lutangar force-pushed the transcription-backend-workbench branch from 2535ac1 to 380bd49 Compare April 29, 2024 15:24
chore: fiddling around some more

chore: add ctranslate2 and timestamped

chore: add performance markers

chore: refactor test

chore: change worflow name

chore: ensure Python3

chore(duration): convert to chai/mocha syntahx

chore(transcription): add individual tests for others transcribers

chore(transcription): implement formats test of all implementations

Also compare result of other implementation to the reference implementation

chore(transcription): add more test case with other language and models size and local model

chore(test): wip ctranslate 2 adapat
@lutangar lutangar force-pushed the transcription-backend-workbench branch from 4745d30 to c9d2385 Compare May 31, 2024 14:39
lutangar added 22 commits May 31, 2024 16:41
chore(test): clean a bit

chore(test): clean a bit

chore(test): refacto timestamed spec

chore(test): update workflow

chore(test): fix glob expansion with sh

chore(test): extract some hw info

chore(test): fix async tests

chore(benchmark): add model info
feat(transcription): extract run concept in own class an run more bench

chore(transcription): somplify run object only a uuid is now needed and add more benchmark scenario
docs(transcription): add local model usage
Forbid transcript creation without a language.
Add `languageDetection` flag to an engine and some assertions.

Fix an issue in `whisper-ctranslate2` :
Softcatala/whisper-ctranslate2#93
Update existing time function to output an integer number of seconds and add a ms human-readable time formatter with hints of tests.
I'm not very happy with the TranscriptFileEvaluator constructor... suggestions ?

chore(JiWer): add usage in README

docs(jiwer): update JiWer readme
chore(transcription): remove transcription workflow

docs(transcription): add benchmark info
…ate2 transcriber

chore(transcription): ctranslate2 assertion

chore(transcription): ctranslate2 assertion
@lutangar lutangar force-pushed the transcription-backend-workbench branch from c9d2385 to 855e672 Compare May 31, 2024 14:42
chore(transcription): add download and unzip utils functions

chore(transcription): download & unzip models fixtures

chore(transcription): zip
@lutangar lutangar force-pushed the transcription-backend-workbench branch from 855e672 to 3bffa61 Compare May 31, 2024 14:46
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

1 participant