Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feat/joint diarization and embedding with prepared data #1583

Draft
wants to merge 90 commits into
base: develop
Choose a base branch
from

Conversation

clement-pages
Copy link
Collaborator

No description provided.

chai3 and others added 30 commits June 8, 2023 08:42
BREAKING(model): get rid of (flaky) `Model.introspection`
- fixes the dimension error between files id and probabilties arrays
- changes the way of how chunks for the embedding task are sampled
- creates two functions to draw chunks, one for each subtask

Tests are required to ensure that there are no bugs
For now this is a copy past from methods in segmentation task.
as computing this loss probably does not make sense in powerset
mode because first class (empty set of labels) does exactly this
as this instance attribute was not used
as these loop could break gradient flow and to optimize
the code
for now do the trick only for the diarization subtask
clement-pages and others added 20 commits November 21, 2023 16:14
* use npz archive instead pickle to save task data

* improve code readability

* improve(task): update numpy array dtypes

In order to use types whose size better machtes the contents of the arrays

* remove `end` entry from `annotated_regions` numpy array

This entry was redundant with the start and duration entries,
since `end` = `start` + `duration`.

* fix: allow data preparation to be finished when task has no validation

* improve: clear data lists after assignation to `self.prepared_data`

This is to avoid data redundancy in the `prepare_data` method

---------

Co-authored-by: clement-pages <[email protected]>
Now the joint task uses `prepare_data` and `setup` from core `Task` and
`SpeakerDiarization` task.
…' of github.com:clement-pages/pyannote-audio into feat/joint-diarization-and-embedding-with-prepared-data
…ddins

This new model is based on a `WeSpeakerResnet34` for the speaker embeddings
extraction part, and on `PyanNet` for (local) segmentation.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

5 participants