Feat/joint diarization and embedding with prepared data #1583

clement-pages · 2023-12-08T13:11:33Z

No description provided.

…e.to Fixes 1397

BREAKING(model): get rid of (flaky) `Model.introspection`

…o feat/joint-diarization-and-embedding

- fixes the dimension error between files id and probabilties arrays - changes the way of how chunks for the embedding task are sampled - creates two functions to draw chunks, one for each subtask Tests are required to ensure that there are no bugs

For now this is a copy past from methods in segmentation task.

as computing this loss probably does not make sense in powerset mode because first class (empty set of labels) does exactly this

as this instance attribute was not used

…` pipeline Co-authored-by: Hervé BREDIN <[email protected]>

as these loop could break gradient flow and to optimize the code

for now do the trick only for the diarization subtask

* use npz archive instead pickle to save task data * improve code readability * improve(task): update numpy array dtypes In order to use types whose size better machtes the contents of the arrays * remove `end` entry from `annotated_regions` numpy array This entry was redundant with the start and duration entries, since `end` = `start` + `duration`. * fix: allow data preparation to be finished when task has no validation * improve: clear data lists after assignation to `self.prepared_data` This is to avoid data redundancy in the `prepare_data` method --------- Co-authored-by: clement-pages <[email protected]>

Now the joint task uses `prepare_data` and `setup` from core `Task` and `SpeakerDiarization` task.

…' of github.com:clement-pages/pyannote-audio into feat/joint-diarization-and-embedding-with-prepared-data

…embedding-with-prepared-data

…mbedding`

…ddins This new model is based on a `WeSpeakerResnet34` for the speaker embeddings extraction part, and on `PyanNet` for (local) segmentation.

…ata`

…embedding-with-prepared-data

…ocol

…embedding-with-prepared-data

…-prepared-data

Now, the first `num_dia_samples` samples in a batch are dedicated to the diarization substak, and the remaining sample are for the embedding subtask

... and fix some bugs

…-prepared-data

…computation

chai3 and others added 30 commits June 8, 2023 08:42

fix: raise TypeError on wrong device type in Pipeline.to and Inferenc…

0551070

…e.to Fixes 1397

feat(task): add support for multi-task models (pyannote#1374)

30ddb0b

BREAKING(model): get rid of (flaky) `Model.introspection`

fix(inference): fix multi-task inference

4eb7190

feat: update FAQtory default answer

dcdfc15

add draft version of the joint diarization and embedding tasks

87f49f9

Merge branch 'develop' of github.com:clement-pages/pyannote-audio int…

6025a80

…o feat/joint-diarization-and-embedding

fix StopIteration error

04de82f

add missing collate methods

d8cb598

For now this is a copy past from methods in segmentation task.

remove support for non-powerset mode

d2d6e14

remove computing of vad loss

e58943b

as computing this loss probably does not make sense in powerset mode because first class (empty set of labels) does exactly this

remove unused imports

bc989cd

fix probabilities do not sum to 1 error

b4d0a78

attempt to fix file duration error

78718b1

attempt to fix negative start_time in embedding part

dfdd8f3

add end-to-end diarization and embedding model

1888360

update end-to-end model

6216d1f

clean multi-task source code

b42cc33

remove support for SegmentationProtocol in the multi-tasks

3d295dd

improve(test): use pyannote.database.registry (pyannote#1413)

3363be6

Set alpha coefficient as attribute

99a7762

remove diarization_database_files attribute

f2a4e34

as this instance attribute was not used

feat(pipeline): add return_embeddings option to `SpeakerDiarization…

017c910

…` pipeline Co-authored-by: Hervé BREDIN <[email protected]>

fix: fix missed speech at the very beginning/end

cf0e3b3

add losses computation in training_step method

f48b74f

doc: add note to self regarding cluster reassignment (pyannote#1419)

f393546

remove for loops in embedding loss computation

5718593

as these loop could break gradient flow and to optimize the code

add validation part into the multi-task

8036572

remove subtask parameter from prepare_chunk

aa36d7b

fix bugs in validation part

6617c9c

for now do the trick only for the diarization subtask

clement-pages and others added 20 commits November 21, 2023 16:14

improve code readability

987e702

Merge branch 'pyannote:develop' into feat/data_preparation

5358986

improve: remove complete redefinition of setup in joint task

0011870

Now the joint task uses `prepare_data` and `setup` from core `Task` and `SpeakerDiarization` task.

Merge branch 'feat/joint-diarization-and-embedding-with-prepared-data…

68763dc

…' of github.com:clement-pages/pyannote-audio into feat/joint-diarization-and-embedding-with-prepared-data

Merge branch 'feat/data-preparation' into feat/joint-diarization-and-…

7d78548

…embedding-with-prepared-data

improve: remove duplicated attributes in `JointSpeakerDiarizationAndE…

6e6b62d

…mbedding`

update: replace old Task attributes with prepared_data in joint task

e60873c

improve: handle multi-speaker embeddings in example_output

40cc903

feat: add new end-to-end model for joint speaker diarization and embe…

30ae9fb

…ddins This new model is based on a `WeSpeakerResnet34` for the speaker embeddings extraction part, and on `PyanNet` for (local) segmentation.

fix: fix empty dict issue for metadata_unique_values in `prepared_d…

72f9916

…ata`

improve: add dynamic typing for np array in prepare_data

ecd2cb4

Merge branch 'feat/data-preparation' into feat/joint-diarization-and-…

5e1abad

…embedding-with-prepared-data

improve: check matching bewteen task current protocol and cached prot…

fb6d540

…ocol

remove: remove unused argument stage in Task.setup

3810308

Merge branch 'feat/data-preparation' into feat/joint-diarization-and-…

f916db5

…embedding-with-prepared-data

update: change name of attribute database_ratio to dia_task_rate

e7da160

wip: attempt to fix issues encountered during training

77ac89f

update: use all the pyannet pretrained model

ea6d06d

fix: fix diarization loss calculation condition in training_step

185798d

hbredin mentioned this pull request Jan 23, 2024

wip: support for joint diarization and embedding #1409

Closed

clement-pages and others added 9 commits May 14, 2024 09:10

Merge branch 'develop' into feat/joint-diarization-and-embedding-with…

3fef4f5

…-prepared-data

update joint task with last modifications on data preparation

9d13697

update the way batches are generated in the joint task

6c67fc6

Now, the first `num_dia_samples` samples in a batch are dedicated to the diarization substak, and the remaining sample are for the embedding subtask

fix random generators

519db89

delete remaining call to example_output

106bfc5

update joint task training_step

d3326b1

... and fix some bugs

fix(task): fiw wrong call to receptive_field in prepare_chunk

a36420d

Merge branch 'develop' into feat/joint-diarization-and-embedding-with…

101f1d3

…-prepared-data

update(joint task): filter out inactive speaker embeddings from loss …

62fad78

…computation

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feat/joint diarization and embedding with prepared data #1583

Feat/joint diarization and embedding with prepared data #1583

clement-pages commented Dec 8, 2023

Feat/joint diarization and embedding with prepared data #1583

Are you sure you want to change the base?

Feat/joint diarization and embedding with prepared data #1583

Conversation

clement-pages commented Dec 8, 2023