Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

added DA in-memory generator in data.py #90

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

Anner-deJong
Copy link
Contributor

Regular data augmentation (DA) seems not so straightforward with sequence samples, as we dont want to randomly rotate each frame in a single sample sequence by a different degree. -> Custom DA is implemented

Since this DA is on the fly, the get_all_sequences_in_memory() won't cut it anymore. However, the generator is much slower.
This is partly counteracted by implementing a hybrid version: Load all raw data to memory, and apply DA on the fly from there.
Locally this gave a 1.6 - 2.0 speed up as compared to running the regular generator with DA (currently the regular generator does NOT have any DA option).

NB:
I changed the code locally quite a bit, with different names. The code definitely ran, yet before uploading I altered it to fit the online terminology again, in which I have not tested wrong naming or wrong attributes or something.

Regular data augmentation (DA) seems not so straightforward with sequence samples, as we dont want to randomly rotate each frame in a single sample sequence by a different degree. -> Custom DA is implemented

Since this DA is on the fly, the get_all_sequences_in_memory() won't cut it anymore. However, the generator is much slower.
This is partly counteracted by implementing a hybrid version: Load all raw data to memory, and apply DA on the fly from there.
Locally this gave a 1.6 - 2.0 speed up as compared to running the regular generator with DA (currently the regular generator does NOT have any DA option).

NB:
I changed the code locally quite a bit, with different names. The code definitely ran, yet before uploading I altered it to fit the online terminology again, in which I have not tested wrong naming or wrong attributes or something.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

1 participant