Set up performance testing #100

kissgyorgy · 2021-12-07T10:30:15Z

We need to measure how fast unblob operates as a whole and which strategies speed up extraction significantly.
Example question we want to answer: which is faster, matching all YARA patterns at once, or iterating over the file multiple times with fewer patterns each time?

Measure different scenarios:

One big file with a few smaller files inside
Lots of small files concatenated and inside
Multiple big files concatenated and inside
Refactor the priority handling by concatenating all YARA rules and handling the match results by priority, instead of scanning a file multiple times. Measure the difference on various files.
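
To make the comparison concrete, here is a rough sketch of the two approaches. It uses Python's `re` module as a stand-in for YARA, so it only illustrates the shape of the measurement, not unblob's actual matching code. The pattern table, the priorities and both function names are made up for illustration. Note also that a regex alternation only reports non-overlapping matches, which YARA does not restrict itself to.

```python
import re
import timeit

# Hypothetical (priority, name, pattern) table; lower number = higher priority.
PATTERNS = [
    (1, "zip", rb"PK\x03\x04"),
    (1, "gzip", rb"\x1f\x8b\x08"),
    (2, "elf", rb"\x7fELF"),
]


def _compile(entries):
    """Combine (name, pattern) pairs into one alternation of named groups."""
    parts = [b"(?P<%s>%s)" % (name.encode(), pattern) for name, pattern in entries]
    return re.compile(b"|".join(parts))


def scan_per_priority(data):
    """Scan the data once per priority level, highest priority first."""
    results = []
    for priority in sorted({p for p, _, _ in PATTERNS}):
        group = [(n, pat) for p, n, pat in PATTERNS if p == priority]
        for match in _compile(group).finditer(data):
            results.append((priority, match.start(), match.lastgroup))
    return results


def scan_once(data):
    """Scan the data a single time, then order the matches by priority."""
    priority_of = {name: priority for priority, name, _ in PATTERNS}
    combined = _compile([(n, pat) for _, n, pat in PATTERNS])
    results = [
        (priority_of[m.lastgroup], m.start(), m.lastgroup)
        for m in combined.finditer(data)
    ]
    return sorted(results)


if __name__ == "__main__":
    # Synthetic input: three signatures separated by padding, repeated.
    sample = (b"\x7fELF" + b"A" * 4096 + b"PK\x03\x04" + b"B" * 4096 + b"\x1f\x8b\x08") * 200
    for func in (scan_per_priority, scan_once):
        seconds = timeit.timeit(lambda: func(sample), number=20)
        print(f"{func.__name__}: {seconds:.4f}s for 20 runs")
```

Both functions return the same `(priority, offset, name)` tuples on inputs without overlapping signatures, so the timing difference reflects only the number of passes over the data.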

vlaci · 2021-12-07T12:05:38Z

We should measure how much wall time the different extraction steps take, e.g. YARA matching, chunk calculation, carving, and extraction.
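
One possible way to collect per-step wall time is a small context manager built on `time.perf_counter`. This is only a sketch: `StepTimer` and the step names used below are hypothetical and do not exist in unblob.

```python
import time
from collections import defaultdict
from contextlib import contextmanager


class StepTimer:
    """Accumulates wall-clock time per named step."""

    def __init__(self):
        self.totals = defaultdict(float)

    @contextmanager
    def step(self, name):
        start = time.perf_counter()
        try:
            yield
        finally:
            # Add to the running total so repeated steps are summed up.
            self.totals[name] += time.perf_counter() - start

    def report(self):
        """Return (step, seconds) pairs, slowest step first."""
        return sorted(self.totals.items(), key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    timer = StepTimer()
    # Placeholder workloads standing in for the real extraction steps.
    with timer.step("matching"):
        time.sleep(0.02)
    with timer.step("carving"):
        time.sleep(0.01)
    for name, seconds in timer.report():
        print(f"{name}: {seconds:.3f}s")
```

Wrapping each stage of the pipeline in `timer.step(...)` would show which stage dominates for a given input file.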

kissgyorgy · 2021-12-08T10:57:16Z

There is pytest-benchmark, which we can use to write benchmark tests. With special markers, these tests would be ignored by default but could easily be selected to run.

kissgyorgy mentioned this issue Dec 7, 2021

Refactor processing, get rid of strategies.py #98

Merged

kissgyorgy added this to the v2.0 - more in depth extraction milestone Dec 7, 2021

kissgyorgy mentioned this issue Dec 7, 2021

Parallelize file processing #71

Closed

kissgyorgy self-assigned this Dec 9, 2021

vlaci assigned kissgyorgy and vlaci and unassigned kissgyorgy Jan 12, 2022

qkaiser added the enhancement (New feature or request) and performance (performance improvements tasks) labels Jan 13, 2022

kukovecz mentioned this issue Jan 20, 2022

Rework recursive process_file core calls #181

Closed

vlaci mentioned this issue Jan 27, 2022

Optimize yara timeout value #201

Closed

martonilles modified the milestones: v2.0 - metadata extraction, v3.0 Mar 11, 2022
