Set up performance testing #100
We should measure how much wall time the different extraction steps take, e.g. YARA matching, chunk calculation, carving, and extraction.
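A minimal sketch of per-step wall-time measurement, using only the standard library. The `timed` helper and the two step functions (`match_patterns`, `calculate_chunks`) are hypothetical stand-ins for illustration, not unblob's actual code.

```python
import time
from collections import defaultdict
from contextlib import contextmanager

# Accumulated wall time per step name, in seconds.
timings = defaultdict(float)


@contextmanager
def timed(step):
    """Add the wall time spent inside the with-block to timings[step]."""
    start = time.perf_counter()
    try:
        yield
    finally:
        timings[step] += time.perf_counter() - start


# Hypothetical stand-ins for real extraction steps.
def match_patterns(data):
    # Naive scan for a two-byte magic; returns every offset where it occurs.
    return [i for i in range(len(data)) if data.startswith(b"PK", i)]


def calculate_chunks(offsets, size):
    # Each chunk runs from one match to the next match (or to end of data).
    ends = offsets[1:] + [size]
    return list(zip(offsets, ends))


data = b"PK" + b"\x00" * 1000 + b"PK" + b"\x00" * 500

with timed("matching"):
    offsets = match_patterns(data)
with timed("chunk calculation"):
    chunks = calculate_chunks(offsets, len(data))

for step, seconds in timings.items():
    print(f"{step}: {seconds * 1000:.3f} ms")
```

Because `timings` accumulates, the same step can be wrapped in several places and the totals still add up per step.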
There is pytest-benchmark, which we can use to write benchmark tests with special markers; these would be ignored by default but could easily be selected to run.
qkaiser added the enhancement and performance labels on Jan 13, 2022
We need to measure how fast unblob as a whole can operate and what strategy can speed up extraction significantly.
Example question we want to answer: Which is faster? Matching on all YARA patterns at once, or iterating over the file multiple times with fewer patterns each time?
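The shape of such a comparison can be sketched with the standard library. Note the assumption: Python's `re` module stands in for YARA here, so the numbers say nothing about YARA itself; the patterns and the synthetic data are illustrative only.

```python
import re
import timeit

# Illustrative magic byte sequences (squashfs, ZIP local header, gzip).
PATTERNS = [rb"hsqs", rb"PK\x03\x04", rb"\x1f\x8b\x08"]

# Synthetic input: each magic appears 200 times, separated by padding.
data = (
    b"\x00" * 1000 + b"hsqs" + b"\x00" * 1000 + b"PK\x03\x04" + b"\x1f\x8b\x08"
) * 200

# Strategy A: one combined pattern, one pass over the data.
combined = re.compile(b"|".join(b"(?:" + p + b")" for p in PATTERNS))
# Strategy B: one compiled pattern per magic, one pass per pattern.
separate = [re.compile(p) for p in PATTERNS]


def single_pass():
    return sorted(m.start() for m in combined.finditer(data))


def multi_pass():
    return sorted(m.start() for rx in separate for m in rx.finditer(data))


# Both strategies must find the same offsets before timing means anything.
assert single_pass() == multi_pass()

for fn in (single_pass, multi_pass):
    best = min(timeit.repeat(fn, number=5, repeat=3))
    print(f"{fn.__name__}: {best / 5 * 1000:.2f} ms per run")
```

The two strategies only agree here because the patterns never overlap; a combined alternation reports non-overlapping matches, so a real comparison would need to account for that.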
Measure different scenarios: