Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

High memory usage compared to other implementation #8

Open
jonrscott opened this issue Dec 9, 2021 · 2 comments
Open

High memory usage compared to other implementation #8

jonrscott opened this issue Dec 9, 2021 · 2 comments

Comments

@jonrscott
Copy link

Hi, I found this while looking for a lower memory usage alternative to anknown/ahocorasick.

I have a dataset of around 6 million strings. The total memory usage, as shown by pprof, after building the automaton is just over 30GB, compared to 6.5GB for the anknown version.

Do you have any tips for working out why it's using so much more RAM?

Thanks in advance.

@petar-dambovaliev
Copy link
Owner

Hi, I found this while looking for a lower memory usage alternative to anknown/ahocorasick.

I have a dataset of around 6 million strings. The total memory usage, as shown by pprof, after building the automaton is just over 30GB, compared to 6.5GB for the anknown version.

Do you have any tips for working out why it's using so much more RAM?

Thanks in advance.

Hey, sorry for the late response, lol. It's been 2 years, more or less.
I haven't had much time for open source.
I am not familiar with the implementation of anknown.
I'll need to check it out before making some kind of a statement.

@peter7891
Copy link

Hi, I found this while looking for a lower memory usage alternative to anknown/ahocorasick.

I have a dataset of around 6 million strings. The total memory usage, as shown by pprof, after building the automaton is just over 30GB, compared to 6.5GB for the anknown version.

Do you have any tips for working out why it's using so much more RAM?

Thanks in advance.

I will also need to analyse your data.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants