-
Notifications
You must be signed in to change notification settings - Fork 57
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Not releasing /tmp #682
Comments
First, I suggest compressing the output files as they are created by using the extension ".tsv.gz". |
Second, I don't see how the graph cache command would be created by the |
Ahh, I have another command that runs after the import. That command is : |
Working with large data files requires some care. As Craig suggested, make sure the edge file that was produced is compressed to not waste any space. |
Describe the bug
I used the following command over one of the 2022 Wikidata JSON dumps and I didn't have enough free space on my disk for the output *.tsv files
The process has been unsuccessful and the /tmp have been completely occupied with
kgtk-graph-cache-sh200.sqlite3.db
(about 63 GB). The SQLite file seems to remain after some other successful importing as well.To Reproduce
Not sure how to reproduce the situation, but I think the problem was due to a lack of free space.
Expected behavior
The /tmp is better to be cleared after either successful or failed importing process
The text was updated successfully, but these errors were encountered: