Generalize Final data.npz output #181
Labels
enhancement
New feature or request
Comments
martham93 changed the title from "Generalize Final" to "Generalize Final data.npz output" on Aug 27, 2020
Data.npz Output
Instead of the data.npz output, we will explore producing the final output with xarray for more flexibility, and include a utils script or example script in the repo that converts the final output to TFRecords for modeling purposes.
Currently the data.npz output is, by default, split into a test set and a train set, with 80% of the data going into training and 20% going into the test set. While the ratio, the number of split sets, and the names of the split sets can be changed, it could be better to return a single unsplit data.npz and let users handle splitting the data outside of label-maker for increased flexibility.

I think it could be useful if the new version of the data.npz file used the x-y-z tile id as the key, with the value being a list of 2 elements: the first the label in numpy array format, and the second the image in numpy array format.

Another benefit of this change is that the image-label pairs in numpy array format will retain the x-y-z tile id. The x-y-z tile information is currently only associated with the labels.npz file and not with the data.npz file.

Thoughts? @drewbo
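
To make the proposal concrete, here is a minimal sketch of what a tile-keyed data.npz could look like and how a user might split it outside of label-maker. This is not label-maker code: the function names (save_tiles, load_tiles, split_tile_ids), the array shapes, and the choice to store each pair as a 2-element object array are all assumptions made for illustration. One consequence of that choice is that the file has to be loaded with allow_pickle=True.

```python
import os
import tempfile

import numpy as np


def save_tiles(path, tiles):
    """Write {tile_id: (label, image)} to an .npz file, one entry per tile."""
    entries = {}
    for tile_id, (label, image) in tiles.items():
        # Object array of length 2, so label and image may have different shapes.
        pair = np.empty(2, dtype=object)
        pair[0] = np.asarray(label)
        pair[1] = np.asarray(image)
        entries[tile_id] = pair
    np.savez(path, **entries)


def load_tiles(path):
    """Read the file back into {tile_id: [label, image]}."""
    # Object arrays are stored with pickle, so allow_pickle=True is required.
    with np.load(path, allow_pickle=True) as npz:
        return {tile_id: list(npz[tile_id]) for tile_id in npz.files}


def split_tile_ids(tile_ids, train_fraction=0.8, seed=0):
    """Shuffle tile ids reproducibly and split them into (train, test) lists."""
    ids = sorted(tile_ids)
    rng = np.random.default_rng(seed)
    order = rng.permutation(len(ids))
    shuffled = [ids[i] for i in order]
    n_train = int(round(train_fraction * len(shuffled)))
    return shuffled[:n_train], shuffled[n_train:]


if __name__ == "__main__":
    # Hypothetical data: 3-class label vectors and 4x4 RGB images.
    rng = np.random.default_rng(42)
    tiles = {
        f"{x}-{x + 1}-12": (
            rng.integers(0, 2, size=3),
            rng.integers(0, 255, size=(4, 4, 3)),
        )
        for x in range(10)
    }
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "data.npz")
        save_tiles(path, tiles)
        loaded = load_tiles(path)
    train_ids, test_ids = split_tile_ids(loaded)
    print(len(train_ids), "train tiles,", len(test_ids), "test tiles")
```

Because every image-label pair stays keyed by its tile id, the split can be done on the ids alone, and any ratio or number of sets can be chosen downstream without changing the file format.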