anonymised feature output to facilitate sharing #19

Open
andreaskutka opened this issue Sep 7, 2023 · 0 comments
Labels
enhancement New feature or request

Comments

@andreaskutka
Collaborator

Problem
Very little data is available for testing or training models. Users are often restricted from sharing their survey data for other uses.

Solution
Develop an intermediate output that contains the features in a raw, anonymised form that users can share more easily. Ideally, the output is produced before score transformation, so that it can potentially be used as training data. The output should:

  • replace variable names and responsible names with letters
  • remove answer options, question text, etc., and keep only the columns needed
  • save the item-level and unit-level feature tables as CSV files and zip them
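
The three steps above could be sketched roughly as follows. This is only an illustration using the Python standard library, not the project's implementation: the function names (`letter_code`, `anonymise_rows`, `write_zip`) and all column names in the example (`variable_name`, `responsible`, `question_text`, `answer_time`) are hypothetical.

```python
import csv
import io
import zipfile
from string import ascii_uppercase


def letter_code(index):
    """Return a spreadsheet-style label: 0 -> A, 25 -> Z, 26 -> AA."""
    label = ""
    index += 1
    while index > 0:
        index, remainder = divmod(index - 1, 26)
        label = ascii_uppercase[remainder] + label
    return label


def anonymise_rows(rows, identifier_columns, keep_columns, mappings=None):
    """Keep only `keep_columns` and replace identifier values with letters.

    `mappings` can be shared between calls so that the same name receives
    the same letter in the item-level and the unit-level table.
    """
    mappings = {} if mappings is None else mappings
    result = []
    for row in rows:
        clean = {}
        for column in keep_columns:
            value = row[column]
            if column in identifier_columns:
                codes = mappings.setdefault(column, {})
                # First-seen order decides the letter: A, B, C, ...
                value = codes.setdefault(value, letter_code(len(codes)))
            clean[column] = value
        result.append(clean)
    return result


def write_zip(tables, target):
    """Write each table (a non-empty list of dicts) as one CSV in a zip archive."""
    with zipfile.ZipFile(target, "w", zipfile.ZIP_DEFLATED) as archive:
        for name, rows in tables.items():
            buffer = io.StringIO(newline="")
            writer = csv.DictWriter(buffer, fieldnames=list(rows[0]))
            writer.writeheader()
            writer.writerows(rows)
            archive.writestr(f"{name}.csv", buffer.getvalue())


if __name__ == "__main__":
    # Hypothetical item-level feature rows, before score transformation.
    item_rows = [
        {"variable_name": "hh_size", "responsible": "int_anna",
         "question_text": "How many members?", "answer_time": 12.5},
        {"variable_name": "age", "responsible": "int_ben",
         "question_text": "How old are you?", "answer_time": 4.0},
    ]
    shared = {}
    items = anonymise_rows(
        item_rows,
        identifier_columns={"variable_name", "responsible"},
        keep_columns=["variable_name", "responsible", "answer_time"],
        mappings=shared,
    )
    write_zip({"features_item": items}, "anonymised_features.zip")
```

The letter mapping is kept in memory and never written to the archive, so the shared files alone do not reveal the original names. Whether substituting letters is sufficient anonymisation for a given survey is a separate question that this sketch does not address.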
andreaskutka added the enhancement label Sep 7, 2023