Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Datasets: output corresponding type #6722

Open
ajdapretnar opened this issue Feb 2, 2024 · 1 comment
Open

Datasets: output corresponding type #6722

ajdapretnar opened this issue Feb 2, 2024 · 1 comment

Comments

@ajdapretnar
Copy link
Contributor

What's your use case?

Datasets always outputs an instance of Table, even for data that would be more suited to Corpus.

What's your proposed solution?

Datasets could (should?) output Corpus when the data is more appropriate for text mining and Timeseries when appropriate for ts. A flag somewhere?

Are there any alternative solutions?
Currently, the user has to explicitly use Corpus and set text features manually.

@ajdapretnar
Copy link
Contributor Author

Corpus migrates text_features to data attributes, which would, presumably, solve the problem (text_features is retained when loading the data).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant