Add having in prediction problem generation #133

Open
PatrikDurdevic opened this issue Aug 7, 2023 · 0 comments

In the store dataset, the target entity is orderlines.

Currently, we can generate a problem like:

  • Predict the number of records with <products.price> greater than 22.99 in next 1m days
  • SELECT count(*) from orderlines inner join products where products.price > 22.99

This predicts how many individual products priced above 22.99 will be ordered in a month.

With a HAVING operation, we would be able to generate problems like:

  • Predict the number of records having average <products.price> greater than 22.99 in the next 1m days
  • SELECT count(orderlines.orderid) from orderlines inner join products group by orderlines.orderid having avg(products.price) > 22.99

This predicts how many orders placed in a month will have an average product price above 22.99.
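To make the difference between the two kinds of problem concrete, here is a minimal sketch using Python's built-in sqlite3 module. The schema, the sample rows, and the productid join key are assumptions made for illustration and are not taken from the store dataset. The time window is left out. Note that a grouped query with HAVING returns one row per qualifying order, so this sketch wraps it in an outer COUNT to get a single number of orders.

```python
import sqlite3

# Hypothetical schema and data: the productid join key and all rows
# below are assumptions for illustration, not the real store dataset.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE products (productid INTEGER PRIMARY KEY, price REAL);
    CREATE TABLE orderlines (orderid INTEGER, productid INTEGER);
    INSERT INTO products VALUES (1, 10.00), (2, 30.00), (3, 50.00);
    INSERT INTO orderlines VALUES
        (100, 1), (100, 2),   -- order 100: average price 20.00
        (101, 2), (101, 3),   -- order 101: average price 40.00
        (102, 1);             -- order 102: average price 10.00
""")

JOIN = (
    "FROM orderlines INNER JOIN products "
    "ON orderlines.productid = products.productid"
)

# Existing problem type: count individual order lines whose product
# price exceeds the threshold (row-level filter with WHERE).
where_count = conn.execute(
    f"SELECT COUNT(*) {JOIN} WHERE products.price > ?", (22.99,)
).fetchone()[0]

# Proposed problem type: group order lines by order, keep only groups
# whose average price exceeds the threshold (HAVING), then count the
# surviving groups to get the number of qualifying orders.
having_count = conn.execute(
    f"""SELECT COUNT(*) FROM (
            SELECT orderlines.orderid {JOIN}
            GROUP BY orderlines.orderid
            HAVING AVG(products.price) > ?
        )""",
    (22.99,),
).fetchone()[0]

print(where_count, having_count)
```

On this sample data the WHERE query counts three order lines, while the HAVING query counts one order (order 101), which shows that the two problems measure different things.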
