Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support In and notIn operators in ParquetFilters.ConvertFilterToParquet #10341

Open
wants to merge 5 commits into
base: main
Choose a base branch
from

Conversation

sririshindra
Copy link
Contributor

@sririshindra sririshindra commented May 16, 2024

ParquetFilters.ConvertFilterToParquet performs conversion from an Iceberg filter (Expression) to a Parquet FilterPredicate. It currently only handles IS_NULL, NOT_NULL, IS_NAN, NOT_NAN, EQ, NOT_EQ, GT, GT_EQ, LT, LT_EQ. It does not handle, e.g., IN and NOT_IN, which would be useful to have.

This conversion can be used for setting a record filter in Parquet and then using that to filter row groups when reading Parquet files.

Change-Id: If424bf9f83d78d2768990cce553a56866461ca4e
@sririshindra sririshindra force-pushed the parquet_filtered_row_groups-CDPD-69432 branch from f2bb0c9 to 06bb5ac Compare May 16, 2024 17:13
Change-Id: If424bf9f83d78d2768990cce553a56866461ca4e
Change-Id: I6477d840a24f685ce578a8bca83c1c51437074a0
@github-actions github-actions bot added the API label May 21, 2024
@sririshindra sririshindra changed the title [WIP] Support In and notIn operators in ParquetFilters.ConvertFilterToParquet Support In and notIn operators in ParquetFilters.ConvertFilterToParquet May 21, 2024
Change-Id: Idf419106b89f356caff7a019e097bef677c46eef
Change-Id: I523d7bd39819a91c4905d007086c605bd0c53e70
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

1 participant