feat(api): move from .case() to .cases() #9096

NickCrews · 2024-05-01T17:01:37Z

Redo of #7914 (with substantive changes) and #9039 (merely switching the base repo to the correct one, my fork)

Summary of changes:

Instead of removing the old .case() APIs, I deprecate them here. There are a few tests for them to try to ensure we didn't break anyone.
Added many more test cases, including for dtypes, dshapes, and expected errors on bad construction
Added test that ibis.null().cases((None, "fill"), else_="not hit") always results in "not hit". Maybe not the best ergonomics, but at least it is consistent and written down. Perhaps we can revisit later. See one of the TODOs below.
Fixed a bug where datashape was only getting determined from base or cases. Really it needs to depend on ALL inputs.
Added some tests and implementation for dealing with empty branches: ibis.cases() (results in NULL) and ibis.cases(else_=5) (results in 5). I considered disallowing these, but I don't think there is anything semantically wrong with supporting this.
Moved a few tests from the pandas and dask backends to backends/test/test_generic.py so they are run on all backends.
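
To make the empty-branch behavior described above concrete, here is a minimal pure-Python sketch. It is a toy model, not the ibis implementation: `cases_model` is a hypothetical name, and Python's `None` stands in for SQL NULL.

```python
# Toy model of a searched CASE expression. NOT the ibis implementation;
# `cases_model` is a hypothetical helper used only for illustration.
def cases_model(*branches, else_=None):
    """Return the result of the first branch whose condition is True.

    Each branch is a (condition, result) pair. None stands in for SQL NULL.
    """
    for condition, result in branches:
        if condition is True:
            return result
    # No branch matched (or there were no branches at all).
    return else_


print(cases_model())                                      # None, i.e. NULL
print(cases_model(else_=5))                               # 5
print(cases_model((False, "a"), (True, "b"), else_="c"))  # b
```

With no branches there is nothing to match, so the result is simply whatever `else_` is, which defaults to NULL.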

TODOs that I found that should come in followups:

NULL replacement isn't super consistent yet. For example, val.substitute({None: 4}) currently does a fillna(). But if you do val.cases((None, 4), else_=val), this ALWAYS hits the else_ case, because x = NULL never evaluates to True. The exception is clickhouse, which appears to special-case this. See the added test_switch_cases_null test. This also isn't consistent, in the sense that the special-casing only applies to Python None. If you use ibis.null(), or something only known at runtime like ibis.literal(5).nullif(5), then this will always hit the else_ case. Due to these limitations, I vote for making matching against NULLs out of scope for .cases and .substitute. If a user wants to do this, they should do a .fillna() first.
The batting table has a column RBI of type int64. On sqlite, .to_pandas() returns this as a column of type object. I have this marked as broken here, but it would be good to fix separately.
Literal('foo', type=bool) should error, but doesn't.
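
The NULL-matching TODO above comes down to SQL's three-valued logic. The following sketch models it in plain Python under stated assumptions: `sql_eq`, `value_cases_model`, and `fill_null` are hypothetical helpers, `None` stands in for NULL, and none of this is how ibis or any backend actually implements it.

```python
# Toy model of SQL equality and a value-based CASE. Hypothetical helpers,
# not ibis code. None stands in for SQL NULL.
def sql_eq(left, right):
    """SQL-style equality: comparing anything with NULL yields NULL."""
    if left is None or right is None:
        return None
    return left == right


def value_cases_model(value, *branches, else_=None):
    """Return the result of the first (key, result) branch where value = key."""
    for key, result in branches:
        # A NULL comparison result is not True, so the branch is skipped.
        if sql_eq(value, key) is True:
            return result
    return else_


def fill_null(value, replacement):
    """Stand-in for filling NULLs before matching."""
    return replacement if value is None else value


# Matching against NULL never succeeds, so else_ is always returned.
print(value_cases_model(None, (None, "fill"), else_="not hit"))  # not hit
# Filling NULLs first gives the replacement the user probably wanted.
print(fill_null(None, 4))                                        # 4
```

This is why a NULL key in a branch can never be hit under standard semantics, and why filling NULLs beforehand is the predictable route.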

NickCrews · 2024-05-01T17:52:43Z

EDIT: duh, it's because they don't guarantee row order. Updated the assertions to be order-independent.

Any idea as to why the datafusion, exasol, and risingwave column tests are failing? I still have trouble getting those backends running on my M1 so I can't debug locally very well.
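
For reference, an order-independent comparison can be sketched roughly as below. This is only an illustration of the idea, not the assertion code used in the tests: `same_rows` is a hypothetical helper and assumes rows are hashable tuples.

```python
from collections import Counter


# Hypothetical helper: compares two row collections as multisets, so row
# order is ignored but duplicate rows still have to match in count.
def same_rows(actual, expected):
    return Counter(actual) == Counter(expected)


expected = [(1, "a"), (2, "b"), (2, "b")]
shuffled = [(2, "b"), (1, "a"), (2, "b")]

print(same_rows(shuffled, expected))            # True
print(same_rows([(1, "a"), (2, "b")], expected))  # False
```

Comparing as multisets rather than plain sets keeps the check strict about duplicated rows.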

NickCrews · 2024-05-01T17:29:30Z

ibis/expr/types/numeric.py

- .else_(nulls)
- .end()
- )
+ return self.cases(*enumerate(labels), else_=nulls)


Now that this is such a simple implementation, I would consider deprecating and then removing the whole .label() API.
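
As a quick illustration of what `*enumerate(labels)` passes along in the new one-liner, here is a plain-Python sketch. `collect_branches` is a hypothetical stand-in that only records its arguments. It is not `Value.cases()`.

```python
# Hypothetical stand-in that records the arguments it receives.
def collect_branches(*branches, else_=None):
    return branches, else_


labels = ["low", "mid", "high"]

# enumerate() yields (index, label) pairs, and * unpacks them into
# separate positional arguments, one (key, result) branch per label.
branches, else_ = collect_branches(*enumerate(labels), else_="unknown")

print(branches)  # ((0, 'low'), (1, 'mid'), (2, 'high'))
print(else_)     # unknown
```

Each integer bucket index becomes the key of a branch and the label at that position becomes its result.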

NickCrews · 2024-05-11T00:09:42Z

@cpcloud gentle nudge for a review here :)

NickCrews · 2024-05-21T21:13:20Z

@cpcloud anything I can do to help move this forward/easier to review?

NickCrews mentioned this pull request May 1, 2024

feat(api): move from .case() to .cases() #9039

Closed

NickCrews force-pushed the case-to-cases branch 5 times, most recently from 8ac2d67 to 6885a2c Compare May 1, 2024 17:51

NickCrews requested a review from cpcloud May 1, 2024 18:05

NickCrews force-pushed the case-to-cases branch 6 times, most recently from d3ac95a to 513bb94 Compare May 7, 2024 17:52

NickCrews added 2 commits May 8, 2024 14:11

test: add tests for Value.cases()

4d33830

This is setup for ibis-project#9039, where I change the API of Value.cases(), so I want to make sure that this functionality doesn't change, but the user gets a deprecationwarning

feat: move from .case() to .cases()

ae70dc6

Fixes ibis-project#7280

NickCrews force-pushed the case-to-cases branch from 513bb94 to ae70dc6 Compare May 8, 2024 22:11
