Changed type of _fillna_map in normalize_time_series #4197

duartepinto555 · 2024-05-14T10:29:30Z

Issue #, if available:

Description of changes:
Changed the _fillna_map format to Timestamp with UTC. Previously this was changing the type of the series if the _fillna_map was not in the same format as the series, which later causes an error in the ".dt" parameter.

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

…series Issue #, if available: Description of changes: Changed the _fillna_map format to Timestamp with UTC. Previously this was changing the type of the series if the _fillna_map was not in the same format as the series, which later causes an error in the ".dt" parameter. By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

yinweisu · 2024-05-14T10:40:38Z

Previous CI Run	Current CI Run

duartepinto555 · 2024-05-14T12:40:31Z

I'm sorry, this is my first time contributing on github, I was just running code with autogluon and arrived upon this error and thought of giving a contribute... All the tests still pass and around the same time, should I do something else? Sorry for the inconvenience

github-actions · 2024-05-14T13:09:46Z

Job PR-4197-1b62216 is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-4197/1b62216/index.html

Innixma · 2024-05-16T22:07:35Z

Hey @duartepinto555, great find, thanks for sending a PR!

Could you provide a code example where the previous logic fails but your PR succeeds?

You can either include it in the PR as a unit test or provide the code in a GitHub comment in this PR and I can test it out.

The main thing I need is to be able to verify that this change fixes a scenario where we previously failed.

duartepinto555 · 2024-05-17T08:50:04Z

Thank you for your detailed answer, @Innixma!

So, my use case was a niche one.

Basically, I have a model generated from AWS from an older autogluon==0.4.3. This made the _fillna_map attribute of the DatetimeFeatureGenerator differ from the new version. In version 0.4.3, the attribute doesn't have the datetime with "utc" which changes the type when comparing to the new version.

The error appears when loading this new model using autogluon==1.1.0 and trying to get a prediction. I can share my code as follows:

from autogluon.tabular import TabularPredictor
import pandas as pd
import os

MODEL_DIR = 'path_to_model_from_version_0.4.3'
testing_dataset_dir = 'path_to_prediction_dataset.feather'

# Loading model
predictor = TabularPredictor.load(MODEL_DIR, require_version_match=False)
predictor._decision_threshold = None

# Loading training dataset
df = pd.read_feather(testing_dataset_dir)

# Change paths from AWS S3 to Local one
model_name = 'CatBoost_BAG_L1_FULL'
predictor._trainer.set_model_attribute(model_name, 'path', [model_name])

# Get prediction
prediction = predictor.predict(df, model=model_name)    # >> Error occurs here

Hope you can still replicate this error, if you want me to provide an older model which returns this error I can do so as well!

Innixma · 2024-05-17T18:54:58Z

Ah I see, thanks for the response @duartepinto555. The proposed change would introduce a inference latency overhead due to the extra pd.to_datetime conversion. While I would merge it if it were resolving an existing bug, this is instead related to backwards compatibility.

We explicitly do not support backwards compatible model loading, which is why when you try to load the old artifact it will warn you that the versions differ. Trying to ensure our code works when loading old artifacts is too complicated and would slow down our development, especially for versions that differ in major version (0.x <-> 1.x). My guess is that the bug you are experiencing here is just the tip of the iceberg, and many other things would go wrong trying to get your old model working on 1.1.0 without retraining.

Please either retrain your predictor from scratch using 1.1.0, or continue using 0.4.3 for inference. You can try monkey-patching / using your own fork of AutoGluon with this change, but you'll have to find your own work-arounds to any issues going this route.

If your model artifact is from SageMaker AutoPilot / Canvas, I believe they will be upgrading to using a more recent version of AutoGluon shortly (within 1-2 months).

Innixma added module: tabular module: features labels May 16, 2024

Innixma added this to the 1.1.1 Release milestone May 16, 2024

Innixma closed this May 17, 2024

Innixma added the wontfix This will not be worked on label May 17, 2024

duartepinto555 deleted the patch-1 branch May 18, 2024 09:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Changed type of _fillna_map in normalize_time_series #4197

Changed type of _fillna_map in normalize_time_series #4197

duartepinto555 commented May 14, 2024

yinweisu commented May 14, 2024

duartepinto555 commented May 14, 2024

github-actions bot commented May 14, 2024

Innixma commented May 16, 2024

duartepinto555 commented May 17, 2024 •

edited

Innixma commented May 17, 2024

Changed type of _fillna_map in normalize_time_series #4197

Changed type of _fillna_map in normalize_time_series #4197

Conversation

duartepinto555 commented May 14, 2024

yinweisu commented May 14, 2024

duartepinto555 commented May 14, 2024

github-actions bot commented May 14, 2024

Innixma commented May 16, 2024

duartepinto555 commented May 17, 2024 • edited

Innixma commented May 17, 2024

duartepinto555 commented May 17, 2024 •

edited