
[tabular] add infer throughput logging #4200

Merged
2 commits merged into autogluon:master on May 16, 2024

Conversation

@Innixma Innixma (Contributor) commented May 14, 2024

Issue #, if available:
Resolves #4162

Description of changes:

  • Add inference throughput logging. This implementation introduces no overhead: instead of calling predict separately, the throughput is computed from the same predictions already produced when the validation score is calculated (see the sketch after the log comparison below).
  • Code cleanup / de-duplication.
  • Added additional method documentation.

Mainline:

AutoGluon training complete, total runtime = 21.17s ... Best model: WeightedEnsemble_L2

This PR:

AutoGluon training complete, total runtime = 21.17s ... Best model: WeightedEnsemble_L2 | Estimated inference throughput: 23766.6 rows/s (1500 batch size)
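
For illustration only, here is a minimal sketch of the pattern described in the first bullet; it is not the actual AutoGluon code, and the names score_with_throughput and scorer are hypothetical. The single predict call is timed, and its output is then reused for scoring, so measuring throughput adds no extra inference pass.

```python
import time

def score_with_throughput(model, X_val, y_val, scorer):
    # Hypothetical sketch: time the one predict call whose output is also
    # used for scoring, so measuring throughput adds no extra inference pass.
    n_rows = len(X_val)
    start = time.time()
    y_pred = model.predict(X_val)      # single prediction pass
    predict_time = time.time() - start
    score = scorer(y_val, y_pred)      # score computed from the same predictions
    throughput = n_rows / predict_time if predict_time > 0 else float("inf")
    print(f"Estimated inference throughput: {throughput:.1f} rows/s ({n_rows} batch size)")
    return score, throughput
```

As a rough sanity check, predicting 1500 validation rows in about 0.063 s works out to roughly 23,800 rows/s, the same order of magnitude as the example log line above.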

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

@Innixma Innixma added this to the 1.1.1 Release milestone May 14, 2024
@yinweisu (Collaborator) commented:

Previous CI Run          Current CI Run
botocore==1.34.104       botocore==1.34.105
boto3==1.34.104          boto3==1.34.105

@yinweisu (Collaborator) commented: Previous CI Run | Current CI Run


Job PR-4200-9d2669d is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-4200/9d2669d/index.html

@rey-allan rey-allan (Collaborator) left a comment:

LGTM!

"""
y_pred_proba = self.predict_proba(X, **kwargs)
y_pred = get_pred_from_proba(y_pred_proba=y_pred_proba, problem_type=self.problem_type)
return y_pred

def predict_proba(self, X, normalize=None, **kwargs) -> np.ndarray:
def predict_proba(self, X, *, normalize: bool | None = None, record_time: bool = False, **kwargs) -> np.ndarray:
A Contributor commented:

Why do we not set record_time to True by default, so the time taken for prediction is always recorded?

@Innixma Innixma (Contributor, Author) replied:

Because calling time.time itself takes time, and we want predict to take as little time as possible for the user.
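
As a rough illustration of that trade-off (hypothetical method body, helper, and attribute names; not the actual AutoGluon implementation), the opt-in flag keeps the clock calls out of the default path:

```python
import time

def predict_proba(self, X, *, record_time: bool = False, **kwargs):
    # Hypothetical sketch: only touch the clock when the caller opts in,
    # so the default predict path pays no timing overhead.
    if not record_time:
        return self._predict_proba_internal(X, **kwargs)  # hypothetical helper name
    start = time.time()
    y_pred_proba = self._predict_proba_internal(X, **kwargs)
    self.predict_time = time.time() - start  # hypothetical attribute name
    return y_pred_proba
```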

@prateekdesai04 prateekdesai04 (Contributor) left a comment:

LGTM

@Innixma Innixma merged commit 6d7122f into autogluon:master May 16, 2024
29 checks passed
Development

Successfully merging this pull request may close these issues.

[tabular] Add logging of inference throughput of best model at end of fit (#4162)
4 participants