Feature request: Performance metrics per model-version #1970

vitalyli · 2022-01-27T21:45:07Z

Feature Request

Describe the problem the feature is intended to solve

We have multiple AB running, where the same model_name can have different versions,
which could have different performance outcomes.

For instance, the same model with the same inputs can have different number of layers or
architecture, which can make it slower and heavier especially for "on CPU" processing.

We need to have a way to monitor and get perf. metrics such as model p95 and average latency,
at model_name.version granularity, while currently, all that is visible is model_name level metrics.

:tensorflow:serving:request_latency_bucket{model_name="tf_model_name",API="Predict",entrypoint="GRPC",le="2.52873e+08"} 16237

Describe the solution

Solution to this is to have one more set of performance counters inside servable, to count p95 and average
time at more granular level of model-version.

Describe alternatives you've considered

Only way we can see right now, is to execute a call from the client and measure latency this way,
however that includes round trip latency and feature engineering requirements, that are specific to a given
model-version, thus making it operationally challenging at scale and maintenance headache, while still
not giving us pure server side metrics per model-version.

System information

**OS Platform and Distribution: CentOS 7; later OEL 8
TensorFlow Serving installed from (source or binary): source
TensorFlow Serving version: 2.6

vitalyli · 2022-04-16T01:28:22Z

@godot73 Any opinion on this. Thanks!

vitalyli · 2022-05-30T23:36:54Z

@google is this not feasible or just nobody else asked before?

singhniraj08 · 2023-06-23T04:16:11Z

@vitalyli,

Similar feature request #1959 in progress. Requesting you to close this issue and follow similar thread for updates.
Thank you.

github-actions · 2023-07-01T02:12:37Z

This issue has been marked stale because it has no recent activity since 7 days. It will be closed if no further activity occurs. Thank you.

github-actions · 2023-07-09T02:14:47Z

This issue was closed due to lack of activity after being marked stale for past 7 days.

pindinagesh self-assigned this Jan 28, 2022

pindinagesh added the type:feature label Jan 28, 2022

pindinagesh assigned godot73 and unassigned pindinagesh Jan 31, 2022

pindinagesh added the stat:awaiting tensorflower label Jan 31, 2022

singhniraj08 assigned nniuzft and unassigned godot73 Feb 17, 2023

singhniraj08 self-assigned this Jun 23, 2023

singhniraj08 added stat:awaiting response and removed stat:awaiting tensorflower labels Jun 23, 2023

github-actions bot added the stale This label marks the issue/pr stale - to be closed automatically if no activity label Jul 1, 2023

github-actions bot closed this as completed Jul 9, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature request: Performance metrics per model-version #1970

Feature request: Performance metrics per model-version #1970

vitalyli commented Jan 27, 2022 •

edited

Loading

vitalyli commented Apr 16, 2022

vitalyli commented May 30, 2022

singhniraj08 commented Jun 23, 2023

github-actions bot commented Jul 1, 2023

github-actions bot commented Jul 9, 2023

Feature request: Performance metrics per model-version #1970

Feature request: Performance metrics per model-version #1970

Comments

vitalyli commented Jan 27, 2022 • edited Loading

Feature Request

Describe the problem the feature is intended to solve

Describe the solution

Describe alternatives you've considered

System information

vitalyli commented Apr 16, 2022

vitalyli commented May 30, 2022

singhniraj08 commented Jun 23, 2023

github-actions bot commented Jul 1, 2023

github-actions bot commented Jul 9, 2023

vitalyli commented Jan 27, 2022 •

edited

Loading