
Allow selecting multiple metrics on compare page #133

Draft: dbutenhof wants to merge 15 commits into main from the compare branch

Conversation

@dbutenhof (Collaborator) commented Nov 13, 2024

Type of change

  • Refactor
  • New feature
  • Bug fix
  • Optimization
  • Documentation Update

Description

Support selection of multiple metrics using the pulldown in the comparison page. The update occurs when the pulldown closes.

To simplify the management of "available metrics" across multiple selected runs, which might have entirely different metrics, the reducer no longer tries to store separate metric selection lists for each run. This also means that the "default" metrics selection remains when adding another comparison run, or expanding another row.
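
A minimal sketch of that reducer simplification, using assumed state and action names (not the exact code in this PR): one shared list of selected metrics replaces the per-run selection lists, and it is replaced wholesale when the pulldown closes.

const initialState = {
  metrics_selected: [], // single selection list shared by all runs/rows
};

const ilabReducer = (state = initialState, action = {}) => {
  switch (action.type) {
    case "SET_ILAB_SELECTED_METRICS":
      // Commit the whole selection in one update when the pulldown closes.
      return { ...state, metrics_selected: action.payload };
    default:
      return state;
  }
};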

This is chained from #122 (Crucible service) -> #140 (unit test framework) -> #146 (crucible unit tests) -> #123 (ilab API) -> #155 (API unit tests) -> #158 (functional test framework) -> #124 (ilab UI) -> #153 (date picker) -> #125 (multi-run graphing API) -> #127 (multi-run graphing UI) -> #129 (statistics aggregation) -> #131 (metadata flyover) -> #132 (multiple metrics selection) -> #133 (compare multiple metrics)

Related Tickets & Documents

PANDA-645 support multiple metrics selection in compare view

Checklist before requesting a review

  • I have performed a self-review of my code.
  • If it is a core feature, I have added thorough tests.

Testing

Manual testing on local deployment.

@dbutenhof force-pushed the compare branch 2 times, most recently from 99e2605 to ac58188 on November 14, 2024 14:42

@jaredoconnell (Member) left a comment

Reviewed the compare commit. Looks fine overall.

try {
  if (getState().ilab.metrics?.find((i) => i.uid == uid)) {
    return;

Member comment:

Is this the case for when it already has the data synced? If so I would just add a simple comment like this:

     return; // already fetched

This also applies to the other instances of this.
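
For illustration, the guard with that comment applied might read as below (the thunk wrapper is abridged and its name is hypothetical):

export const fetchRunMetrics = (uid) => async (dispatch, getState) => {
  try {
    if (getState().ilab.metrics?.find((i) => i.uid == uid)) {
      return; // already fetched the metric list for this run
    }
    // ... fetch the metric list and dispatch it into the store ...
  } catch (error) {
    console.error(error);
  }
};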

Comment on lines 138 to 157
periods?.periods?.forEach((p) => {
  if (p.is_primary) {
    summaries.push({
      run: uid,
      metric: p.primary_metric,
      periods: [p.id],
    });
  }
  if (metrics) {
    metrics.forEach((metric) => {
      if (
        avail_metrics.find((m) => m.uid == uid)?.metrics?.includes(metric)
      ) {
        summaries.push({
          run: uid,
          metric,
          aggregate: true,
          periods: [p.id],
        });
      }
    });
  }
});
const response = await API.post(
  `/api/v1/ilab/runs/multisummary`,
  summaries
);
if (response.status === 200) {
  dispatch({
    type: TYPES.SET_ILAB_SUMMARY_DATA,
    payload: { uid, data: response.data },
  });
}

Member comment:

Can some basic comments be included to differentiate these two? Looking closely I can see that the bottom one is aggregate.
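
One way to address this (the comments are illustrative additions; the surrounding code is abridged from the diff above):

if (p.is_primary) {
  // The run's primary metric for its primary period is always summarized.
  summaries.push({
    run: uid,
    metric: p.primary_metric,
    periods: [p.id],
  });
}
if (metrics) {
  metrics.forEach((metric) => {
    if (avail_metrics.find((m) => m.uid == uid)?.metrics?.includes(metric)) {
      // Additional user-selected metrics: requested as aggregates over the period.
      summaries.push({
        run: uid,
        metric,
        aggregate: true,
        periods: [p.id],
      });
    }
  });
}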

@dbutenhof self-assigned this Nov 18, 2024

@jaredoconnell (Member) left a comment

All of the new changes look good.


This PR is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 10 days.

@github-actions (bot) added the Stale label Dec 19, 2024

This PR was closed because it has been stalled for 6 days with no activity.

dbutenhof and others added 13 commits February 18, 2025 11:22
This encapsulates substantial logic for interpreting the Crucible Common Data Model
OpenSearch schema for the use of CPT dashboard API components. By itself, it does
nothing.
This uses `black`, `isort`, and `flake8` to check code quality, although
failure is ignored until we've cleaned it up (which has begun in
PR cloud-bulldozer#139 against the `revamp` branch).

Minimal unit testing is introduced, generating a code coverage report.
The text summary is added to the Action summary page, and the more
detailed HTML report is stored as an artifact for download.

NOTE: The GitHub Action environment is unhappy with `uvicorn` 0.15;
upgrading to the latest 0.32.x seems to work and hasn't obviously
broken anything else.
`crucible_svc.py` test coverage is now at 97%. While the remaining 3% is
worth some effort later, the law of diminishing returns means it would require
significant additional effort; and since subsequent ILAB PRs will change
some of the service code anyway, it's good enough for now.
Provide the `api/v1/ilab` API endpoint to allow a client to query
collected data on a Crucible CDM OpenSearch instance through the
`crucible_svc` service layer. It is backed by the Crucible layer added
in cloud-bulldozer#122, so only the final commit represents changes in this PR.
This covers 100% of the ilab.py API module using `FastAPI`'s `TestClient`.

This proved ... interesting ... as the FastAPI and Starlette versions we use
are incompatible with the underlying httpx version ... TestClient init fails
in a way that can't be worked around. (Starlette passes an unknown keyword
parameter.)

After some experimentation, I ended up "unlocking" all the API-related
packages in `pyproject.toml` to `"*"` and letting `poetry update` resolve them,
then "re-locked" them to those versions. The resulting combination of modules
works for unit testing, and appears to work in a real `./local-compose.sh`
deployment as well.
This adds a mechanism to "can" and restore a small prototype ILAB (Crucible
CDM) Opensearch database in a pod along with the dashboard back end, front
end, and functional tests. The functional tests run entirely within the pod,
with no exposed ports and with unique container and pod names, allowing for
the possibility of simultaneous runs (e.g., a CI) on the same system.

This also has utilities for diagnosing a CDM (v7) datastore and cloning a
limited subset, along with creating an Opensearch snapshot from that data
to bootstrap the functional test pod.

Only a few functional test cases are implemented here, as demonstration. More
will be added separately.
This relies on the ilab API in cloud-bulldozer#123, which in turn builds on the crucible
service in cloud-bulldozer#122.
The `fetchILabJobs` action wasn't updating the date picker values from the API
response unless a non-empty list of jobs is returned. This means that on the
initial load, if the default API date range (1 month) doesn't find any jobs,
the displayed list is empty and the date range isn't updated to tell the user
what we've done.

I've seen no ill effects in local testing from simply removing the length
check, and now the date picker is updated correctly.
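
A sketch of the change described here (the action types, endpoint path, and response fields are assumptions, not the exact source): the date-picker dispatch no longer depends on a non-empty job list.

export const fetchILabJobs = () => async (dispatch, getState) => {
  const { start_date, end_date } = getState().ilab;
  const response = await API.get("/api/v1/ilab/runs", {
    params: { start_date, end_date },
  });
  if (response.status === 200) {
    const { results, startDate, endDate } = response.data;
    // This dispatch used to be guarded by `results.length > 0`; removing the
    // guard lets an empty result still update the displayed date range.
    dispatch({
      type: TYPES.SET_ILAB_DATE_FILTER,
      payload: { startDate, endDate },
    });
    dispatch({ type: TYPES.SET_ILAB_JOBS_DATA, payload: results });
  }
};
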
When graphing metrics from two runs, the timestamps rarely align; so we add a
`relative` option to convert the absolute metric timestamps into relative
delta seconds from each run's start.
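
The conversion is simple in principle; a hedged sketch (names are illustrative) of turning absolute sample timestamps into delta seconds from a run's start:

const toRelative = (samples, runStart) => {
  const start = new Date(runStart).getTime();
  return samples.map((s) => ({
    ...s,
    // seconds elapsed since this run began, so two runs can share one x-axis
    begin: (new Date(s.begin).getTime() - start) / 1000,
    end: (new Date(s.end).getTime() - start) / 1000,
  }));
};
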
This adds the basic UI to support comparison of the metrics of two InstructLab
runs. This compares only the primary metrics of the two runs, in a relative
timeline graph.

This is backed by cloud-bulldozer#125, which is backed by cloud-bulldozer#124, which is backed by cloud-bulldozer#123,
which is backed by cloud-bulldozer#122. These represent a series of steps towards a complete
InstructLab UI and API, and will be reviewed and merged from cloud-bulldozer#122 forward.
This PR is primarily CPT dashboard backend API (and Crucible service) changes
to support pulling and displaying multiple Crucible metric statistics. Only
minor UI changes are included to support API changes. The remaining UI changes
to pull and display statistics will be pushed separately.
Add statistics charts for selected metric in row expansion and comparison
views.
Extract the "Metadata" into a separate component, which allows it to be reused
as an info flyover on the comparison page to help in identifying target runs
to be compared.
Modify the metrics pulldown to allow multiple selection. The statistical
summary chart and graph will show all selected metrics in addition to the
benchmark's inherent primary metric (for the primary period).
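
A sketch of the selection behavior implied here and in the compare-page commit below (handler and action names are hypothetical): clicking an item toggles it in the selection, and the accumulated list is committed when the pulldown closes.

const onMetricSelect = (selected, metric, setSelected) => {
  setSelected(
    selected.includes(metric)
      ? selected.filter((m) => m !== metric)
      : [...selected, metric]
  );
};

const onToggle = (isOpen, selected, dispatch) => {
  if (!isOpen) {
    // Pulldown just closed: commit the accumulated selection in one update.
    dispatch(setSelectedMetrics(selected)); // hypothetical action creator
  }
};
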
Support selection of multiple metrics using the pulldown in the comparison
page. The update occurs when the pulldown closes.

To simplify the management of "available metrics" across multiple selected
runs, which might have entirely different metrics, the reducer no longer
tries to store separate metric selection lists for each run. This also means
that the "default" metrics selection remains when adding another comparison
run, or expanding another row.