Audit snapshot tests for scannability #168

Open
zkamvar opened this issue Nov 21, 2024 · 0 comments
Labels
upkeep (maintenance, infrastructure, and similar)

Comments


zkamvar commented Nov 21, 2024

@annakrystalli had noticed and fixed a silent bug in the snapshot tests for validations due to a leaky mock. My analysis was:

It looks like this bug popped up with the fix for #139, which was an urgent fix. It's completely understandable how it got through: it's a symptom of complex snapshot output not being easily scannable (i.e. it's hard to distinguish a check_failure or check_error from a check_success in a long list like this).

Originally posted by @zkamvar in #167 (comment)

While this may on the surface appear to be an isolated incident, it highlights a limitation of snapshot tests: they were intended for human-readable output. As the output grows in complexity, the potential for missed side effects and fragility increases. These snapshot tests rely on someone being able to glance at the test output and confirm that it shows what we expect it to show. That depends on the developer's and reviewer's state of mind when inspecting the output, and I've never met a developer who was 100% attentive every time they were writing or reviewing code.

Solution

I would like to go through the snapshot tests that use str() or output tibbles and add programmatic validations. It's not a high-priority item, and it's not the most exciting work, but it will pay off in the future as we continue to work on and refactor this codebase.

An example of such a change for this particular snapshot would be to build a vector from the hub_validations object of the checks that did not pass and compare it to an expected vector:

# Confirm that the only check that failed was "file_n".
# check_table.csv ships with hubValidations and lists each check, the
# function that runs it, and whether it is optional.
check_path <- system.file("check_table.csv", package = "hubValidations")
checks <- arrow::read_csv_arrow(check_path) |>
  dplyr::filter(.data$`parent fun` != "validate_model_metadata", !.data$optional)

# Every check is expected to pass except "file_n", which fails because of
# the duplicate files in this test fixture.
expected <- setNames(rep(FALSE, nrow(checks)), checks$Name)
expected["file_n"] <- TRUE

# dup_model_out_val is the hub_validations object produced earlier in the
# test; not_pass() flags any check that did not pass.
failures <- purrr::map_lgl(dup_model_out_val, not_pass)
expect_equal(failures, expected[names(failures)])
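
For reference, here is a minimal sketch of what the not_pass() predicate used above could look like. It assumes each element of a hub_validations object is classed as check_success, check_failure, or check_error, as described in the quoted comment; it is illustrative, not the package's actual implementation.

not_pass <- function(check) {
  # A check "passes" only when it is a check_success; anything else
  # (check_failure, check_error, etc.) counts as not passing.
  !inherits(check, "check_success")
}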
zkamvar added the upkeep label Nov 21, 2024