feat: add more flexibility to visualize.pulls #342

andrzejnovak · 2022-06-09T09:30:43Z

feat: show unconstrained directly in pulls, add exclude_by_type
fix: leave redundant styling to style sheets
fix: pull display order
feat: add pull(exclude=... fnmatch, fix docs

codecov · 2022-06-09T17:03:04Z

Codecov Report

Merging #342 (fc3d344) into master (fce7208) will decrease coverage by 0.47%.
The diff coverage is 82.00%.

❗ Current head fc3d344 differs from pull request most recent head f5bd953. Consider uploading reports for the commit f5bd953 to get more accurate results

@@             Coverage Diff             @@
##            master     #342      +/-   ##
===========================================
- Coverage   100.00%   99.52%   -0.48%     
===========================================
  Files           23       23              
  Lines         1878     1911      +33     
  Branches       299      311      +12     
===========================================
+ Hits          1878     1902      +24     
- Misses           0        4       +4     
- Partials         0        5       +5

Impacted Files	Coverage Δ
src/cabinetry/visualize/plot_result.py	`95.94% <60.00%> (-4.06%)`	⬇️
src/cabinetry/visualize/utils.py	`90.90% <82.35%> (-9.10%)`	⬇️
src/cabinetry/fit/__init__.py	`100.00% <100.00%> (ø)`
src/cabinetry/fit/results_containers.py	`100.00% <100.00%> (ø)`
src/cabinetry/model_utils.py	`100.00% <100.00%> (ø)`
src/cabinetry/visualize/__init__.py	`100.00% <100.00%> (ø)`

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update fce7208...f5bd953. Read the comment docs.

src/cabinetry/fit/results_containers.py

andrzejnovak · 2022-06-09T17:22:04Z

@alexander-held ready for review

alexander-held · 2022-06-17T11:27:02Z

src/cabinetry/model_utils.py

+    labels = model.config.par_names()
+    _mod_dict = dict(model.config.modifiers)
+    _clean_labels = [re.sub(r"\[.*\]", "", label) for label in labels]
+    types = [_mod_dict[n] if n in _mod_dict else None for n in _clean_labels]


I am wondering whether we should extract and save the information about the constraint term (none, Gaussian, Poisson) instead of the modifier types. The problem with the types is that a parameter may control multiple modifiers (e.g. a normsys plus a shapesys). For the purpose of plotting pulls, the constraint term type is all that is needed to decide the handling.

We could get the constraint term types like this:

constraint_terms = [] for parameter in model.config.par_order: if not model.config.param_set(parameter).constrained: # normfactor / shapefactor constraint_terms += [None] * model.config.param_set(parameter).n_parameters else: # remaining modifiers with Gaussian or Poisson constraints constraint_terms += [ model.config.param_set("staterror_Signal_region").pdf_type ] * model.config.param_set("staterror_Signal_region").n_parameters

Going one step further, we could also save the .width() information to evaluate constraints for Poisson-constrained parameters (since the pre-fit uncertainty for those varies per parameter).

For other use cases (like #332) it might be useful to also know all the modifier types.

While it is currently possible to determine the constraint term type from knowing the modifier, that will change when constraint terms become configurable in the future with pyhf. So perhaps it is best to store constraint term information directly, and optionally add another field in the future to also keep track of the modifier types?

I am remembering now where the idea of storing the modifier types come from: that allows to exclude by type in the plot, like excluding staterror. The constraint term type does not help there. It seems more likely that users would want to exclude by modifier type than by constraint term type, so perhaps it is best to stick with the implemented approach. The only thing that would need to be generalized is matching multiple modifier types.

Something like the following could help simplify things by using more of the pyhf API:

modifier_types = [] for parameter in model.config.par_order: modifier_types += [ [ mod_type for par_name, mod_type in model.config.modifiers if par_name == parameter ] ] * model.config.param_set(parameter).n_parameters

alexander-held · 2022-06-17T11:40:20Z

src/cabinetry/visualize/plot_result.py

-    ax.fill_between([-2, 2], -0.5, len(bestfit) - 0.5, color="yellow")
-    ax.fill_between([-1, 1], -0.5, len(bestfit) - 0.5, color="limegreen")
-    ax.vlines(0, -0.5, len(bestfit) - 0.5, linestyles="dotted", color="black")
+    fig, ax = plt.subplots()


Let's keep the figsize=(6, 1 + num_pars / 4) scaling unless there's a strong reason not to, as that allows the spacing in the plot to be fairly consistent across a wide range of numbers of parameters.

alexander-held · 2022-06-17T11:44:26Z

src/cabinetry/visualize/plot_result.py

-    ax.xaxis.set_minor_locator(mpl.ticker.AutoMinorLocator())  # minor ticks
-    ax.tick_params(axis="both", which="major", pad=8)
-    ax.tick_params(direction="in", top=True, right=True, which="both")
-    fig.set_tight_layout(True)


I believe this removal is related to making figure customization easier via style sheets? If so, could we split that out into a separate PR? I'd like things to be more easily configurable, but I also do think the minor ticks help with legibility of constraints. Is there a way to move this into a default style sheet that users could override?

andrzejnovak · 2022-07-11T11:49:04Z

src/cabinetry/model_utils.py

+                mod_type
+                for par_name, mod_type in model.config.modifiers
+                if par_name == parameter
+            ][:1]


so for some modifiers in the example.py this returns ['histosys', 'normsys'] so then best_fit and types would have different length, it's unclear to me why some parameters have more types?

@alexander-held

A parameter can control multiple modifiers, and in some specific cases a parameter can also control modifiers of different types if they have the same constraint terms. When building models, cabinetry implements systematic uncertainties that change both shape and normalization via two correlated modifiers (normsys and histosys) which are controlled by the same parameter. For pulls to +/- 1 sigma this results in an equivalent model prediction to what a single histosys modifier can provide. The split of overall normalization changes across a channel into a correlated normsys helps protect the model predictions from becoming negative due to the exponential extrapolation used with normsys modifiers (while histosys is linear).

andrzejnovak added 4 commits June 9, 2022 11:11

feat: show unconstrained directly in pulls, add exclude_by_type

000a7d3

fix: leave redundant styling to style sheets

80458f5

fix: pull display order

11126cf

feat: add pull(exclude=... fnmatch, fix docs

fefece4

andrzejnovak changed the title ~~pulls~~ feat: add more flexibility to visualize.pulls Jun 9, 2022

chore: black

9ed8358

andrzejnovak force-pushed the pulls branch 3 times, most recently from ee35ef3 to d91baf0 Compare June 9, 2022 12:12

fix: satisfy precommit

6a3ab96

andrzejnovak force-pushed the pulls branch from 98d3d73 to 6a3ab96 Compare June 9, 2022 16:56

andrzejnovak commented Jun 9, 2022

View reviewed changes

src/cabinetry/fit/results_containers.py Outdated Show resolved Hide resolved

fix: cleanup

fc581ca

andrzejnovak force-pushed the pulls branch from a54f565 to fc581ca Compare June 9, 2022 17:11

fix: test plot ref

f5bd953

alexander-held self-requested a review June 9, 2022 17:22

alexander-held reviewed Jun 17, 2022

View reviewed changes

andrzejnovak force-pushed the pulls branch from 3971083 to 06e9434 Compare July 11, 2022 11:05

andrzejnovak commented Jul 11, 2022

View reviewed changes

alexander-held force-pushed the pulls branch from fc3d344 to f5bd953 Compare July 30, 2022 15:51

alexander-held mentioned this pull request Jan 16, 2023

feat: pull comparison plot #387

Open

vaustrup mentioned this pull request Dec 4, 2023

[FEATURE] .viz.pulls for normfactors #341

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add more flexibility to visualize.pulls #342

feat: add more flexibility to visualize.pulls #342

andrzejnovak commented Jun 9, 2022

codecov bot commented Jun 9, 2022 •

edited

Loading

andrzejnovak commented Jun 9, 2022

alexander-held Jun 17, 2022

alexander-held Jun 17, 2022

alexander-held Jun 17, 2022

alexander-held Jun 17, 2022

alexander-held Jun 17, 2022

andrzejnovak Jul 11, 2022

andrzejnovak Jul 11, 2022

alexander-held Jul 11, 2022

feat: add more flexibility to visualize.pulls #342

Are you sure you want to change the base?

feat: add more flexibility to visualize.pulls #342

Conversation

andrzejnovak commented Jun 9, 2022

codecov bot commented Jun 9, 2022 • edited Loading

Codecov Report

andrzejnovak commented Jun 9, 2022

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

codecov bot commented Jun 9, 2022 •

edited

Loading