Generate neighbors and sample configurations correctly with deeply nested conditions #197

filipbartek · 2021-09-17T09:46:52Z

Resolves #194.

The one-exchange neighborhood iterator rigorously checks some of the generated configurations. The check is performed with a probability of 5%, as decided by `np.random`. Make the check deterministic (using an explicitly seeded `np.random.RandomState`) instead of relying on the default NumPy random state, which need not be seeded.
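A minimal sketch of the idea, not the actual ConfigSpace code: the helper names `should_check` and `check_pattern` and the seed value are assumptions made for illustration. It shows how an explicitly seeded `np.random.RandomState` makes the 5% sampling decision reproducible across runs.

```python
import numpy as np

CHECK_PROBABILITY = 0.05  # the 5% rate mentioned above


def should_check(rng: np.random.RandomState) -> bool:
    """Return True with probability CHECK_PROBABILITY (hypothetical helper)."""
    return rng.rand() < CHECK_PROBABILITY


def check_pattern(seed: int, n: int) -> list:
    """Decide, for n generated configurations, which ones get the rigorous check."""
    rng = np.random.RandomState(seed)  # explicitly seeded, independent of np.random
    return [should_check(rng) for _ in range(n)]


# Two runs with the same seed check exactly the same configurations.
first_run = check_pattern(seed=1, n=1000)
second_run = check_pattern(seed=1, n=1000)
```

Because the generator is local and seeded, a failure triggered by the check would reappear on every run instead of only occasionally.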

Make a test consistent with the current behavior of `get_one_exchange_neighborhood`. Update the hard-coded expected values.

The PCS contains a conditional dependency with the operator "||", which is interpreted as `OrConjunction`.

An optimization declares a hyperparameter (HP) inactive as soon as any of the HP's parents according to a condition is inactive. This logic is incorrect when the condition is an `OrConjunction`. This commit ensures that the optimization is not applied to `OrConjunction`s.
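To illustrate why the shortcut fails, here is a small standalone sketch. The function names are hypothetical and do not come from ConfigSpace; conditions are reduced to plain booleans for the purpose of the example.

```python
from typing import List


def shortcut_says_inactive(parents_active: List[bool]) -> bool:
    """The shortcut: the child is inactive as soon as any parent is inactive."""
    return not all(parents_active)


def and_condition_holds(parent_conditions: List[bool]) -> bool:
    """Conjunction: every parent condition must hold."""
    return all(parent_conditions)


def or_condition_holds(parent_conditions: List[bool]) -> bool:
    """Disjunction: one satisfied parent condition is enough."""
    return any(parent_conditions)


# One parent is inactive (so its condition cannot hold);
# the other parent is active and its condition is satisfied.
parents_active = [False, True]
parent_conditions = [False, True]

shortcut = shortcut_says_inactive(parents_active)   # shortcut deactivates the child
under_and = and_condition_holds(parent_conditions)  # conjunction agrees: child is inactive
under_or = or_condition_holds(parent_conditions)    # disjunction disagrees: child is active
```

The shortcut and the conjunction agree, while the disjunction keeps the child active, which is the case the shortcut gets wrong.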

codecov · 2021-09-17T09:50:32Z

Codecov Report

Merging #197 (3b0dbf1) into master (4d69931) will decrease coverage by 0.19%.
The diff coverage is n/a.

@@            Coverage Diff             @@
##           master     #197      +/-   ##
==========================================
- Coverage   67.15%   66.96%   -0.20%     
==========================================
  Files          17       17              
  Lines        1629     1662      +33     
==========================================
+ Hits         1094     1113      +19     
- Misses        535      549      +14

Impacted Files	Coverage Δ
ConfigSpace/read_and_write/json.py	`82.91% <0.00%> (-3.69%)`	⬇️
ConfigSpace/nx/classes/graph.py	`23.66% <0.00%> (-0.51%)`	⬇️
ConfigSpace/read_and_write/pcs_new.py	`90.69% <0.00%> (-0.22%)`	⬇️
ConfigSpace/__init__.py	`100.00% <0.00%> (ø)`
ConfigSpace/read_and_write/pcs.py	`85.53% <0.00%> (+0.06%)`	⬆️

Δ = absolute <relative> (impact), ø = not affected, ? = missing data

When searching the hierarchy of hyperparameters (HPs), every time a HP is activated or deactivated, its children need to be re-considered for activation or deactivation. If there is no cycle in the condition graph, this process will terminate and converge. Before this commit, each HP was only considered at most once. This suffices if there is no `OrConjunction` condition: as soon as one of the parents of a HP is inactive, the HP is necessarily inactive.

filipbartek · 2021-09-17T14:03:03Z

This solution resolves the constraints in an arbitrary order, possibly updating the status of a HP more than once. My guess is that following a topological order of the HPs (which seems to be respected by ConfigSpace.get_hyperparameters() and ConfigSpace._children_of) would lead to a linear complexity, while the worst-case complexity of this "arbitrary order" approach is quadratic. Using a topological order would require iterating through all the HPs in the CS, or maintaining to_visit as a priority queue.

Since the parents may be nested in a hierarchy of OrConjunction and AndConjunction, we may only infer inactivity without inspecting the whole hierarchy when all the parents are inactive.
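The rule above can be sketched on a toy condition tree. The classes `Leaf`, `And`, and `Or` are simplified stand-ins invented for this example (they are not the ConfigSpace condition classes), and inactive HPs are modeled as keys missing from a dictionary.

```python
from dataclasses import dataclass
from typing import Dict, List, Set, Union


@dataclass
class Leaf:
    parent: str   # name of the parent HP
    value: float  # value the parent must take for the condition to hold


@dataclass
class And:
    parts: List["Node"]


@dataclass
class Or:
    parts: List["Node"]


Node = Union[Leaf, And, Or]


def parents_of(node: Node) -> Set[str]:
    """Collect all parent HP names mentioned anywhere in the tree."""
    if isinstance(node, Leaf):
        return {node.parent}
    return set().union(*(parents_of(part) for part in node.parts))


def evaluate(node: Node, active_values: Dict[str, float]) -> bool:
    """Full evaluation; a leaf on an inactive (missing) parent never holds."""
    if isinstance(node, Leaf):
        return node.parent in active_values and active_values[node.parent] == node.value
    results = [evaluate(part, active_values) for part in node.parts]
    return all(results) if isinstance(node, And) else any(results)


def is_active(condition: Node, active_values: Dict[str, float]) -> bool:
    # Safe shortcut: only when ALL parents are inactive can we skip the evaluation.
    if not any(parent in active_values for parent in parents_of(condition)):
        return False
    return evaluate(condition, active_values)


condition = Or([And([Leaf("a", 1.0), Leaf("b", 1.0)]), Leaf("c", 1.0)])
only_c = is_active(condition, {"c": 1.0})  # a and b inactive, yet the child is active
nothing = is_active(condition, {})         # all parents inactive: shortcut applies
only_a = is_active(condition, {"a": 1.0})  # needs full evaluation, child is inactive
```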

Maintain a minimum heap of indices of HPs that should be visited. Since the HPs are topologically ordered by index, this ensures that all the parents of a HP are visited before that HP is visited. Forbid changing the value of an inactive HP. Raise ValueError on such attempt.

In some configuration spaces that include deeply nested conditions, sampling could yield an invalid configuration. This commit ensures that the procedure that normalizes the freshly sampled configuration handles all configuration spaces, including the complex ones, correctly. The new implementation of correct_sampled_array visits all the conditional hyperparameters in topological order. If any visited HP is deemed inactive, its vector value is set to NaN.
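A rough sketch of the normalization described above, under stated assumptions: `normalize_sampled_vector` and the dictionary of condition callables are hypothetical and much simpler than the real `correct_sampled_array`. HP indices are assumed to be topologically ordered, and NaN marks an inactive HP.

```python
import numpy as np


def normalize_sampled_vector(vector, conditions):
    """Set inactive conditional HPs to NaN, visiting them in index order.

    conditions maps the index of a conditional HP to a callable that
    takes the vector and returns whether the HP is active.
    """
    vector = vector.copy()
    for idx in sorted(conditions):  # increasing index = topological order (assumed)
        if not conditions[idx](vector):
            vector[idx] = np.nan
    return vector


# HP 0 is unconditional.
# HP 1 is active only if HP 0 == 1.
# HP 2 is active only if HP 1 == 0 (NaN == 0 is False, so an inactive HP 1 disables HP 2).
conditions = {
    1: lambda v: v[0] == 1.0,
    2: lambda v: v[1] == 0.0,
}

sampled = np.array([0.0, 0.0, 0.0])
normalized = normalize_sampled_vector(sampled, conditions)
```

Visiting HP 2 before HP 1 would wrongly keep HP 2 active here, because HP 1 would still hold its sampled value of 0 at that point.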

Break lines that are longer than 100 characters.

filipbartek · 2022-01-25T20:33:54Z

I have extended the changes so that generating neighbors and sampling random configurations works correctly with configuration spaces with conditions nested deeper than 1 level of AbstractConjunction. Both of these procedures now use the topological order of hyperparameters when ensuring that the output configuration is valid.

filipbartek · 2022-01-25T20:42:02Z

ConfigSpace/configuration_space.pyx

@@ -1269,6 +1269,7 @@ class ConfigurationSpace(collections.abc.Mapping):

        unconditional_hyperparameters = self.get_all_unconditional_hyperparameters()
        hyperparameters_with_children = list()
+        conditional_hyperparameters = sorted(self.get_all_conditional_hyperparameters(), key=lambda hp_name: self._hyperparameter_idx[hp_name])


n ... number of conditional hyperparameters
N ... number of all hyperparameters

This implementation has the complexity O(n log n d(N)), where d(N) is the complexity of fetching an element from a dictionary indexed by N strings.

Alternative:
[hp_name for hp_name in self._hyperparameters if hp_name in self.get_all_conditional_hyperparameters()]
Complexity: O(N s(n)), where s(n) is the complexity of checking membership in a set of n strings.
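The two alternatives can be compared on a toy example. The dictionary and set below are stand-ins for `self._hyperparameter_idx` and the result of `self.get_all_conditional_hyperparameters()`; their contents are made up for illustration.

```python
# Stand-in for self._hyperparameter_idx: HP name -> index (insertion-ordered dict).
hyperparameter_idx = {"a": 0, "b": 1, "c": 2, "d": 3}

# Stand-in for the set of conditional HP names, fetched once up front.
conditional = {"d", "b"}

# Variant from the diff: sort the n conditional names by their index.
by_sorting = sorted(conditional, key=lambda name: hyperparameter_idx[name])

# Alternative: walk all N names in index order and keep the conditional ones.
by_filtering = [name for name in hyperparameter_idx if name in conditional]
```

Both variants yield the conditional HPs in index order. Which one is cheaper depends on how n compares to N, as the complexity estimates above suggest.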

filipbartek · 2022-01-25T20:44:27Z

ConfigSpace/c_util.pyx

+    to_visit = [index]
+
+    # Since one hyperparameter may be reachable in several ways, we need to make sure we don't process it twice.
+    scheduled = np.zeros(len(configuration_space), dtype=bool)


Does using a np.ndarray without declaring it with cdef harm performance significantly?

filipbartek · 2022-01-25T20:46:00Z

This pull request probably solves the issue solved by #219.

mfeurer · 2022-05-02T14:54:01Z

Hi @dengdifan and @filipbartek,
Please excuse the delay; I have finally gotten to your two PRs (this one and #219). As a first step, I checked that they both fix the issue in https://github.com/automl/ConfigSpace/pull/219/files#diff-62b93f19eeb2c5ff6e26eaccf3f63ee6641e87f3a6ccfc2ffb0dec86925ae245

As a next step, I'd like to figure out why the two PRs are so different in size and whether they implement different things underneath.

From what I can see, the PR of @dengdifan touches the functions:

c_util.change_hp_value

and the PR from @filipbartek touches

c_util.correct_sampled_array
c_util.change_hp_value
configuration_space.ConfigSpace.get_active_hyperparameters

and none of your PRs touches

c_util.check_configurations()
util.deactivate_inactive_hyperparameters()

This PR (#197, for issue #194) is more complete than PR #219, as it handles the test added there but not vice versa.

I do like the rather small changes of @dengdifan to c_util.change_hp_value and from @filipbartek to configuration_space.ConfigSpace.get_active_hyperparameters. However, I don't immediately understand the large changes from @filipbartek to c_util.correct_sampled_array and c_util.change_hp_value, could you please give some further explanation on what you are doing here? Would it be possible to reproduce the changed behavior in the style of the original algorithms? Lastly, we'd need to check the two functions that weren't touched by any of the PRs to ensure that they don't require an update.

Therefore, I propose the following concrete steps forward, let me know what you think about them:

1. Merge #219 (Diamond cond with OrConjunction).
2. @filipbartek rebases this PR on master, keeps the changes from #219 for c_util.change_hp_value, and changes c_util.correct_sampled_array to be closer to the original version of the algorithm. We then merge this PR, too.
3. We check whether c_util.check_configurations() and util.deactivate_inactive_hyperparameters() require any special treatment, too.

mfeurer · 2022-05-05T09:59:32Z

We need to check if this PR fixes #196 as well.

dengdifan · 2022-05-06T15:04:05Z

This should also fix #253 while #219 cannot fix it

filipbartek · 2022-05-11T16:21:01Z

ConfigSpace/c_util.pyx

-                    to_disable = set()
-                    for ch in children:
-                        to_disable.add(ch.name)
-                    while len(to_disable) > 0:


This loop adds all the descendants of current to disabled. No elements are ever removed from disabled. On lines 345 to 346, all of the HPs in disabled are set to NaN. However, if any of these descendants is conditioned by an OrConjunction, deactivating current need not suffice to deactivate the descendant. For each descendant, we should evaluate its conditions before disabling it.

filipbartek · 2022-05-11T16:24:28Z

ConfigSpace/c_util.pyx

-    disabled = []
+    # We maintain to_visit as a minimum heap of indices of hyperparameters that may need to be updated.
+    # We assume that the hyperparameters are sorted topologically by their index.
+    to_visit = [index]


We use a minimum heap (sorted by HP index) instead of a deque. Since the HP indices form a topological order with respect to the condition graph (a directed acyclic graph) and since we process the HPs in strictly increasing order by index, when we process a HP, we know that all of its parents that needed processing have been processed already. A minimum heap is a data structure that allows us to ensure the topological order of processing with little computational overhead.
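The heap-based traversal can be sketched in plain Python as follows. This is a simplified illustration under assumptions, not the Cython code of `change_hp_value`: the function `change_value`, the `children`, `conditions`, and `defaults` structures, and the toy search space are all hypothetical. NaN marks an inactive HP, and indices are assumed to be topologically ordered.

```python
import heapq

import numpy as np


def change_value(vector, index, new_value, children, conditions, defaults):
    """Set vector[index] and re-evaluate its descendants in increasing index order."""
    vector = vector.copy()
    vector[index] = new_value
    to_visit = [index]  # minimum heap of HP indices
    scheduled = np.zeros(len(vector), dtype=bool)  # avoid scheduling a HP twice
    scheduled[index] = True
    while to_visit:
        current = heapq.heappop(to_visit)  # smallest index first
        if current != index:
            # All parents with smaller indices have been settled by now.
            active = conditions[current](vector)
            if active and np.isnan(vector[current]):
                vector[current] = defaults[current]  # newly activated
            elif not active:
                vector[current] = np.nan  # deactivated
        # Conservatively re-check every child of a visited HP.
        for child in children.get(current, ()):
            if not scheduled[child]:
                scheduled[child] = True
                heapq.heappush(to_visit, child)
    return vector


# HP 0 and HP 2 are unconditional; HP 1 needs HP 0 == 1;
# HP 3 needs (HP 1 == 0) OR (HP 2 == 0).
children = {0: [1], 1: [3], 2: [3]}
conditions = {
    1: lambda v: v[0] == 1.0,
    3: lambda v: v[1] == 0.0 or v[2] == 0.0,
}
defaults = np.zeros(4)

start = np.array([1.0, 0.0, 0.0, 0.0])
result = change_value(start, 0, 0.0, children, conditions, defaults)
```

In this example, changing HP 0 deactivates HP 1, but HP 3 stays active because its disjunctive condition is still satisfied through HP 2. Disabling all descendants of HP 0 unconditionally would have deactivated HP 3 as well.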

filipbartek · 2022-05-11T16:27:08Z

ConfigSpace/c_util.pyx

+    to_visit = [index]
+
+    # Since one hyperparameter may be reachable in several ways, we need to make sure we don't process it twice.
+    scheduled = np.zeros(len(configuration_space), dtype=bool)


scheduled is a replacement for visited. While visited is a set of HP names, scheduled is a boolean vector representation of a set of HPs, where each HP is identified by its index. I suspect that this change is a premature optimization.

filipbartek · 2022-05-11T16:33:17Z

ConfigSpace/c_util.pyx

-            for condition in conditions:
-                if not condition._evaluate_vector(configuration_array):
-                    active = False
-                    break


The value of active for the HP current need not be definitive. For example, let's assume that current is being deactivated because one of its parent conditions does not hold. Since we do not process the HPs in topological order, there may be a future iteration of the to_visit loop that activates one of the parents of current. If current is conditioned by an OrConjunction, current may need to be re-activated then. However, we will not get to re-visit and re-activate current afterward because of its membership in visited.
Processing the HPs in topological order resolves this issue.

filipbartek · 2022-05-11T16:38:40Z

ConfigSpace/c_util.pyx

+        if current_idx == index:
+            if not active:
+                raise ValueError(
+                    "Attempting to change the value of the inactive hyperparameter '%s' to '%s'." % (hp_name, hp_value))


As a nice side effect of the code reorganization, we explicitly prevent the caller from changing the value of an inactive HP.

Remove an assertion because there is another assertion at line 261 that covers mostly the same issues and because this one is rather uninformative.

filipbartek · 2022-05-11T17:12:51Z

> [...] I don't immediately understand the large changes from @filipbartek to c_util.correct_sampled_array and c_util.change_hp_value, could you please give some further explanation on what you are doing here? Would it be possible to reproduce the changed behavior in the style of the original algorithms? [...]

To explain the changes in change_hp_value, I have extended the code comments with commit 3b0dbf1 and added some commit comments here on GitHub.
Summary of the new aspects of the implementation of change_hp_value:

We process the HPs in topological order to ensure that the conditions are resolved correctly even if they contain an OrConjunction while each condition only needs to be evaluated once. We use a minimum heap to maintain the HPs that are yet to be visited to ensure the topological order. We rely on the assumption that the HP indices order the HPs topologically.

When I modified the function, I attempted to keep the modification rather conservative, in particular by keeping variable names and semantics where possible. However, I could still stay closer to the original implementation. I can try to undo some of the changes (namely switching from the set visited to the bitvector scheduled, dropping the unused variable activated_values, and preventing code duplication by introducing update) to concentrate the patch on the crucial parts. Would you prefer that, @mfeurer?

I agree with merging #219 first.

I will try to document the changes in correct_sampled_array and respond to the other concerns you raised later.

mfeurer · 2022-06-08T09:51:21Z

Hi @filipbartek, thank you very much for your explanation. As discussed, I just merged #219. Could you please rebase your PR? I will then check again how to proceed.

filipbartek · 2022-10-18T09:17:30Z

I am trying to minimize the changes introduced by this PR. 3 functions are affected:

c_util.correct_sampled_array
c_util.change_hp_value
configuration_space.ConfigSpace.get_active_hyperparameters

I believe each of these is broken in a distinct way and can be fixed independently. Breaking the fix down might further facilitate the inspection of the changes. @mfeurer, would you prefer to have 3 PRs for these 3 fixes, or does breaking them down into sets of commits suffice?

mfeurer · 2023-01-05T16:39:01Z

Hi @filipbartek, a single PR per problem would be appreciated, as this will make reviewing and merging as simple and quick as possible. Please excuse the delay; I've been on a longer break after finishing my PhD.

eddiebergman · 2024-07-12T13:47:39Z

Sorry to close this after such a long period of inactivity. I did a major rework in #346, and the code around this has completely changed. It has also been benchmarked to be as fast as or faster than the existing solution. I'm not sure if this closes the existing issue #194 though, I will have to take a deeper look.

filipbartek · 2024-07-12T14:59:42Z

Thanks for the update, @eddiebergman!

> I'm not sure if this closes the existing issue #194 though, I will have to take a deeper look

Consider scavenging the tests from this pull request.

filipbartek added 5 commits September 15, 2021 16:58

Make a test consistent with the current behavior

51c2fba

of `get_one_exchange_neighborhood`. Update the hard-coded expected values.

Add a test searchspace that uses a disjunctive condition

5c5a683

The PCS contains a conditional dependency with the operator "||", which is interpreted as `OrConjunction`.

Merge remote-tracking branch 'origin/master' into issues/194

d1d457e

filipbartek marked this pull request as ready for review September 17, 2021 12:29

Remove obsolete cdef type declarations in change_hp_value

21b553c

filipbartek mentioned this pull request Sep 17, 2021

One-exchange neighborhood may raise ValueError non-deterministically #194

Open

filipbartek added 7 commits January 25, 2022 13:15

Merge branch 'master' into issues/194

61007b0

Only bypass full condition evaluation if all parents are inactive

9e4c3ee

Since the parents may be nested in a hierarchy of OrConjunction and AndConjunction, we may only infer inactivity without inspecting the whole hierarchy when all the parents are inactive.

Test that the hyperparameters are ordered topologically

1d87585

Test that an exception is raised when attempting to change inactive HP value

3e7254a

Shorten long lines

1d28633

Break lines that are longer than 100 characters.

filipbartek changed the title ~~Stop generating invalid neighbors~~ Generate neighbors and sample configurations correctly with deeply nested conditions Jan 26, 2022

filipbartek added 3 commits May 11, 2022 18:59

Remove an assertion

e50ad16

because there is another assertion at line 261 that covers mostly the same issues and because this one is rather uninformative.

Add an assertion

3d1a9b9

Improve code documentation of change_hp_value

3b0dbf1

mfeurer mentioned this pull request Apr 25, 2023

Nested Conditions result in incorrect deactivation #253

Open

eddiebergman closed this Jul 12, 2024
