Scalar estimators allow for the reduction over many output values #215

Merged: 7 commits into master from scalar_estimation_from_vectors, Nov 15, 2024

Conversation

josephdviviano (Collaborator)

The output of the nn.Module does not need to be a scalar, because the Estimator will apply a reduction to the final output if required.

I am not in love with the current code organization of modules.py -- there is some duplication, but I am thinking that a bigger refactoring effort might be en route and perhaps we should wait to optimize. Open to feedback on this!

josephdviviano self-assigned this on Nov 13, 2024
younik (Collaborator) left a comment


Looks good! I added a comment for possible refactoring.

Comment on lines 397 to 401
reduction_fxns = {
    "mean": torch.mean,
    "sum": torch.sum,
    "prod": torch.prod,
}
Collaborator


Here you can use the global constant, if you follow the previous comment.
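
For context, a one-line sketch of what this spot might look like once the dict is hoisted to a module-level constant as proposed in the thread below (the `REDUCTION_FXNS` and `reduction` names are illustrative):

```python
# Assuming a module-level REDUCTION_FXNS constant (see the next thread):
reduction_fxn = REDUCTION_FXNS[reduction]
```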

Collaborator Author


Thanks - I've done this

Comment on lines 168 to 172
reduction_fxns = {
    "mean": torch.mean,
    "sum": torch.sum,
    "prod": torch.prod,
}
Collaborator


This is a constant; by convention it should go outside, at module level, with an upper-case name.
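
A minimal sketch of the suggested refactor, hoisting the dict to module level (the constant name is an assumption, not the merged code):

```python
import torch

# Module-level constant, per the review suggestion (name assumed).
REDUCTION_FXNS = {
    "mean": torch.mean,
    "sum": torch.sum,
    "prod": torch.prod,
}
```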

hyeok9855 (Collaborator) left a comment


Looks good! I left a few questions and comments below.

Comment on lines 147 to 157
Attributes:
    preprocessor: Preprocessor object that transforms raw States objects to tensors
        that can be used as input to the module. Optional, defaults to
        `IdentityPreprocessor`.
    module: The module to use. If the module is a Tabular module (from
        `gfn.utils.modules`), then the environment preprocessor needs to be an
        `EnumPreprocessor`.
    preprocessor: Preprocessor from the environment.
    _output_dim_is_checked: Flag for tracking whether the output dimensions of
        the states (after being preprocessed and transformed by the modules) have
        been verified.
Collaborator


This needs to be updated accordingly, e.g., add `is_backward` and `reduction` and remove `_output_dim_is_checked`.
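
A hedged sketch of what the updated attribute list might look like; the `is_backward` and `reduction` descriptions below are assumptions, not the merged wording:

```python
Attributes:
    module: The module to use. If the module is a Tabular module (from
        `gfn.utils.modules`), then the environment preprocessor needs to be an
        `EnumPreprocessor`.
    preprocessor: Preprocessor from the environment.
    is_backward: Whether the estimator models the backward policy (assumed).
    reduction: The reduction applied to the module's output (assumed).
```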

Comment on lines 366 to 378
Attributes:
    preprocessor: Preprocessor object that transforms raw States objects to tensors
        that can be used as input to the module. Optional, defaults to
        `IdentityPreprocessor`.
    module: The module to use. If the module is a Tabular module (from
        `gfn.utils.modules`), then the environment preprocessor needs to be an
        `EnumPreprocessor`.
    preprocessor: Preprocessor from the environment.
    reduction_fxn: the selected torch reduction operation.
    _output_dim_is_checked: Flag for tracking whether the output dimensions of
        the states (after being preprocessed and transformed by the modules) have
        been verified.
"""
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

DITTO

Collaborator Author


thanks :)

@@ -134,9 +134,71 @@ def to_probability_distribution(


class ScalarEstimator(GFNModule):
    r"""Class for estimating scalars such as LogZ.
Collaborator


Note that logZ for unconditional TB is usually modeled with a single learnable parameter (nn.Parameter).
Should we consider modifying ScalarEstimator to support this kind of behavior?

Collaborator Author


The GFNs themselves support this directly (you do not need to pass an estimator at all, instead you just pass a float for Z).
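
For reference, a minimal sketch of that direct route, modeling logZ as a single learnable parameter rather than an estimator (the separate learning rate is illustrative, not the library's default):

```python
import torch
import torch.nn as nn

# A single learnable scalar for logZ, as is typical for unconditional TB.
logZ = nn.Parameter(torch.zeros(1))

# logZ is often given its own (larger) learning rate than the policy networks.
optimizer = torch.optim.Adam([{"params": [logZ], "lr": 1e-1}])
```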

Collaborator


This comment is because of "such as LogZ" in the docstring!

Collaborator Author


I'm not entirely sure what would be most clear here but I'm open to suggestions.

Collaborator


Why not just say "state flow functions of DB/SubTB" instead?

Comment on lines 38 to 42
) -> Tuple[
    Actions,
    torch.Tensor | None,
    torch.Tensor | None,
]:
Collaborator


Removing the trailing `,` will make this one line.
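
That is, presumably:

```python
) -> Tuple[Actions, torch.Tensor | None, torch.Tensor | None]:
```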

def expected_output_dim(self) -> int:
    return 1

def forward(self, input: States | torch.Tensor) -> torch.Tensor:
Collaborator


In which case is the input a `torch.Tensor`?

Collaborator Author


Yes, I was looking at this and I'm not entirely sure. It might be the conditioning case: we currently don't have any sort of container for conditioning, so it is done with a raw Tensor.

Collaborator


Oh, it should be conditioning (e.g., conditional log Z(c)).
However, it might be a bit confusing whether to use ConditionalScalarEstimator or ScalarEstimator to model log Z(c).

Collaborator Author


Well, of note, ScalarEstimators are used for more than just logZ, but in this case I see it like this:

  • LogZ can be a single parameter.
  • LogZ can be estimated using a neural network - in this case, the output of the network can actually be multiple items that are averaged over.
  • LogZ can be conditionally estimated using a neural network - in this case, the output of the network can actually be multiple items that are averaged over.

From an optimization POV, sometimes having logZ only be estimated by a single parameter can cause problems (i.e., the gradients push the number around a lot), so using a neural network helps.

I agree we could make it clearer though -- I am open to suggestions.
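
To make the middle two cases concrete, a minimal sketch of a network whose multi-valued output is reduced to a scalar logZ (shapes and names are illustrative, not the library's API):

```python
import torch
import torch.nn as nn

REDUCTION_FXNS = {"mean": torch.mean, "sum": torch.sum, "prod": torch.prod}

# A network whose raw output is not a scalar: 16 values per input.
net = nn.Sequential(nn.Linear(8, 64), nn.ReLU(), nn.Linear(64, 16))

x = torch.randn(32, 8)    # batch of 32 preprocessed states
out = net(x)              # shape (32, 16): many output values per state
logZ = REDUCTION_FXNS["mean"](out, dim=-1, keepdim=True)  # shape (32, 1)
```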

Collaborator Author


ConditionalScalarEstimator is used to take in both the State and the Conditioning, i.e., it's a two-headed estimator. I think this is the normal conditioning case.
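
For illustration, a hedged sketch of such a two-headed module, taking both state features and a conditioning tensor (the class and layer names are made up for this example):

```python
import torch
import torch.nn as nn

class TwoHeadedScalarModule(nn.Module):
    """Illustrative two-headed module: one trunk per input, merged to a scalar."""

    def __init__(self, state_dim: int, cond_dim: int, hidden: int = 64):
        super().__init__()
        self.state_trunk = nn.Sequential(nn.Linear(state_dim, hidden), nn.ReLU())
        self.cond_trunk = nn.Sequential(nn.Linear(cond_dim, hidden), nn.ReLU())
        self.head = nn.Linear(2 * hidden, 1)

    def forward(self, states: torch.Tensor, conditioning: torch.Tensor) -> torch.Tensor:
        # Encode each input separately, then merge for a single scalar output,
        # e.g., a conditional log Z(c) per batch element.
        merged = torch.cat([self.state_trunk(states), self.cond_trunk(conditioning)], dim=-1)
        return self.head(merged)
```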

hyeok9855 (Collaborator) left a comment


Approved!

josephdviviano merged commit 8259a21 into master on Nov 15, 2024
4 checks passed
josephdviviano deleted the scalar_estimation_from_vectors branch on Nov 15, 2024 at 00:52
josephdviviano restored the scalar_estimation_from_vectors branch on Nov 15, 2024 at 00:53
josephdviviano deleted the scalar_estimation_from_vectors branch on Nov 15, 2024 at 00:53