
[RFC, WIP]: Remove evaluate kernel #2342

Draft · wants to merge 6 commits into main
Conversation

gpleiss
Member

@gpleiss gpleiss commented May 13, 2023

With KernelLinearOperator defined in the linear operator package, LazyEvaluatedKernelTensor becomes kind of redundant.

This PR attempts to improve the lazy evaluation of kernels in GPyTorch by doing the following:

  • Remove LazyEvaluatedKernelTensor, instead using KernelLinearOperator (which is also used by the KeOps kernels)
  • Remove evaluate_kernel from GPyTorch (it becomes redundant with KernelLinearOperator) and also remove it from the LinearOperator package
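
For concreteness, here is a minimal sketch of what kernel evaluation looks like when it goes through KernelLinearOperator instead of LazyEvaluatedKernelTensor. The constructor arguments mirror the call that appears in the review snippets below (covar_func, num_outputs_per_input); treat this as an illustration rather than the final implementation:

```python
import torch
from linear_operator.operators import KernelLinearOperator
from gpytorch.kernels import RBFKernel

kernel = RBFKernel()
x1 = torch.randn(100, 3)
x2 = torch.randn(80, 3)

# Wrap the kernel's `forward` so the covariance matrix is only materialized
# (in full or in blocks) when a downstream operation actually needs it.
lazy_covar = KernelLinearOperator(
    x1, x2, covar_func=kernel.forward, num_outputs_per_input=(1, 1)
)
print(lazy_covar.shape)  # torch.Size([100, 80])
```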

cc/ @Balandat

@jacobrgardner
Member

Aren't we going to take at least some amount of performance hit here on kernels you don't lazily evaluate when making predictions, because we don't get to do indexing before calling the kernel?

gpytorch/kernels/kernel.py

if len(named_parameters):
    param_names, params = zip(*named_parameters)
    # Assumes every registered parameter carries the kernel's full batch shape explicitly.
    param_batch_shapes = [self.batch_shape] * len(params)
Collaborator

So being able to use self.batch_shape here relies on the fact that all parameters have a fully explicit shape (rather than broadcasting over them)?

Member Author

Yes. I think this is generally true for all kernels in GPyTorch. We can add something to the documentation to be more explicit about this.
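
For example, the stock kernels already register their parameters with the full batch shape spelled out (quick illustrative check, not part of this diff):

```python
import torch
from gpytorch.kernels import RBFKernel

# The lengthscale is registered with shape (*batch_shape, 1, 1) rather than
# relying on broadcasting up from a smaller shape.
kernel = RBFKernel(batch_shape=torch.Size([3]))
print(kernel.lengthscale.shape)  # torch.Size([3, 1, 1])
```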

else:
    res = KernelLinearOperator(
        x1_, x2_, covar_func=self.forward, num_outputs_per_input=num_outputs_per_input, **kwargs
    )
Collaborator

We're pretty deep in the indentation here; it might make sense to pull some of the above out into helper functions to make it clearer what's going on / easier to read the code.
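
Something along these lines, say (the helper name is hypothetical, just to illustrate the suggestion):

```python
def _kernel_linear_operator_params(self):
    # Hypothetical helper: gather the kernel's named parameters and their
    # (fully explicit) batch shapes, so __call__ stays flat and readable.
    named_parameters = list(self.named_parameters())
    if not named_parameters:
        return (), (), ()
    param_names, params = zip(*named_parameters)
    param_batch_shapes = [self.batch_shape] * len(params)
    return param_names, params, param_batch_shapes
```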

@@ -75,7 +68,7 @@ def __init__(
         outputscale_constraint = Positive()

         self.base_kernel = base_kernel
-        outputscale = torch.zeros(*self.batch_shape) if len(self.batch_shape) else torch.tensor(0.0)
+        outputscale = torch.zeros(*self.batch_shape, 1, 1) if len(self.batch_shape) else torch.zeros(1, 1)
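
(For context, the trailing singleton dimensions presumably exist so that the outputscale broadcasts directly against the (..., n, n) covariance without an unsqueeze in forward; a quick illustration of that broadcasting:)

```python
import torch

batch_shape, n = torch.Size([3]), 5
outputscale = torch.zeros(*batch_shape, 1, 1)  # new shape: (3, 1, 1)
covar = torch.randn(*batch_shape, n, n)        # (3, 5, 5)
scaled = outputscale.exp() * covar             # broadcasts to (3, 5, 5) with no reshaping
print(scaled.shape)  # torch.Size([3, 5, 5])
```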
Collaborator

Changing the parameter shapes for widely used kernels like this will likely result in a bunch of downstream backward compatibility issues. I'm not necessarily suggesting not to do this, but those changes will need to be properly documented / announced.

Collaborator

btw, if we're doing this, we should probably combine it with updating the shapes of the priors (cf. #1317) to make all of that consistent

Member Author

I'm going back and forth about changing parameter shapes. On one hand, it could be quite the breaking change. On the other hand, it would create more consistency, and we should be able to dramatically simplify many kernels as well.

@Balandat thoughts? I imagine this would have the biggest impact on BoTorch.

Member Author

Actually, what probably makes the most sense is leaving kernel parameter shapes the way they are currently, and potentially creating consistent parameter shapes and addressing #1317 in a separate PR.

Collaborator

Makes sense. I think fixing the parameter shape inconsistencies and making things consistent across the board would be good, and hopefully this would also get rid of some long-standing bugs/issues. This would impact botorch, but if we coordinate on the releases we can make sure that the effect of this is minimized. It would likely generate some short-term pain for some power users (of both gpytorch and botorch) with custom setups, but I think that short-term pain is probably worth the long-term gains from consistency.

Member Author

@Balandat That sounds like a good plan. Here's what I'm thinking as a course of action:

After all of that, we make a major release, timed with a BoTorch release.

@gpleiss
Member Author

gpleiss commented May 25, 2023

> Aren't we going to take at least some amount of performance hit here on kernels you don't lazily evaluate when making predictions, because we don't get to do indexing before calling the kernel?

@jacobrgardner I don't think it'll lead to a loss in performance. This PR mostly merges LazyEvaluatedKernelTensor and KeOpsLinearOperator, since they share most of the same functionality.

**Note**: This is not a breaking change; "legacy" grids were deprecated
pre v1.0.
…Kernel

- The functionality of both kernels has not disappeared, but both
  kernels cannot work without the last_dim_is_batch option.
- The examples/00_Basic_Usage/kernels_with_additive_or_product_structure.ipynb
  notebook describes how to replicate the functionality of both kernels
  without last_dim_is_batch (see the sketch below).
- The functionality of this kernel has not disappeared, but this
  kernel cannot work without the last_dim_is_batch option.
- The examples/00_Basic_Usage/kernels_with_additive_or_product_structure.ipynb
  notebook describes how to replicate the functionality of this kernel
  using the gpytorch.utils.sum_interaction_terms utility.
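
The notebook is the authoritative recipe, but roughly, the same additive/product structure can be recovered with plain kernel composition over active_dims instead of last_dim_is_batch (minimal sketch under that assumption):

```python
import operator
from functools import reduce

import torch
from gpytorch.kernels import RBFKernel

x = torch.randn(100, 3)

# One base kernel per input dimension, each restricted via active_dims.
per_dim = [RBFKernel(active_dims=[i]) for i in range(x.shape[-1])]

# Summing kernels builds an AdditiveKernel; multiplying builds a ProductKernel.
additive_kernel = reduce(operator.add, per_dim)
product_kernel = reduce(operator.mul, per_dim)

print(additive_kernel(x, x).shape)  # torch.Size([100, 100])
print(product_kernel(x, x).shape)   # torch.Size([100, 100])
```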