Make preprocessing and postprocessing consistent across transforms #93

Merged
merged 36 commits into main from fix/keep-ndims-Nd on Jul 1, 2024

Conversation

felixblanke
Collaborator

This addresses #92.

For all discrete transforms, the preprocessing and postprocessing of coefficients and tensors are very similar (e.g. folding and swapping of axes, adding batch dims). This PR moves this functionality into shared functions that use _map_result.

Also, the check for consistent devices and dtypes across coefficient tensors is moved into a function in _utils.

Lastly, as it was possible to add them with a few lines of code, I added the $n$-dimensional fully separable transforms (fswavedecn, fswaverecn). If this is not wanted, I can revert their addition.

Further, I did some minor refactorings along the way.
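
A rough sketch of the shared preprocessing idea (the helper names below are only illustrative, not the final ptwt API):

import torch

def _check_same_device_dtype(tensors: list[torch.Tensor]) -> None:
    # illustrative sketch: all coefficient tensors must share one device and dtype
    ref = tensors[0]
    if any(t.device != ref.device or t.dtype != ref.dtype for t in tensors[1:]):
        raise ValueError("coefficients must share a common device and dtype")

def _preprocess_tensor(data: torch.Tensor, ndim: int, axes: tuple[int, ...]) -> torch.Tensor:
    # illustrative sketch: move the transformed axes to the back and add a batch dim
    if len(axes) != ndim:
        raise ValueError(f"{ndim}D transforms work with {ndim} axes.")
    data = torch.movedim(data, axes, tuple(range(-ndim, 0)))
    if data.dim() == ndim:
        data = data.unsqueeze(0)
    return data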

@felixblanke added the enhancement and invalid labels on Jun 24, 2024
@v0lta
Owner

v0lta commented Jun 25, 2024

I did not add n-dimensional separable transforms on purpose, because I expected people would then ask for these in all other cases too, where they are trickier to deliver.

Review thread on src/ptwt/_util.py (outdated):
    raise ValueError(f"{ndim}D transforms work with {ndim} axes.")
else:
    undo_swap_fn = partial(_undo_swap_axes, axes=axes)
    coeffs = _map_result(coeffs, undo_swap_fn)
Collaborator

I would really advise against all of these _map_result calls - have one function that does the processing that can be reused, then just do list comprehensions for all successive function calls.

coeffs = _map_result(coeffs, lambda x: x.squeeze(0))

will always be less readable and understandable than

coeffs = _map_result(coeffs)
coeffs = [coeff.squeeze(0) for coeff in coeffs]

when _map_result has lots of hidden functionality

Collaborator Author

The snippet

coeffs = _map_result(coeffs, lambda x: x.squeeze(0))

applies the function x.squeeze(0) to all tensors in coeffs. In the 1d case (where coeffs is of type list[Tensor]) this is equivalent to the list comprehension, as you said. However, coeffs might also be

  • (Tensor, dict[str, Tensor], ...)
  • (Tensor, (Tensor, Tensor, Tensor), ...)

So using _map_result allows us to write the function once for all possible coefficient types. Would it perhaps help to rename _map_result or add documentation for it?
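
For illustration, a stripped-down version of the idea (a sketch only, not the actual _map_result implementation):

import torch

def _map_tensors(coeffs, fn):
    # sketch: apply fn to every tensor leaf while keeping the nesting structure
    if isinstance(coeffs, torch.Tensor):
        return fn(coeffs)
    if isinstance(coeffs, dict):
        return {key: _map_tensors(val, fn) for key, val in coeffs.items()}
    if isinstance(coeffs, (list, tuple)):
        return type(coeffs)(_map_tensors(elem, fn) for elem in coeffs)
    return coeffs

# works the same for list[Tensor], (Tensor, dict[str, Tensor], ...),
# and (Tensor, (Tensor, Tensor, Tensor), ...):
# coeffs = _map_tensors(coeffs, lambda x: x.squeeze(0))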

Owner

I think the name should have something to do with tree and map. The new _apply_to_tensor_elems name really hides how general the concept is. I would take a page from https://jax.readthedocs.io/en/latest/_autosummary/jax.tree.map.html#jax.tree.map and also use their type hinting. The concept does not exist in torch, but I think it makes sense here, since we save on a lot of boilerplate code. Perhaps we should include the link and explain what's going on?
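
For reference, jax.tree.map applies a function to every leaf of a nested container while keeping the structure intact, for example:

import jax

nested = {"a": (1.0, 2.0), "b": [3.0]}
jax.tree.map(lambda x: 2 * x, nested)
# {'a': (2.0, 4.0), 'b': [6.0]}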

Owner

Here is an interesting intro discussing the pytree processing philosophy: https://jax.readthedocs.io/en/latest/working-with-pytrees.html .

Owner

I also think @cthoyt has a point since the tree-map concept is not very popular.

Owner

I agree when comparing

coeffs = _map_result(coeffs, partial(torch.squeeze, dim=0))

to

coeffs = [coeff.squeeze(0) for coeff in coeffs]

The list comprehension wins, but what if it's a nested structure?

@felixblanke
Collaborator Author

@v0lta I made the n-dim transform private. Does that work?

@v0lta
Owner

v0lta commented Jun 26, 2024

Yes, that works. However, we won't be able to support n-dimensional transforms across the board, because PyTorch does not provide the interfaces we would need to do that. Padding, for example, only works up to 3D ( https://pytorch.org/docs/stable/generated/torch.nn.functional.pad.html ). We have the same problem with isotropic convolution. So I think we should communicate that nd-transforms are out of the scope of this project.
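
A quick illustration of the limit (based on current PyTorch behaviour; the exact error may vary between versions):

import torch
import torch.nn.functional as F

x3d = torch.randn(1, 1, 8, 8, 8)  # batch, channel, and three signal dimensions
F.pad(x3d, (2, 2, 2, 2, 2, 2), mode="reflect")  # supported: reflect padding up to 3D

x4d = torch.randn(1, 1, 8, 8, 8, 8)  # four signal dimensions
# F.pad(x4d, (2, 2) * 4, mode="reflect")  # not supported, raises an error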

@v0lta
Owner

v0lta commented Jun 26, 2024

In general, I am a big fan of this pull request! Thanks @felixblanke! I am going to clean up the docs for _map_result and commit here.

@v0lta
Owner

v0lta commented Jun 26, 2024

Our coeff_tree_map is not a general tree map, but it does not have to be, since we know the approximation tensor will always be the first entry. I ran the tests that are not marked slow; everything checked out. The code is cleaner now. If everyone is on board, I would be ready to merge.

@felixblanke
Collaborator Author

I think we so far only refer to the Packet data structure as a tree. Maybe we can add a link to the JAX discussion as a reference?

@v0lta
Owner

v0lta commented Jun 26, 2024

I am not sure if users need to know. I think this is more for us here internally. Unlike JAX's tree map, ours is coefficient-specific, hence the proposed coeff_tree_map function name.

@v0lta
Owner

v0lta commented Jun 26, 2024

Actually, never mind. The user argument does not matter since it's a private function. If you think it helps to understand the idea, please add a link. I think it might help potential future contributors.

@v0lta self-assigned this on Jul 1, 2024
@v0lta
Owner

v0lta commented Jul 1, 2024

Okay, I think we are ready to merge. @felixblanke @cthoyt is everyone on board?

@v0lta
Owner

v0lta commented Jul 1, 2024

okay let's merge!

@v0lta merged commit 85b898a into main on Jul 1, 2024
7 checks passed
@v0lta deleted the fix/keep-ndims-Nd branch on July 1, 2024, 09:27