TSL memory layouts #47

jorendumoulin · 2023-12-21T15:26:47Z

TSL Memory Layouts

Rationale

A TSL (tiled-strided-layout) memory layout is an MLIR attribute, designed to be used as the layout parameter for a memref type. A TSL layout tiles the data and defines a stride for every tile, allowing for flexible memory layouts especially suited for hardware accelerators. This layout adds tiling to the existing StridedLayoutAttr. While the AffineMapLayoutAttr allows for a tiled layout, the representation is not always clear, and more importantly does not allow for non-contiguity, which may be required to maximally exploit the full bandwidth of the memory.

Notation

We use the following notation for TSL attributes (shown for a 2D matrix with one level of tiling), where the bounds and strides are ordered from outermost to innermost:

[bound, bound] -> (stride, stride), [bound, bound] -> (stride, stride)

Consider the following memory layout.
The image represents an 8x8 matrix, where every number is the memory address at which that element is stored.

<img src="https://github.com/KULeuven-MICAS/snax-mlir/assets/47864363/6d03debe-888e-4e5f-82c2-040434bc1f99" width="400">

In both dimensions, the data is split into 2 tiles of size 4. This information is represented by the tiling bounds:

[2, 4] -> (stride, stride), [2, 4] -> (stride, stride)

For the first dimension there is a stride within the tile of 4 and across tiles of 32:

[2, 4] -> (32, 4), [2, 4] -> (stride, stride)

For the second dimension there is a stride within the tile of 1 and across tiles of 16:

[2, 4] -> (32, 4), [2, 4] -> (16, 1)

Additionally, the full TSL layout attribute can also include a base memory offset:

#tsl.tsl<[2, 4] -> (32, 4), [2, 4] -> (16, 1), offset: 5>

When no offset is defined, it is assumed to be 0.
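To make the notation concrete, here is a minimal sketch of how a logical index could be mapped to a memory address under such a layout. This is not the code from this PR: `tsl_address`, `BOUNDS` and `STRIDES` are hypothetical names, and the layout is passed as plain Python tuples rather than as the TSL attribute.

```python
def tsl_address(index, bounds, strides, offset=0):
    """Map a logical index to a memory address under a tiled-strided layout.

    index   -- one logical coordinate per dimension, e.g. (row, col)
    bounds  -- per dimension, tile bounds ordered outermost -> innermost
    strides -- per dimension, strides matching `bounds`
    offset  -- base memory offset, 0 when not given
    """
    address = offset
    for coord, dim_bounds, dim_strides in zip(index, bounds, strides):
        # Peel off the innermost tile coordinate first, then move outwards.
        for bound, stride in zip(reversed(dim_bounds), reversed(dim_strides)):
            coord, tile_coord = divmod(coord, bound)
            address += tile_coord * stride
    return address


# Example layout from above: [2, 4] -> (32, 4), [2, 4] -> (16, 1)
BOUNDS = [(2, 4), (2, 4)]
STRIDES = [(32, 4), (16, 1)]

for row in range(8):
    print([tsl_address((row, col), BOUNDS, STRIDES) for col in range(8)])
```

For the example layout, the first matrix row maps to addresses 0, 1, 2, 3, 16, 17, 18, 19, and passing `offset=5` shifts every address by 5.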

Dynamic Sizes

The layout provided allows for some flexibility in defining dynamic shapes within a matrix:

#tsl.tsl<[?, 4] -> (32, 4), [?, 4] -> (?, 1)>

The key point is that only the outermost tile is allowed to have dynamic sizes; the sizes and strides of the inner tiles must remain fixed. In the example, the tile size is fixed at 4x4, with inner strides of 4 and 1. There is also one fixed outer stride of 32, so the tiles are spaced at intervals of 32 along the first dimension. How the remaining dynamic strides are computed once the full matrix dimensions are known has not been decided yet. A likely approach is to fill them in densely from left to right.

For example, if dealing with a 64x64 matrix, the layout would be adjusted accordingly:

#tsl.tsl<[16, 4] -> (32, 4), [16, 4] -> (?, 1)>

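Here, the missing stride is calculated as 32 x 16 = 512: the 16 tiles along the first dimension are spaced 32 apart, so they span 512 addresses before the next tile along the second dimension can start. This keeps the dynamic shapes consistent with the fixed tile sizes and strides while accommodating the overall matrix dimensions.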
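The left-to-right approach can be sketched as follows. Since the strategy is not settled, treat this purely as an illustration: `resolve_dynamic_layout` is a hypothetical helper, `None` stands in for `?`, and it assumes a single level of tiling where only the outer bound and outer stride of a dimension may be dynamic.

```python
def resolve_dynamic_layout(shape, bounds, strides):
    """Fill in dynamic (None) outer bounds and strides for a known shape.

    shape   -- full size of every dimension, e.g. (64, 64)
    bounds  -- per dimension, (outer_bound, inner_bound)
    strides -- per dimension, (outer_stride, inner_stride)
    """
    resolved_bounds, resolved_strides = [], []
    next_stride = None  # address span covered by the dimensions seen so far
    for size, (outer_bound, inner_bound), (outer_stride, inner_stride) in zip(
        shape, bounds, strides
    ):
        if outer_bound is None:
            if size % inner_bound != 0:
                raise ValueError("dimension size must be a multiple of the tile size")
            outer_bound = size // inner_bound
        if outer_stride is None:
            if next_stride is None:
                raise ValueError("no earlier dimension to derive the stride from")
            # Dense choice: start right after the span of the previous dimension.
            outer_stride = next_stride
        next_stride = outer_stride * outer_bound
        resolved_bounds.append((outer_bound, inner_bound))
        resolved_strides.append((outer_stride, inner_stride))
    return resolved_bounds, resolved_strides


# [?, 4] -> (32, 4), [?, 4] -> (?, 1) applied to a 64x64 matrix
print(resolve_dynamic_layout((64, 64), [(None, 4), (None, 4)], [(32, 4), (None, 1)]))
```

For the 64x64 case this resolves both outer bounds to 16 and the missing stride to 512, matching the calculation above.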

compiler/ir/tsl/tiled_stride.py

compiler/ir/tsl/tiled_strided_layout.py

compiler/transforms/set_memory_space.py

tests/filecheck/dialects/tsl.mlir

JosseVanDelm

I don't fully understand everything I'm afraid, can we discuss offline?

Cool PR!

JosseVanDelm · 2024-01-09T13:26:05Z

tests/ir/tsl/test_tiled_strided_layout.py

Wondering whether we should include negative tests for the parser here

compiler/ir/tsl/README.md

compiler/ir/tsl/stride.py

JosseVanDelm · 2024-01-10T14:17:22Z

tests/ir/tsl/test_tiled_strided_layout.py

+    assert lccb2[1].stride == 4
+    assert lccb2[1].bound == 4
+
+    lccb3 = tsl1.largest_common_contiguous_block(tsl3)


What happens if there's a dynamic tile size in there?

It still works, this case is included in the tests!

It just stops searching for a larger contiguous block as soon as it hits a dynamic shape
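To illustrate that stopping behaviour, here is a rough standalone sketch. It is not the implementation in tiled_strided_layout.py: `common_contiguous_block_sketch` is a hypothetical function, layouts are flattened to `(dim, bound, stride)` tuples with `None` for dynamic values, and it assumes that a dynamic entry is excluded from the result.

```python
def common_contiguous_block_sketch(layout_a, layout_b):
    """Return the entries that both layouts share as one contiguous block.

    Each layout is a list of (dim, bound, stride) tuples, where None marks
    a dynamic bound or stride.
    """
    block = []
    expected_stride = 1  # a contiguous block starts at the unit stride
    while True:
        entry_a = next(
            (e for e in layout_a if e[2] == expected_stride and e not in block), None
        )
        entry_b = next(
            (e for e in layout_b if e[2] == expected_stride and e not in block), None
        )
        # Stop when a layout has no entry at this stride or the entries differ.
        if entry_a is None or entry_a != entry_b:
            break
        # Stop at a dynamic bound: the next contiguous stride is unknown.
        if entry_a[1] is None:
            break
        block.append(entry_a)
        expected_stride *= entry_a[1]
    return block


# [2, 4] -> (32, 4), [2, 4] -> (16, 1), flattened to (dim, bound, stride)
tiled = [(0, 2, 32), (0, 4, 4), (1, 2, 16), (1, 4, 1)]
# [?, 4] -> (32, 4), [?, 4] -> (?, 1)
dynamic = [(0, None, 32), (0, 4, 4), (1, None, None), (1, 4, 1)]

print(common_contiguous_block_sketch(tiled, tiled))
print(common_contiguous_block_sketch(dynamic, dynamic))
```

With the dynamic layout, the search finds the fixed 4x4 tile (strides 1 and 4) and then stops, because no entry with a known stride of 16 follows.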

tests/ir/tsl/test_tiled_stride.py

JosseVanDelm

Few comments still 😄

tests/filecheck/dialects/tsl.mlir

compiler/ir/tsl/stride.py

compiler/ir/tsl/tiled_stride.py

compiler/ir/tsl/tiled_strided_layout.py

JosseVanDelm · 2024-01-15T09:42:53Z

compiler/parser/tsl_parser.py

+        if len(strides) != len(bounds):
+            raise ParseError("Expected same number of strides and bounds")


Please test these invariants

compiler/ir/tsl/tiled_strided_layout.py

jorendumoulin · 2024-01-16T10:07:43Z

I resolved all your comments!
Only the negative parsing checks come with an issue:
I included some negative tests, but as you can see, they always check for the line Expected '>' instead of the actual error thrown.

xDSL parses arguments enclosed in angle brackets (<attr>) as follows:

def in_angle_brackets(self):
    self.parse_punctuation("<")
    try:
        yield
    finally:
        self.parse_punctuation(">")

While attr is being parsed, my own error is thrown. The finally clause then still tries to parse >, but attr has not been fully parsed at that point, so a second error is thrown on top of mine and the output looks something like this:

blablabla
Error thrown: my own error
blablabla
...
Error thrown: Expected >

However, when testing with parsing_diagnostics, only the last line of the error is printed, and I cannot check for my own errors.

  %0 = "test.op"() : () -> memref<64x64xindex, #tsl.tsl<[a, b] -> (8, 1), [16, 4] -> (256, 64), offset: 5>, 2 : i32>
                                                         ^
                                                         Expected '>'
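The masking can be reproduced outside xDSL with a small toy parser. Everything below is a hypothetical stand-in (`ToyParser`, this `ParseError`, `parse_bound`, `parse_and_catch`), not xDSL's API; it only shows that the error raised in the `finally` clause is the one that surfaces, while the original error stays reachable through `__context__`.

```python
from contextlib import contextmanager


class ParseError(Exception):
    """Toy stand-in for a parser error type."""


class ToyParser:
    def __init__(self, text):
        self.text = text
        self.pos = 0

    def parse_punctuation(self, punct):
        if not self.text.startswith(punct, self.pos):
            raise ParseError(f"Expected '{punct}'")
        self.pos += len(punct)

    @contextmanager
    def in_angle_brackets(self):
        self.parse_punctuation("<")
        try:
            yield
        finally:
            # Runs even when the body failed, so it can raise a second error.
            self.parse_punctuation(">")

    def parse_bound(self):
        with self.in_angle_brackets():
            if not self.text[self.pos : self.pos + 1].isdigit():
                raise ParseError("Expected integer bound")
            value = int(self.text[self.pos])
            self.pos += 1
        return value


def parse_and_catch(text):
    """Return the parsed value, or the ParseError that escaped the parser."""
    try:
        return ToyParser(text).parse_bound()
    except ParseError as err:
        return err


err = parse_and_catch("<a>")
print("reported:", err)
print("original:", err.__context__)
```

A test helper that only looks at the last reported message sees Expected '>', which is the behaviour described above. Checking the chained exception is one possible way around it, assuming the test harness exposes the exception object.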

JosseVanDelm · 2024-01-16T11:50:02Z

🎉

* add tsl layout * add dialect implementation * remove old files * remove old files * re-enable python tests * add ir implementation * add parser * delete old tests * add simple filecheck * undo change * redo change * resolv own comments * add offsets * Add dynamic stride and bound support * add readme * Update README.md * Update README.md * change TSL notation * fix python test * stride is now step but stride is still stride * add starting stride * change constructor ordering * add negative parsing checks

jorendumoulin force-pushed the Joren/memory-layouts branch from b7bc0d1 to 252f7ae Compare December 22, 2023 14:28

jorendumoulin commented Jan 2, 2024


jorendumoulin marked this pull request as ready for review January 4, 2024 09:28

jorendumoulin marked this pull request as draft January 8, 2024 08:23

jorendumoulin force-pushed the Joren/memory-layouts branch from dab0bf8 to aac5de4 Compare January 9, 2024 09:00

jorendumoulin marked this pull request as ready for review January 9, 2024 11:06

jorendumoulin requested a review from JosseVanDelm January 9, 2024 11:07

jorendumoulin force-pushed the Joren/memory-layouts branch from bd89f8e to 1e240f5 Compare January 9, 2024 15:30

JosseVanDelm requested changes Jan 10, 2024


JosseVanDelm reviewed Jan 15, 2024


jorendumoulin requested a review from JosseVanDelm January 16, 2024 10:08

JosseVanDelm approved these changes Jan 16, 2024


jorendumoulin added 17 commits January 16, 2024 12:32

add tsl layout

f49634d

add dialect implementation

d3a5312

remove old files

5f893ba

remove old files

d64de30

re-enable python tests

6ce5811

add ir implementation

b14eaac

add parser

b528132

delete old tests

d30ebe8

add simple filecheck

4b39b5d

undo change

21881b7

redo change

448eda9

resolv own comments

63e54c2

add offsets

b0b0f09

Add dynamic stride and bound support

9561c6a

add readme

a862aff

Update README.md

d440a00

Update README.md

03415a1

jorendumoulin added 6 commits January 16, 2024 12:33

change TSL notation

52a1302

fix python test

273ab0e

stride is now step but stride is still stride

a20fd07

add starting stride

d5c6b7c

change constructor ordering

85144c9

add negative parsing checks

d50a112

jorendumoulin force-pushed the Joren/memory-layouts branch from 76965f8 to d50a112 Compare January 16, 2024 11:34

jorendumoulin merged commit f5e4e72 into main Jan 16, 2024
4 checks passed

jorendumoulin deleted the Joren/memory-layouts branch January 17, 2024 07:37
