
Re-enable aarch64 package builds #19135

Merged
2 commits merged into iree-org:main from andrzej/re-enable-aarch64-runners on Nov 13, 2024

Conversation

banach-space
Collaborator

I was told that the underlying issue has been resolved.

Signed-off-by: Andrzej Warzynski <[email protected]>
@banach-space banach-space force-pushed the andrzej/re-enable-aarch64-runners branch from a5f81b0 to 001db5a Compare November 13, 2024 11:01
Member

@marbre marbre left a comment


Just a few too many spaces but otherwise looks good.

(Just applied the suggestions via a batch commit, hope that's okay for you @banach-space.)

.github/workflows/build_package.yml (3 review suggestions, now outdated/resolved)
@marbre marbre enabled auto-merge (squash) November 13, 2024 11:13
@marbre marbre merged commit ea03080 into iree-org:main Nov 13, 2024
33 of 36 checks passed
@banach-space
Collaborator Author

Thanks @marbre ! 🤞🏻 this works 😅

@ScottTodd
Member

Thanks! Runners are online again. The build/test job already spotted a regression in one test too 👀 https://github.com/iree-org/iree/actions/runs/11814365861/job/32913288871#step:6:9488

FAILED: tests/e2e/stablehlo_ops/check_llvm-cpu-host_local-task_gather.mlir_module.vmfb /__w/iree/iree/build-arm64/tests/e2e/stablehlo_ops/check_llvm-cpu-host_local-task_gather.mlir_module.vmfb 
cd /__w/iree/iree/build-arm64/tests/e2e/stablehlo_ops && /__w/iree/iree/build-arm64/tools/iree-compile --output-format=vm-bytecode --mlir-print-op-on-diagnostic=false --iree-hal-target-backends=llvm-cpu --iree-input-type=stablehlo --iree-input-demote-f64-to-f32 --iree-llvmcpu-target-cpu=host /__w/iree/iree/tests/e2e/stablehlo_ops/gather.mlir -o check_llvm-cpu-host_local-task_gather.mlir_module.vmfb --iree-hal-executable-object-search-path=\"/__w/iree/iree/build-arm64\" --iree-llvmcpu-embedded-linker-path=\"/__w/iree/iree/build-arm64/llvm-project/bin/lld\" --iree-llvmcpu-wasm-linker-path=\"/__w/iree/iree/build-arm64/llvm-project/bin/lld\"
failed to translate executables
failed to translate executables
/__w/iree/iree/tests/e2e/stablehlo_ops/gather.mlir:92:13: error: 'vector.transfer_read' op inferred mask type ('vector<i1>') and mask operand type ('vector<1x1x4xi1>') don't match
  %result = "stablehlo.gather"(%operand, %start_indices) {
            ^

BTW, a clean revert of #19116 from the GitHub UI may have been easier to make than this manually authored commit.

@banach-space banach-space deleted the andrzej/re-enable-aarch64-runners branch November 13, 2024 16:16
@banach-space
Collaborator Author

banach-space commented Nov 13, 2024

I've not been able to repro and have run out of screen time for today :( Will try again tomorrow.

EDIT

I am able to repro with iree-compile. Looks like the vectorizer is failing. Things seem fine with mlir-opt when we let the vectorizer decide what vector shapes to use. Things break when using the vector shapes implied by IREE:

attrs =  {lowering_config = #iree_codegen.lowering_config<tile_sizes = [[2, 2, 3], [1, 1, 4], [0, 0, 0], [0, 0, 0]]>}

Has the tile-size selection logic been updated recently?

MLIR repro:

func.func @vectorization_test(%extracted_slice : tensor<1x1x3xi32>, %arg0: index, %arg2: index, %3: tensor<2x4xi32>, %4: tensor<1x3x2x4xi32>) -> tensor<1x1x3xi32> {
  %c3 = arith.constant 3 : index
  %c0 = arith.constant 0 : index
  %c1 = arith.constant 1 : index

  %8 = linalg.generic {
    indexing_maps = [affine_map<(d0, d1, d2) -> (d0, d1, d2)>],
    iterator_types = ["parallel", "parallel", "parallel"]}
    outs(%extracted_slice : tensor<1x1x3xi32>) {
  ^bb0(%out: i32):
    %9 = linalg.index 0 : index
    %10 = affine.apply affine_map<(d0, d1) -> (d0 + d1)>(%9, %arg0)
    %11 = linalg.index 1 : index
    %12 = affine.apply affine_map<(d0, d1) -> (d0 + d1)>(%11, %arg2)
    %13 = linalg.index 2 : index
    %extracted = tensor.extract %3[%10, %c0] : tensor<2x4xi32>
    %14 = arith.index_cast %extracted : i32 to index
    %extracted_0 = tensor.extract %3[%10, %c1] : tensor<2x4xi32>
    %15 = arith.index_cast %extracted_0 : i32 to index
    %extracted_1 = tensor.extract %3[%10, %c3] : tensor<2x4xi32>
    %16 = arith.index_cast %extracted_1 : i32 to index
    %17 = arith.maxsi %16, %c0 : index
    %18 = arith.minui %17, %c1 : index
    %19 = arith.maxsi %15, %c0 : index
    %20 = arith.minui %19, %c1 : index
    %21 = arith.maxsi %14, %c0 : index
    %22 = arith.minui %21, %c1 : index
    %23 = arith.addi %18, %12 : index
    %24 = arith.addi %22, %13 : index
    %extracted_2 = tensor.extract %4[%c0, %23, %20, %24] : tensor<1x3x2x4xi32>
    linalg.yield %extracted_2 : i32
  } -> tensor<1x1x3xi32>

  return %8 : tensor<1x1x3xi32>
}

module attributes {transform.with_named_sequence} {
  transform.named_sequence @__transform_main(%arg1: !transform.any_op {transform.readonly}) {
    %0 = transform.structured.match ops{["linalg.generic"]} in %arg1 : (!transform.any_op) -> !transform.any_op
    %1 = transform.get_parent_op %0 {isolated_from_above} : (!transform.any_op) -> !transform.any_op
    // %2 = transform.structured.vectorize_children_and_apply_patterns %1 { vectorize_nd_extract } : (!transform.any_op) -> !transform.any_op
    transform.structured.vectorize %0 vector_sizes [1, 1, 4] {vectorize_nd_extract} : !transform.any_op
    transform.yield
  }
}
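For anyone trying this locally, here is a lit-style run line I would expect to drive the repro with upstream mlir-opt. This is only a sketch: it assumes a recent build where the standalone transform interpreter pass is available and that the IR above is saved as its own .mlir file.

// RUN: mlir-opt --transform-interpreter %s
// With the masked path (transform.structured.vectorize ... vector_sizes [1, 1, 4])
// this is expected to hit the 'inferred mask type' verifier error from the CI log.
// The commented-out vectorize_children_and_apply_patterns line is the unmasked
// path where the vectorizer picks its own shapes and, per the comment above,
// things seem fine.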

@banach-space
Collaborator Author

Here's the offending patch: #19007. Ping @Groverkss :)

@Groverkss
Contributor

Here's the offending patch: #19007. Ping @Groverkss :)

I had a look; this seems to be a bug in the upstream masked vectorization implementation. The transfer_read operation infers the return type differently than vectorization does:

transfer_read: https://github.com/llvm/llvm-project/blob/main/mlir/lib/Dialect/Vector/IR/VectorOps.cpp#L4123
vectorization: https://github.com/llvm/llvm-project/blob/main/mlir/lib/Dialect/Linalg/Transforms/Vectorization.cpp#L1402C18-L1402C31

It looks like vectorization is wrong here and should be using an inverse map instead of the transfer_read indexing map. The mask needs an inverse map.

@Groverkss
Contributor


Actually, looking more closely, that's not where it's coming from. There is a bug in linalg vectorization for vector.transfer_read with broadcast permutation maps when using custom vectorization hooks:

https://github.com/llvm/llvm-project/blob/main/mlir/lib/Dialect/Linalg/Transforms/Vectorization.cpp#L1457

Vectorizing a tensor.extract using a custom vectorization hook creates a transfer_read with a permutation map of (d0, d1) -> (0, 0, 0). This custom hook does not mask it. Masking is done after this hook has run. The masking picks an identity map (wrong!), which does not match the indexing map that should have been used for masking the transfer_read.

The reason that patch uncovered it is that, before, we had a hack: if we saw a rank-0 transfer_read, we would simply turn it into a memref.load/tensor.extract. But that only works for the case where you have a full broadcast. If the custom vectorization hook generates a transfer_read with any indexing map that is not an identity map, this will break.
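To make the failure mode concrete, here is a minimal hand-written sketch of the kind of IR this produces. It is not copied from the repro; %src, %pad and the constants are made up, but the shapes and the permutation map mirror the explanation above and the CI error.

func.func @buggy_masked_read(%src: tensor<2x4xi32>) -> vector<1x1x4xi32> {
  %c0 = arith.constant 0 : index
  %c1 = arith.constant 1 : index
  %c4 = arith.constant 4 : index
  %pad = arith.constant 0 : i32
  // Mask built from the iteration-space shape [1, 1, 4] (the identity-map choice).
  %mask = vector.create_mask %c1, %c1, %c4 : vector<1x1x4xi1>
  // Fully-broadcast permutation map: no source dim varies per lane, so the
  // verifier infers a 0-D mask type 'vector<i1>' and rejects the 3-D mask:
  //   error: 'vector.transfer_read' op inferred mask type ('vector<i1>') and
  //          mask operand type ('vector<1x1x4xi1>') don't match
  %read = vector.transfer_read %src[%c0, %c0], %pad, %mask
      {in_bounds = [true, true, true],
       permutation_map = affine_map<(d0, d1) -> (0, 0, 0)>}
      : tensor<2x4xi32>, vector<1x1x4xi32>
  return %read : vector<1x1x4xi32>
}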

@banach-space
Collaborator Author

Thanks for digging into this!

The reason that patch uncovered it

But your patch didn't touch CPU lowering? Was that an LLVM patch that uncovered this? Do you know which?

@banach-space
Collaborator Author

Created a smaller repro and moved the discussion here: llvm/llvm-project#116197

banach-space added commits to banach-space/iree that referenced this pull request Nov 14, 2024
See iree-org#19135 for a discussion.

Signed-off-by: Andrzej Warzynski <[email protected]>
@banach-space
Collaborator Author

As discussed with @Groverkss offline, #19007 exposes a bug related to masked vectorization. I have a prototype fix for that, but it needs more work/consideration. Sending a revert in the meantime:

Note that the code generated by the vectorizer will still fail verification, but the buggy/problematic part gets folded away by subsequent transformations.

banach-space added a commit to banach-space/llvm-project that referenced this pull request Nov 14, 2024
banach-space added a commit that referenced this pull request Nov 15, 2024
See #19135 for a discussion.

Signed-off-by: Andrzej Warzynski <[email protected]>
Groverkss and kuhar pushed commits to iree-org/llvm-project that referenced this pull request between Nov 15 and Nov 25, 2024