MPI workarounds for aarch64 #999
Conversation
Thanks. Two remarks:
I moved the fixes to the dedicated src/workarounds/aarch64_mpi.jl file. I did not explicitly report these fixes upstream, but my mention of the original issue in my initial message ensures that this PR appears there for reference.
src/workarounds/aarch64_mpi.jl (outdated)

# Vec3{T} must be cast to Vector{T} before MPI reduction
function mpi_sum!(arr::Vector{Vec3{T}}, comm::MPI.Comm) where{T}
    n = length(arr)
Can this not be solved by a reinterpret(reshape, ...)? Then there is no copy.
I think this one is tricky to get without copies, because SVectors are immutable. So even if I use something like new_arr = reshape(reinterpret(T, arr), 3, :), I still could not call mpi_sum!(). And assuming I call mpi_sum() instead, I also end up with a copy.
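For reference, a minimal self-contained sketch of the reinterpret(reshape, ...) idea being discussed (the array contents are made up for illustration; this is not code from the PR):

using StaticArrays

arr  = [SVector(1.0, 2.0, 3.0), SVector(4.0, 5.0, 6.0)]
flat = reinterpret(reshape, Float64, arr)   # copy-free 3×2 view of the same memory
flat[1, 2] = 40.0                           # writes through to the parent vector
@assert arr[2] == SVector(40.0, 5.0, 6.0)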
src/workarounds/aarch64_mpi.jl (outdated)

# utility function to cast a Dual type to an array containing a value and the partial diffs
function dual_array(dual::ForwardDiff.Dual{T, U, V}) where{T, U, V}
    dual_array = [ForwardDiff.value(dual)]
    append!(dual_array, collect(ForwardDiff.partials(dual)))
I vaguely recall there are again ways to do this without a copy. Check the ForwardDiff rules for array operations (multiplication needs to be applied equally to all partials).
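One possible copy-free route (an assumption on my part, not necessarily the ForwardDiff rule meant above): Dual is a bits type whose storage is the value followed by its partials, so a vector of Duals can be viewed as plain numbers without allocating:

using ForwardDiff: Dual

duals = [Dual(1.0, 2.0, 3.0), Dual(4.0, 5.0, 6.0)]   # Dual{Nothing,Float64,2}
flat  = reinterpret(Float64, duals)   # length-6 view: value, partials, value, partials
@assert flat[1] == 1.0 && flat[4] == 4.0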
src/workarounds/aarch64_mpi.jl (outdated)

end

# utility function that casts back an array to a Dual type, based on a template Dual
function new_dual(dual_array, template)
Again, ForwardDiff may have something here.
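If it helps, a hedged sketch of rebuilding a Dual directly from plain numbers, without an intermediate array (the names and values are illustrative, not from the PR):

using ForwardDiff
using ForwardDiff: Dual, Partials

template = Dual(1.0, 2.0, 3.0)    # Dual{Nothing,Float64,2}
vals     = (10.0, 20.0, 30.0)     # value followed by its partials
rebuilt  = Dual{ForwardDiff.tagtype(template)}(vals[1], Partials(vals[2:end]))
@assert ForwardDiff.value(rebuilt) == 10.0
@assert ForwardDiff.partials(rebuilt) == Partials((20.0, 30.0))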
src/workarounds/aarch64_mpi.jl (outdated)

    mpi_sum!(array, comm)
    offset = 0
    for i in 1:length(dual)
        dual[i] = new_dual(array[offset+1:offset+lengths[i]], dual[i])
Indexing operations make a copy by default. Use views (@views).
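A tiny self-contained illustration of the difference (not from the PR):

a = collect(1.0:6.0)
slice_copy = a[2:4]           # allocates a new Vector
slice_view = @views a[2:4]    # zero-copy SubArray aliasing a
slice_view[1] = 99.0
@assert a[2] == 99.0
@assert slice_copy[1] == 2.0  # the copy is unaffected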
In the last commit, the number of copies and/or temporary arrays in the MPI workarounds for aarch64 has been reduced.
@abussy There seems to be some update in the issue on the MPI.jl side as well. Maybe that helps.
Indeed, since MPI.jl v0.20.22, it is possible to manually register custom operations for MPI on aarch64.
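To illustrate what such a registration might look like (a hedged sketch, not the final DFTK code; the operator and buffer below are made up):

using MPI
using StaticArrays

MPI.Init()
comm = MPI.COMM_WORLD

op  = MPI.Op(+, SVector{3,Float64}; iscommutative=true)  # manually registered custom reduction
buf = [SVector(1.0, 2.0, 3.0)]
MPI.Allreduce!(buf, op, comm)   # element-wise sum across all ranks, in place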
Awesome! Thanks @abussy, let's merge this.
This PR contains MPI workarounds for aarch64. As a result, DFTK can now be run on ARM architecture with full MPI support. These workarounds are meant to be temporary, until there is an upstream fix (whenever that happens; issue JuliaParallel/MPI.jl#404 was already opened 4 years ago).

The procedure was the following: the :mpi tagged tests were run on an ARM machine until a crash occurred, and a set of specific MPI reduction methods was then written for the custom type involved. Some other custom types might pop up in the future; adding a new method for them should be trivial (a sketch of what such a method can look like follows below).

I know this is rather clunky, and it inflates the code size and complexity. However, I think it is crucial to fully support DFTK on ARM architecture, especially now that many modern HPC clusters run on it (e.g. Alps at CSCS).
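For readers of this thread, a hedged sketch of the kind of type-specific reduction method described above (the names follow the snippets quoted in the review; this is not the exact PR code):

using MPI
using StaticArrays
const Vec3{T} = SVector{3, T}

# Flatten to a plain Vector{T}, reduce with the built-in sum, then write back in place.
function mpi_sum!(arr::Vector{Vec3{T}}, comm::MPI.Comm) where {T}
    flat = Vector{T}(undef, 3 * length(arr))
    for (i, v) in enumerate(arr)
        flat[3i-2:3i] .= v
    end
    MPI.Allreduce!(flat, +, comm)
    for i in eachindex(arr)
        arr[i] = Vec3{T}(flat[3i-2], flat[3i-1], flat[3i])
    end
    arr
end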