Reduce memory allocation #410

tizianoGuadagnino · 2024-12-04T09:18:04Z

Motivation

After doing some heaptrack analysis in PRBonn/kinematic-icp/pull/22, it turns out that our beloved system performs an insane amount of unnecessary allocations.

This PR

This change can be summarized as "don't let that vector reallocate for each point".

std::vector<Voxel> GetAdjacentVoxels(const Voxel &voxel, int adjacent_voxels = 1) {
    std::vector<Voxel> voxel_neighborhood; // <--- NO RESERVE HAS BEEN CALLED
    for (int i = voxel.x() - adjacent_voxels; i < voxel.x() + adjacent_voxels + 1; ++i) {
        for (int j = voxel.y() - adjacent_voxels; j < voxel.y() + adjacent_voxels + 1; ++j) {
            for (int k = voxel.z() - adjacent_voxels; k < voxel.z() + adjacent_voxels + 1; ++k) {
                voxel_neighborhood.emplace_back(i, j, k);
            }
        }
    }
    return voxel_neighborhood;
}

This code does not preallocate memory and, even worse, the reallocation happens for every point in the scan for which we compute associations. The funny thing is that we know exactly what these voxel offsets look like, so I just precompute them. The resulting drop in the number of allocations is incredible.
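As a rough illustration of the idea (not the exact code in this PR), the sketch below builds the 27 offsets once and then walks the neighborhood of a voxel without any heap allocation. The `Voxel` struct, `VoxelShifts`, and `ForEachNeighbor` are hypothetical names used only to keep the example self-contained; the project's real `Voxel` type differs.

```cpp
#include <array>
#include <cstddef>

// Hypothetical stand-in for the project's Voxel type (assumption for this sketch).
struct Voxel {
    int x, y, z;
};

// The 27 offsets of a 3x3x3 neighborhood, built once on first use.
// std::array lives in static storage, so no heap allocation is involved.
inline const std::array<Voxel, 27> &VoxelShifts() {
    static const std::array<Voxel, 27> shifts = [] {
        std::array<Voxel, 27> output{};
        std::size_t idx = 0;
        for (int i = -1; i <= 1; ++i) {
            for (int j = -1; j <= 1; ++j) {
                for (int k = -1; k <= 1; ++k) {
                    output[idx++] = Voxel{i, j, k};
                }
            }
        }
        return output;
    }();
    return shifts;
}

// Calls `visit` on each of the 27 voxels around (and including) `voxel`.
// Each neighbor is a temporary on the stack; nothing is allocated per point.
template <typename Visitor>
void ForEachNeighbor(const Voxel &voxel, Visitor &&visit) {
    for (const Voxel &shift : VoxelShifts()) {
        visit(Voxel{voxel.x + shift.x, voxel.y + shift.y, voxel.z + shift.z});
    }
}
```

With this shape, the per-point work is a fixed loop over a constant table, so the cost of building the offsets is paid once instead of once per query point.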

Results

Memory allocations

nachovizzo · 2024-12-04T11:35:06Z

cpp/kiss_icp/core/VoxelHashMap.cpp

-    }
-    return voxel_neighborhood;
-}
+static const std::array<Voxel, 27> shifts{


Something I tried moons ago, and also worked fine (mainly style):

const std::array<Voxel, 27> &GetAdjacentVoxels() {
    static const auto ADJACENT_VOXELS = [&]() -> std::array<Voxel, 27> {
        std::array<Voxel, 27> output;
        // clang-format off
        size_t idx = 0;
        for (int i = -1; i <= 1 ; ++i) {
        for (int j = -1; j <= 1 ; ++j) {
        for (int k = -1; k <= 1 ; ++k) {
            output[idx++] = Voxel{i,j,k};
        }}}
        // clang-format on
        return output;
    }();
    return ADJACENT_VOXELS;
}

And down the line:

std::array<Voxel, 27> GetVoxelNeighborhood(const Voxel &voxel) {
    auto voxel_neighborhood = GetAdjacentVoxels();
    for (auto &adjacent_voxel : voxel_neighborhood) adjacent_voxel += voxel;
    return voxel_neighborhood;
}

This is for "more readability", as later we will "search in the neighboring voxels" instead of the shift+voxel.

To me, it looks a bit too complicated for what it should be. I will compile the decision using -Wpedantic @benemer ;)

I don't have a strong opinion on this. I see one advantage with Nacho's solution: it is easier to consider more than one neighboring voxel (in case you need to have a small voxel size but still find nearest neighbors).
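To make the "more than one neighboring voxel" point concrete, here is a minimal sketch of how the offset table could be parameterized on the neighborhood radius. This is an assumption about one possible extension, not something proposed verbatim in this thread; `Voxel` is again a hypothetical stand-in, and `NeighborhoodSize` and `MakeVoxelShifts` are illustrative names.

```cpp
#include <array>
#include <cstddef>

// Hypothetical stand-in for the project's Voxel type (assumption for this sketch).
struct Voxel {
    int x, y, z;
};

// Number of voxels in a cube that extends `Radius` voxels in every direction:
// (2 * Radius + 1)^3, i.e. 27 for Radius = 1 and 125 for Radius = 2.
template <int Radius>
constexpr std::size_t NeighborhoodSize() {
    static_assert(Radius >= 0, "Radius must be non-negative");
    return static_cast<std::size_t>((2 * Radius + 1) * (2 * Radius + 1) * (2 * Radius + 1));
}

// Builds all offsets in [-Radius, Radius]^3. The size is known at compile time,
// so the result is a std::array and no heap allocation takes place.
template <int Radius>
std::array<Voxel, NeighborhoodSize<Radius>()> MakeVoxelShifts() {
    std::array<Voxel, NeighborhoodSize<Radius>()> output{};
    std::size_t idx = 0;
    for (int i = -Radius; i <= Radius; ++i) {
        for (int j = -Radius; j <= Radius; ++j) {
            for (int k = -Radius; k <= Radius; ++k) {
                output[idx++] = Voxel{i, j, k};
            }
        }
    }
    return output;
}

// Computed once during static initialization, then only read.
static const auto voxel_shifts = MakeVoxelShifts<1>();
```

Switching to a wider search would then be a one-character change (`MakeVoxelShifts<2>()`), at the price of 125 lookups per point instead of 27.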

How about a compromise?

static const std::array<Voxel, 27> voxel_shifts = []() {
    std::array<Voxel, 27> output;
    size_t idx = 0;
    for (int i = -1; i <= 1; ++i) {
        for (int j = -1; j <= 1; ++j) {
            for (int k = -1; k <= 1; ++k) {
                output[idx++] = Voxel{i, j, k};
            }
        }
    }
    return output;
}();

This keeps the simplicity but allows better readability and simpler extension to more neighboring voxels.


tizianoGuadagnino added 2 commits December 4, 2024 09:31

Zero additional allocations

1fdd09b

We know this shifts

a741512

tizianoGuadagnino requested review from nachovizzo and benemer as code owners December 4, 2024 09:18

tizianoGuadagnino added the core label Dec 4, 2024

tizianoGuadagnino self-assigned this Dec 4, 2024

nachovizzo reviewed Dec 4, 2024


Revert VoxelHashMap change -> Allocations go in a separate PR

5ae3283

tizianoGuadagnino changed the title ~~Reduce allocations -> Improve runtime~~ Improve runtime by reshaping TBB Data Association Dec 4, 2024

tizianoGuadagnino changed the title ~~Improve runtime by reshaping TBB Data Association~~ Reduce memory allocation Dec 4, 2024

tizianoGuadagnino added 2 commits December 4, 2024 18:56

Revert "Revert VoxelHashMap change -> Allocations go in a separate PR"

4521454

This reverts commit 5ae3283.

Revert concurrent vector change

6e56ac7

tizianoGuadagnino mentioned this pull request Dec 4, 2024

Improve Runtime by reshaping the DataAssociation #411

Open

Some renaming for clarity

a094695

benemer mentioned this pull request Dec 13, 2024

Introduce Bonxai as Map representation PRBonn/kinematic-icp#22

Open
