cuda-reduce-many: Optimization, including parallel reduction, for the W-SLDA Toolkit. For more information, see: https://gitlab.fizyka.pw.edu.pl/wtools/wslda/-/snippets/46