diff --git a/404.html b/404.html
index 17c26da..6945460 100644
--- a/404.html
+++ b/404.html
@@ -1 +1 @@
-<!doctype html> <html lang=en > <meta charset=UTF-8 > <meta name=viewport  content="width=device-width, initial-scale=1"> <link rel=stylesheet  href="/css/franklin.css"> <link rel=stylesheet  href="/css/basic.css"> <link rel=icon  href="/assets/favicon.ico"> <title>404</title> <header> <div class=blog-name ><a href="/"><img src="/assets/logo.png" width=100px ></a></div> <nav> <ul> <li><a href="/">Home</a> <li><a href="https://github.com/trixi-framework" target=_blank  rel="noopener noreferrer">Trixi on GitHub</a> </ul> </nav> </header> <div class=franklin-content > <div style="margin-top: 40px; font-size: 40px; text-align: center;"> <br> <div style="font-weight: bold;"> 404 </div> <br> <br> The requested page was not found <br> <br> <br> <br> <div style="margin-bottom: 300px; font-size: 24px"> <a href="/">Click here</a> to go back to the homepage. </div> </div> <div class=page-foot > <div class=copyright > &copy; <a href="https://github.com/trixi-framework/Trixi.jl/blob/main/AUTHORS.md" target=_blank  rel="noopener noreferrer">The Trixi Authors</a>. Last modified: August 22, 2024. Website built with <a href="https://github.com/tlienart/Franklin.jl">Franklin.jl</a> and the <a href="https://julialang.org">Julia programming language</a>. </div> </div> </div>
\ No newline at end of file
+<!doctype html> <html lang=en > <meta charset=UTF-8 > <meta name=viewport  content="width=device-width, initial-scale=1"> <link rel=stylesheet  href="/css/franklin.css"> <link rel=stylesheet  href="/css/basic.css"> <link rel=icon  href="/assets/favicon.ico"> <title>404</title> <header> <div class=blog-name ><a href="/"><img src="/assets/logo.png" width=100px ></a></div> <nav> <ul> <li><a href="/">Home</a> <li><a href="https://github.com/trixi-framework" target=_blank  rel="noopener noreferrer">Trixi on GitHub</a> </ul> </nav> </header> <div class=franklin-content > <div style="margin-top: 40px; font-size: 40px; text-align: center;"> <br> <div style="font-weight: bold;"> 404 </div> <br> <br> The requested page was not found <br> <br> <br> <br> <div style="margin-bottom: 300px; font-size: 24px"> <a href="/">Click here</a> to go back to the homepage. </div> </div> <div class=page-foot > <div class=copyright > &copy; <a href="https://github.com/trixi-framework/Trixi.jl/blob/main/AUTHORS.md" target=_blank  rel="noopener noreferrer">The Trixi Authors</a>. Last modified: September 04, 2024. Website built with <a href="https://github.com/tlienart/Franklin.jl">Franklin.jl</a> and the <a href="https://julialang.org">Julia programming language</a>. </div> </div> </div>
\ No newline at end of file
diff --git a/Manifest.toml b/Manifest.toml
index 7673e17..3b44432 100644
--- a/Manifest.toml
+++ b/Manifest.toml
@@ -85,9 +85,9 @@ uuid = "b77e0a4c-d291-57a0-90e8-8db25a27a240"
 
 [[JLLWrappers]]
 deps = ["Artifacts", "Preferences"]
-git-tree-sha1 = "7e5d6779a1e09a36db2a7b6cff50942a0a7d0fca"
+git-tree-sha1 = "f389674c99bfcde17dc57454011aa44d5a260a40"
 uuid = "692b3bcd-3c85-4b1f-b108-f13ce0eb3210"
-version = "1.5.0"
+version = "1.6.0"
 
 [[JSON]]
 deps = ["Dates", "Mmap", "Parsers", "Unicode"]
diff --git a/index.html b/index.html
index db2ba07..39a226d 100644
--- a/index.html
+++ b/index.html
@@ -1 +1 @@
-<!doctype html> <html lang=en > <meta charset=UTF-8 > <meta name=viewport  content="width=device-width, initial-scale=1"> <link rel=stylesheet  href="/css/franklin.css"> <link rel=stylesheet  href="/css/basic.css"> <link rel=icon  href="/assets/favicon.ico"> <title>Trixi Framework</title> <header> <div class=blog-name ><a href="/"><img src="/assets/logo.png" width=100px ></a></div> <nav> <ul> <li><a href="/">Home</a> <li><a href="https://github.com/trixi-framework" target=_blank  rel="noopener noreferrer">Trixi on GitHub</a> </ul> </nav> </header> <div class=franklin-content ><h1 id=trixi_framework ><a href="#trixi_framework" class=header-anchor >Trixi Framework</a></h1> <p>The Trixi framework is a collaborative scientific effort to provide open source tools for adaptive high-order numerical simulations of hyperbolic PDEs in Julia. Besides the core algorithms, the framework also includes mesh and visualization tools. Moreover, it includes utilities such as Julia wrappers of mature libraries written in other programming languages.</p> <p>This page gives an overview of the different activities that, together, constitute the Trixi framework on <a href="https://github.com/orgs/trixi-framework">GitHub</a>.</p> <div class=franklin-toc ><ol><li><a href="#adaptive_high-order_numerical_simulations_of_hyperbolic_pdes">Adaptive high-order numerical simulations of hyperbolic PDEs</a><li><a href="#mesh_generation">Mesh generation</a><li><a href="#particle-based_multiphysics_simulations">Particle-based multiphysics simulations</a><li><a href="#additional_packages">Additional packages</a><li><a href="#publications">Publications</a><li><a href="#talks">Talks</a><li><a href="#outreach">Outreach</a><li><a href="#authors">Authors</a><li><a href="#get_in_touch">Get in touch&#33;</a><li><a href="#acknowledgments">Acknowledgments</a></ol></div> <h2 id=adaptive_high-order_numerical_simulations_of_hyperbolic_pdes ><a href="#adaptive_high-order_numerical_simulations_of_hyperbolic_pdes" 
class=header-anchor >Adaptive high-order numerical simulations of hyperbolic PDEs</a></h2> <ul> <li><p><a href="https://github.com/trixi-framework/Trixi.jl"><strong>Trixi.jl</strong></a></p> <p>Adaptive high-order numerical simulations of hyperbolic PDEs in Julia</p> <li><p><a href="https://github.com/trixi-framework/Trixi2Vtk.jl"><strong>Trixi2Vtk.jl</strong></a></p> <p>Convert output files generated with Trixi.jl to VTK</p> <li><p><a href="https://github.com/trixi-framework/libtrixi"><strong>libtrixi</strong></a></p> <p>Use <a href="https://github.com/trixi-framework/Trixi.jl">Trixi.jl</a> from C/C&#43;&#43;/Fortran</p> <li><p><a href="https://github.com/trixi-framework/SmartShockFinder.jl"><strong>SmartShockFinder.jl</strong></a></p> <p>Create troubled cell indicators for Trixi.jl using artificial neural networks</p> </ul> <h2 id=mesh_generation ><a href="#mesh_generation" class=header-anchor >Mesh generation</a></h2> <ul> <li><p><a href="https://github.com/trixi-framework/HOHQMesh.jl"><strong>HOHQMesh.jl</strong></a></p> <p>HOHQMesh.jl is a Julia wrapper for the HOHQMesh mesh generator, which produces curved quadrilateral and hexahedral meshes for high-order numerical simulations.</p> <li><p><a href="https://github.com/trixi-framework/HOHQMesh"><strong>HOHQMesh</strong></a></p> <p>High Order Hex-Quad Mesh &#40;HOHQMesh&#41; package to automatically generate all-quadrilateral meshes with high order boundary information.</p> <li><p><a href="https://github.com/trixi-framework/Smesh.jl"><strong>Smesh.jl</strong></a></p> <p>Smesh.jl is a Julia wrapper package for smesh, a simple Fortran package for generating and handling unstructured triangular and polygonal meshes.</p> <li><p><a href="https://github.com/trixi-framework/smesh"><strong>smesh</strong></a></p> <p>A simple Fortran package for generating and handling unstructured triangular and polygonal meshes.</p> </ul> <h2 id=particle-based_multiphysics_simulations ><a 
href="#particle-based_multiphysics_simulations" class=header-anchor >Particle-based multiphysics simulations</a></h2> <ul> <li><p><a href="https://github.com/trixi-framework/TrixiParticles.jl"><strong>TrixiParticles.jl</strong></a></p> <p>Particle-based multiphysics simulations in Julia</p> <li><p><a href="https://github.com/trixi-framework/PointNeighbors.jl"><strong>PointNeighbors.jl</strong></a></p> <p>Efficient neighborhood search in point clouds with fixed search radius</p> </ul> <h2 id=additional_packages ><a href="#additional_packages" class=header-anchor >Additional packages</a></h2> <ul> <li><p><a href="https://github.com/trixi-framework/P4est.jl"><strong>P4est.jl</strong></a></p> <p>P4est.jl is a lightweight Julia wrapper for the p4est C library.</p> <li><p><a href="https://github.com/trixi-framework/KROME.jl"><strong>KROME.jl</strong></a></p> <p>KROME.jl is a lightweight Julia wrapper for KROME, a Fortran library for including chemistry and microphysics in astrophysics simulations.</p> <li><p><a href="https://github.com/JuliaVTK/ReadVTK.jl"><strong>JuliaVTK/ReadVTK.jl</strong></a></p> <p>Julia package for reading VTK XML files &#40;maintained by the Trixi framework authors&#41;.</p> </ul> <h2 id=publications ><a href="#publications" class=header-anchor >Publications</a></h2> <p>The following publications make use of Trixi.jl or one of the other packages listed above. 
Author names of Trixi.jl&#39;s main developers are in <em>italics</em>.</p> <h3 id=2024 ><a href="#2024" class=header-anchor >2024</a></h3> <ul> <li><p><em>Doehring</em>, <em>Christmann</em>, <em>Schlottke-Lakemper</em>, <em>Gassner</em>, Torrilhon, <strong>Fourth-Order Paired-Explicit Runge-Kutta Methods</strong>, 2024.<br/> <a href="https://arxiv.org/abs/2408.05470"><img src="https://img.shields.io/badge/arXiv-2408.05470-yellow" alt="arXiv:2408.05470" /></a></p> <li><p><em>Ersing</em>, Goldberg, <em>Winters</em>, <strong>Entropy stable hydrostatic reconstruction schemes for shallow water systems</strong>, 2024.<br/> <a href="https://arxiv.org/abs/2406.14119"><img src="https://img.shields.io/badge/arXiv-2406.14119-yellow" alt="arXiv:2406.14119" /></a> <a href="https://github.com/patrickersing/paper-2024-es_hydrostatic_reconstruction"><img src="https://img.shields.io/badge/reproduce-me&#33;-brightgreen" alt="reproduce me&#33;" /></a></p> <li><p>Glaubitz, <em>Ranocha</em>, <em>Winters</em>, <em>Schlottke-Lakemper</em>, Öffner, <em>Gassner</em>, <strong>Generalized upwind summation-by-parts operators and their application to nodal discontinuous Galerkin methods</strong>, 2024.<br/> <a href="https://arxiv.org/abs/2406.14557"><img src="https://img.shields.io/badge/arXiv-2406.14557-yellow" alt="arXiv:2406.14557" /></a> <a href="https://github.com/trixi-framework/paper-2024-generalized-upwind-sbp"><img src="https://img.shields.io/badge/reproduce-me&#33;-brightgreen" alt="reproduce me&#33;" /></a></p> <li><p>Bender, Öffner, <strong>Entropy-Conservative Discontinuous Galerkin Methods for the Shallow Water Equations with Uncertainty</strong>, 2024.<br/> <a href="https://doi.org/10.1007/s42967-024-00369-y"><img src="https://zenodo.org/badge/doi/10.1007/s42967-024-00369-y.svg" alt="doi:10.1007/s42967-024-00369-y" /></a></p> <li><p>Oblapenko, Torrilhon, <strong>Entropy-conservative high-order methods for high-enthalpy flows</strong>, 2024.<br/> <a 
href="https://arxiv.org/abs/2403.16882"><img src="https://img.shields.io/badge/arXiv-2403.16882-yellow" alt="arXiv:2403.16882" /></a> <a href="https://github.com/knstmrd/paper-ec_trixi_inte"><img src="https://img.shields.io/badge/reproduce-me&#33;-brightgreen" alt="reproduce me&#33;" /></a></p> <li><p><em>Doehring</em>, <em>Schlottke-Lakemper</em>, <em>Gassner</em>, Torrilhon, <strong>Multirate Time-Integration based on Dynamic ODE Partitioning through Adaptively Refined Meshes for Compressible Fluid Dynamics</strong>, 2024.<br/> <a href="https://arxiv.org/abs/2403.05144"><img src="https://img.shields.io/badge/arXiv-2403.05144-yellow" alt="arXiv:2403.05144" /></a> <a href="https://doi.org/10.1016/j.jcp.2024.113223"><img src="https://zenodo.org/badge/doi/10.1016/j.jcp.2024.113223.svg" alt="doi:10.1016/j.jcp.2024.113223" /></a> <a href="https://github.com/trixi-framework/paper-2024-amr-paired-rk"><img src="https://img.shields.io/badge/reproduce-me&#33;-brightgreen" alt="reproduce me&#33;" /></a></p> <li><p>Babbar, Chandrashekar, <strong>Multi-Derivative Runge-Kutta Flux Reconstruction for Hyperbolic Conservation Laws</strong>, 2024.<br/> <a href="https://arxiv.org/abs/2403.02141"><img src="https://img.shields.io/badge/arXiv-2403.02141-yellow" alt="arXiv:2403.02141" /></a></p> <li><p><em>Lampert</em>, <em>Ranocha</em>, <strong>Structure-Preserving Numerical Methods for Two Nonlinear Systems of Dispersive Wave Equations</strong>, 2024.<br/> <a href="https://arxiv.org/abs/2402.16669"><img src="https://img.shields.io/badge/arXiv-2402.16669-yellow" alt="arXiv:2402.16669" /></a> <a href="https://github.com/JoshuaLampert/2024_dispersive_shallow_water"><img src="https://img.shields.io/badge/reproduce-me&#33;-brightgreen" alt="reproduce me&#33;" /></a></p> <li><p>Rueda-Ramírez, Sikstel, <em>Gassner</em>, <strong>An Entropy-Stable Discontinuous Galerkin Discretization of the Ideal Multi-Ion Magnetohydrodynamics System</strong>, 2024.<br/> <a 
href="https://arxiv.org/abs/2402.14615"><img src="https://img.shields.io/badge/arXiv-2402.14615-yellow" alt="arXiv:2402.14615" /></a></p> <li><p><em>Doehring</em>, <em>Gassner</em>, Torrilhon, <strong>Many-Stage Optimal Stabilized Runge-Kutta Methods for Hyperbolic Partial Differential Equations</strong>, 2024.<br/> <a href="https://arxiv.org/abs/2402.12140"><img src="https://img.shields.io/badge/arXiv-2402.12140-yellow" alt="arXiv:2402.12140" /></a></p> <li><p>Babbar, Chandrashekar, <strong>Lax-Wendroff Flux Reconstruction for advection-diffusion equations with error-based time stepping</strong>, 2024.<br/> <a href="https://arxiv.org/abs/2402.12669"><img src="https://img.shields.io/badge/arXiv-2402.12669-yellow" alt="arXiv:2402.12669" /></a></p> <li><p>Babbar, Chandrashekar, <strong>Lax-Wendroff Flux Reconstruction on adaptive curvilinear meshes with error based time stepping for hyperbolic conservation laws</strong>, 2024.<br/> <a href="https://arxiv.org/abs/2402.11926"><img src="https://img.shields.io/badge/arXiv-2402.11926-yellow" alt="arXiv:2402.11926" /></a></p> </ul> <h3 id=2023 ><a href="#2023" class=header-anchor >2023</a></h3> <ul> <li><p>Babbar, Kenettinkara, Chandrashekar, <strong>Admissibility preserving subcell limiter for Lax-Wendroff flux reconstruction</strong>, 2023.<br/> <a href="https://arxiv.org/abs/2305.10781"><img src="https://img.shields.io/badge/arXiv-2305.10781-yellow" alt="arXiv:2305.10781" /></a></p> <li><p>Ovadia, Oommen, Kahana, Peyvan, Turkel, Karniadakis, <strong>Real-time Inference and Extrapolation via a Diffusion-inspired Temporal Transformer Operator &#40;DiTTO&#41;</strong>, 2023.<br/> <a href="https://arxiv.org/abs/2307.09072"><img src="https://img.shields.io/badge/arXiv-2307.09072-yellow" alt="arXiv:2307.09072" /></a></p> <li><p><em>Ranocha</em>, <em>Winters</em>, <em>Schlottke-Lakemper</em>, Öffner, Glaubitz, <em>Gassner</em>, <strong>High-order upwind summation-by-parts methods for nonlinear conservation laws</strong>, 
2023.<br/> <a href="https://arxiv.org/abs/2311.13888"><img src="https://img.shields.io/badge/arXiv-2311.13888-yellow" alt="arXiv:2311.13888" /></a> <a href="https://github.com/trixi-framework/paper-2023-upwind"><img src="https://img.shields.io/badge/reproduce-me&#33;-brightgreen" alt="reproduce me&#33;" /></a></p> <li><p><em>Ranocha</em>, Schütz, <strong>Multiderivative time integration methods preserving nonlinear functionals via relaxation</strong>, 2023.<br/> <a href="https://arxiv.org/abs/2311.03883"><img src="https://img.shields.io/badge/arXiv-2311.03883-yellow" alt="arXiv:2311.03883" /></a> <a href="https://doi.org/10.2140/camcos.2024.19.27"><img src="https://zenodo.org/badge/doi/10.2140/camcos.2024.19.27.svg" alt="doi:10.2140/camcos.2024.19.27" /></a> <a href="https://github.com/ranocha/2023_multiderivative_relaxation"><img src="https://img.shields.io/badge/reproduce-me&#33;-brightgreen" alt="reproduce me&#33;" /></a></p> <li><p><em>Ranocha</em>, Giesselmann, <strong>Stability of step size control based on a posteriori error estimates</strong>, 2023.<br/> <a href="https://arxiv.org/abs/2307.12677"><img src="https://img.shields.io/badge/arXiv-2307.12677-yellow" alt="arXiv:2307.12677" /></a> <a href="https://github.com/ranocha/2023_RK_error_estimate"><img src="https://img.shields.io/badge/reproduce-me&#33;-brightgreen" alt="reproduce me&#33;" /></a></p> <li><p><em>Chan</em>, Shukla, Wu, Liu, Nalluri, <strong>High order entropy stable schemes for the quasi-one-dimensional shallow water and compressible Euler equations</strong>, 2023.<br/> <a href="https://arxiv.org/abs/2307.12089"><img src="https://img.shields.io/badge/arXiv-2307.12089-yellow" alt="arXiv:2307.12089" /></a></p> <li><p>Ersing, <em>Winters</em>, <strong>An entropy stable discontinuous Galerkin method for the two-layer shallow water equations on curvilinear meshes</strong>, 2023.<br/> <a href="https://arxiv.org/abs/2306.12699"><img src="https://img.shields.io/badge/arXiv-2306.12699-yellow" 
alt="arXiv:2306.12699" /></a> <a href="https://doi.org/10.1007/s10915-024-02451-2"><img src="https://zenodo.org/badge/doi/10.1007/s10915-024-02451-2.svg" alt="doi:10.1007/s10915-024-02451-2" /></a> <a href="https://github.com/trixi-framework/paper-2023-es_two_layer"><img src="https://img.shields.io/badge/reproduce-me&#33;-brightgreen" alt="reproduce me&#33;" /></a></p> <li><p>Rueda-Ramírez, Bolm, Kuzmin, <em>Gassner</em>, <strong>Monolithic Convex Limiting for Legendre–Gauss–Lobatto Discontinuous Galerkin Spectral Element Methods</strong>, 2023.<br/> <a href="https://arxiv.org/abs/2303.00374"><img src="https://img.shields.io/badge/arXiv-2303.00374-yellow" alt="arXiv:2303.00374" /></a> <a href="https://doi.org/10.1007/s42967-023-00321-6"><img src="https://zenodo.org/badge/doi/10.1007/s42967-023-00321-6.svg" alt="doi:10.1007/s42967-023-00321-6" /></a> <a href="https://github.com/amrueda/paper_2023_MCL_LGL-DGSEM"><img src="https://img.shields.io/badge/reproduce-me&#33;-brightgreen" alt="reproduce me&#33;" /></a></p> <li><p><em>Ranocha</em>, <strong>A discontinuous Galerkin discretization of elliptic problems with improved convergence properties using summation by parts operators</strong>, 2023.<br/> <a href="https://arxiv.org/abs/2302.12488"><img src="https://img.shields.io/badge/arXiv-2302.12488-yellow" alt="arXiv:2302.12488" /></a> <a href="https://doi.org/10.1016/j.jcp.2023.112367"><img src="https://zenodo.org/badge/doi/10.1016/j.jcp.2023.112367.svg" alt="doi:10.1016/j.jcp.2023.112367" /></a> <a href="https://github.com/ranocha/2023_elliptic"><img src="https://img.shields.io/badge/reproduce-me&#33;-brightgreen" alt="reproduce me&#33;" /></a></p> <li><p><em>Ranocha</em>, <em>Winters</em>, Castro, Dalcin, <em>Schlottke-Lakemper</em>, <em>Gassner</em>, Parsani, <strong>On error-based step size control for discontinuous Galerkin methods for compressible fluid dynamics</strong>, 2023.<br/> <a href="https://arxiv.org/abs/2209.07037"><img 
src="https://img.shields.io/badge/arXiv-2209.07037-yellow" alt="arXiv:2209.07037" /></a> <a href="https://doi.org/10.1007/s42967-023-00264-y"><img src="https://zenodo.org/badge/doi/10.1007/s42967-023-00264-y.svg" alt="doi:10.1007/s42967-023-00264-y" /></a> <a href="https://github.com/trixi-framework/paper-2022-stepsize_control"><img src="https://img.shields.io/badge/reproduce-me&#33;-brightgreen" alt="reproduce me&#33;" /></a></p> <li><p><em>Ranocha</em>, <em>Schlottke-Lakemper</em>, <em>Chan</em>, Rueda-Ramírez, <em>Winters</em>, Hindenlang, <em>Gassner</em>, <strong>Efficient implementation of modern entropy stable and kinetic energy preserving discontinuous Galerkin methods for conservation laws</strong>, ACM Transactions on Mathematical Software, 2023.<br/> <a href="https://arxiv.org/abs/2112.10517"><img src="https://img.shields.io/badge/arXiv-2112.10517-yellow" alt="arXiv:2112.10517" /></a> <a href="https://doi.org/10.1145/3625559"><img src="https://zenodo.org/badge/doi/10.1145/3625559.svg" alt="doi:10.1145/3625559" /></a> <a href="https://github.com/trixi-framework/paper-2021-EC_performance"><img src="https://img.shields.io/badge/reproduce-me&#33;-brightgreen" alt="reproduce me&#33;" /></a></p> </ul> <h3 id=2022 ><a href="#2022" class=header-anchor >2022</a></h3> <ul> <li><p><em>Chan</em>, <em>Ranocha</em>, Rueda-Ramírez, <em>Gassner</em>, Warburton, <strong>On the entropy projection and the robustness of high order entropy stable discontinuous Galerkin schemes for under-resolved flows</strong>, 2022.<br/> <a href="https://arxiv.org/abs/2203.10238"><img src="https://img.shields.io/badge/arXiv-2203.10238-yellow" alt="arXiv:2203.10238" /></a> <a href="https://doi.org/10.3389/fphy.2022.898028"><img src="https://zenodo.org/badge/doi/10.3389/fphy.2022.898028.svg" alt="doi:10.3389/fphy.2022.898028" /></a> <a href="https://github.com/trixi-framework/paper-2022-robustness-entropy-projection"><img src="https://img.shields.io/badge/reproduce-me&#33;-brightgreen" 
alt="reproduce me&#33;" /></a></p> <li><p>Rueda-Ramírez, Pazner, <em>Gassner</em>, <strong>Subcell limiting strategies for discontinuous Galerkin spectral element methods</strong>, 2022.<br/> <a href="https://arxiv.org/abs/2202.00576"><img src="https://img.shields.io/badge/arXiv-2202.00576-yellow" alt="arXiv:2202.00576" /></a> <a href="https://doi.org/10.1016/j.compfluid.2022.105627"><img src="https://zenodo.org/badge/doi/10.1016/j.compfluid.2022.105627.svg" alt="doi:10.1016/j.compfluid.2022.105627" /></a></p> <li><p>Lukáčová-Medvid’ová, Öffner, <strong>Convergence of Discontinuous Galerkin Schemes for the Euler Equations via Dissipative Weak Solutions</strong>, 2022.<br/> <a href="https://arxiv.org/abs/2202.10043"><img src="https://img.shields.io/badge/arXiv-2202.10043-yellow" alt="arXiv:2202.10043" /></a> <a href="https://doi.org/10.1016/j.amc.2022.127508"><img src="https://zenodo.org/badge/doi/10.1016/j.amc.2022.127508.svg" alt="doi:10.1016/j.amc.2022.127508" /></a></p> <li><p><em>Ranocha</em>, <strong>A Note on Numerical Fluxes Conserving Harten&#39;s Entropies for the Compressible Euler Equations</strong>, 2022.<br/> <a href="https://arxiv.org/abs/2201.03946"><img src="https://img.shields.io/badge/arXiv-2201.03946-yellow" alt="arXiv:2201.03946" /></a> <a href="https://doi.org/10.1016/j.jcp.2022.111236"><img src="https://zenodo.org/badge/doi/10.1016/j.jcp.2022.111236.svg" alt="doi:10.1016/j.jcp.2022.111236" /></a> <a href="https://github.com/ranocha/paper-2022-Euler_Harten_EC"><img src="https://img.shields.io/badge/reproduce-me&#33;-brightgreen" alt="reproduce me&#33;" /></a></p> <li><p><em>Ranocha</em>, <em>Schlottke-Lakemper</em>, <em>Winters</em>, <em>Faulhaber</em>, <em>Chan</em>, <em>Gassner</em>, <strong>Adaptive numerical simulations with Trixi.jl: A case study of Julia for scientific computing</strong>, JuliaCon Proceedings, 77, 2022.<br/> <a href="https://arxiv.org/abs/2108.06476"><img src="https://img.shields.io/badge/arXiv-2108.06476-yellow" 
alt="arXiv:2108.06476" /></a> <a href="https://doi.org/10.21105/jcon.00077"><img src="https://zenodo.org/badge/doi/10.21105/jcon.00077.svg" alt="doi:10.21105/jcon.00077" /></a> <a href="https://github.com/trixi-framework/paper-2021-juliacon"><img src="https://img.shields.io/badge/reproduce-me&#33;-brightgreen" alt="reproduce me&#33;" /></a></p> <li><p><em>Gassner</em>, Svärd, Hindenlang, <strong>Stability Issues of Entropy-Stable and/or Split-form High-order Schemes</strong>, 2022.<br/> <a href="https://arxiv.org/abs/2007.09026"><img src="https://img.shields.io/badge/arXiv-2007.09026-yellow" alt="arXiv:2007.09026" /></a> <a href="https://doi.org/10.1007/s10915-021-01720-8"><img src="https://zenodo.org/badge/doi/10.1007/s10915-021-01720-8.svg" alt="doi:10.1007/s10915-021-01720-8" /></a></p> </ul> <h3 id=2021 ><a href="#2021" class=header-anchor >2021</a></h3> <ul> <li><p>Singh, Chandrashekar, <strong>On a linear stability issue of split form schemes for compressible flows</strong>, 2021.<br/> <a href="https://arxiv.org/abs/2104.14941"><img src="https://img.shields.io/badge/arXiv-2104.14941-yellow" alt="arXiv:2104.14941" /></a></p> <li><p><em>Ranocha</em>, <em>Gassner</em>, <strong>Preventing pressure oscillations does not fix local linear stability issues of entropy-based split-form high-order schemes</strong>, Communications on Applied Mathematics and Computation, 2021.<br/> <a href="https://arxiv.org/abs/2009.13139"><img src="https://img.shields.io/badge/arXiv-2009.13139-yellow" alt="arXiv:2009.13139" /></a> <a href="https://doi.org/10.1007/s42967-021-00148-z"><img src="https://zenodo.org/badge/doi/10.1007/s42967-021-00148-z.svg" alt="doi:10.1007/s42967-021-00148-z" /></a> <a href="https://github.com/trixi-framework/paper-EC-KEP-PEP"><img src="https://img.shields.io/badge/reproduce-me&#33;-brightgreen" alt="reproduce me&#33;" /></a></p> <li><p><em>Schlottke-Lakemper</em>, <em>Winters</em>, <em>Ranocha</em>, <em>Gassner</em>, <strong>A purely hyperbolic 
discontinuous Galerkin approach for self-gravitating gas dynamics</strong>, Journal of Computational Physics &#40;442&#41;, 110467, 2021.<br/> <a href="https://arxiv.org/abs/2008.10593"><img src="https://img.shields.io/badge/arXiv-2008.10593-yellow" alt="arXiv:2008.10593" /></a> <a href="https://doi.org/10.1016/j.jcp.2021.110467"><img src="https://zenodo.org/badge/doi/10.1016/j.jcp.2021.110467.svg" alt="doi:10.1016/j.jcp.2021.110467" /></a> <a href="https://github.com/trixi-framework/paper-self-gravitating-gas-dynamics"><img src="https://img.shields.io/badge/reproduce-me&#33;-brightgreen" alt="reproduce me&#33;" /></a></p> </ul> <h2 id=talks ><a href="#talks" class=header-anchor >Talks</a></h2> <h3 id=2024__2 ><a href="#2024__2" class=header-anchor >2024</a></h3> <ul> <li><p><strong>Non-intrusive Multirate Time-Integration for High-Order accurate Compressible Fluid Dynamics with Trixi.jl</strong><br/> <em>Doehring</em><br/> 2nd July 2024, PDESoft 2024, Cambridge, UK</p> </ul> <h3 id=2023__2 ><a href="#2023__2" class=header-anchor >2023</a></h3> <ul> <li><p><strong>Challenges of sustainable research software engineering in Trixi.jl</strong><br/> <em>Schlottke-Lakemper</em><br/> 27th October 2023, MBD Colloquium, Aachen, Germany</p> <li><p><strong>Julia for scientific high-performance computing: opportunities and challenges</strong><br/> <em>Schlottke-Lakemper</em><br/> 6th October 2023, Ferrite.jl User &amp; Developer Conference, Bochum, Germany</p> <li><p><strong>Scaling Trixi.jl to more than 10,000 cores using MPI</strong><br/> <em>Schlottke-Lakemper</em>, <em>Ranocha</em><br/> 27th July 2023, JuliaCon 2023, Cambridge, US</p> <li><p><strong>Massively Parallel Computational Fluid Dynamics with Julia and Trixi.jl</strong><br/> <em>Schlottke-Lakemper</em><br/> 28th June 2023, PASC Conference, Davos, Switzerland</p> <li><p><strong>Research Software Engineering for Sustainable Scientific Computing</strong><br/> <em>Schlottke-Lakemper</em><br/> 30th January 2023, SSD 
Seminar Series, Aachen, Germany</p> <li><p><strong>Trixi.jl: High-Order Numerical Simulations of Conservation Laws in Julia</strong><br/> <em>Schlottke-Lakemper</em><br/> 19th January 2023, SNuBIC Seminar<br/> <a href="https://github.com/trixi-framework/tutorial-2023-snubic">tutorials &amp; notebooks</a></p> </ul> <h3 id=2022__2 ><a href="#2022__2" class=header-anchor >2022</a></h3> <ul> <li><p><strong>Robust and efficient high-performance computational fluid dynamics enabled by modern numerical methods and technologies</strong><br/> <em>Ranocha</em><br/> 3rd November 2022, MUSEN Colloquium, TU Braunschweig, Germany</p> <li><p><strong>Reproducibility as a service: collaborative scientific computing with Julia</strong><br/> <em>Schlottke-Lakemper</em>, <em>Ranocha</em><br/> 27th October 2022, MaRDI Workshop for Scientific Computing, Münster, Germany</p> <li><p><strong>From Mesh Generation to Adaptive Simulation: A Journey in Julia</strong><br/> <em>Winters</em><br/> 27th July 2022, JuliaCon 2022<br/> <a href="https://youtu.be/_N4ozHr-t9E">recorded talk on YouTube</a> | <a href="https://github.com/trixi-framework/talk-2022-juliacon_toolchain">presentation &amp; code</a></p> <li><p><strong>Running Julia code in parallel with MPI: Lessons learned</strong><br/> <em>Christmann</em>, Neher, <em>Schlottke-Lakemper</em><br/> 26th July 2022, Julia for HPC Minisymposium, JuliaCon 2022<br/> <a href="https://youtu.be/fog1x9rs71Q?t&#61;5172">recorded talk on YouTube</a> | <a href="https://github.com/JuliaParallel/juliacon-2022-julia-for-hpc-minisymposium">presentation</a></p> <li><p><strong>Extensible Computational Fluid Dynamics in Julia with Trixi.jl</strong><br/> <em>Schlottke-Lakemper</em>, <em>Ranocha</em>, <em>Gassner</em><br/> 25th February 2022, SIAM Conference on Parallel Processing for Scientific Computing, Seattle, US</p> </ul> <h3 id=2021__2 ><a href="#2021__2" class=header-anchor >2021</a></h3> <ul> <li><p><strong>Research software development with 
Julia</strong><br/> <em>Schlottke-Lakemper</em>, <em>Ranocha</em><br/> 27th September 2021, NFDI4Ing Conference 2021</p> <li><p><strong>Adaptive high-order numerical simulations with Trixi.jl</strong><br/> <em>Schlottke-Lakemper</em>, <em>Ranocha</em><br/> 9th September 2021, CliMA Seminar, California Institute of Technology</p> <li><p><strong>Adaptive and extendable numerical simulations with Trixi.jl</strong><br/> <em>Schlottke-Lakemper</em>, <em>Ranocha</em><br/> 30th July 2021, JuliaCon 2021<br/> <a href="https://github.com/trixi-framework/talk-2021-juliacon">presentation &amp; notebooks</a> | <a href="https://www.youtube.com/watch?v&#61;hoViWRAhCBE">recorded talk on YouTube</a></p> <li><p><strong>Trixi.jl: High-Order Numerical Simulations of Hyperbolic PDEs in Julia</strong><br/> <em>Ranocha</em>, <em>Schlottke-Lakemper</em>, <em>Winters</em><br/> 14th July 2021, ICOSAHOM 2021<br/> <a href="https://github.com/trixi-framework/tutorial-2021-icosahom">tutorials &amp; notebooks</a></p> <li><p><strong>Introduction to Julia and Trixi, a numerical simulation framework for hyperbolic PDEs</strong><br/> <em>Ranocha</em><br/> 27th April 2021, Applied Mathematics Seminar, University of Münster<br/> <a href="https://github.com/trixi-framework/talk-2021-Introduction_to_Julia_and_Trixi">presentation</a></p> <li><p><strong>Purely hyperbolic self-gravitating flow simulations in Julia</strong><br/> <em>Schlottke-Lakemper</em>, <em>Winters</em>, <em>Ranocha</em>, <em>Gassner</em><br/> 15th March 2021, GAMM Annual Meeting 2021</p> <li><p><strong>Julia for adaptive high-order multi-physics simulations</strong><br/> <em>Schlottke-Lakemper</em><br/> 27th January 2021, Numerical Analysis Seminar, Lund University<br/> <a href="https://github.com/trixi-framework/talk-2021-julia-adaptive-multi-physics-simulations">presentation &amp; notebooks</a></p> </ul> <h2 id=outreach ><a href="#outreach" class=header-anchor >Outreach</a></h2> <h3 id=google_summer_of_code_2023 ><a 
href="#google_summer_of_code_2023" class=header-anchor >Google Summer of Code 2023</a></h3> <p>Trixi.jl participated in the Google Summer of Code 2023, marking its initial steps towards running on GPUs. This project was mentored by <a href="https://ranocha.de/">Hendrik Ranocha</a> and <a href="https://www.uni-augsburg.de/fakultaet/mntf/math/prof/hpsc">Michael Schlottke-Lakemper</a>. <a href="outreach/gsoc/2023/gpu-acceleration-in-trixi-jl-using-cuda-jl">Here</a> you can find the report from our contributor <a href="https://github.com/huiyuxie">Huiyu Xie</a>.</p> <h2 id=authors ><a href="#authors" class=header-anchor >Authors</a></h2> <p><a href="https://www.uni-augsburg.de/fakultaet/mntf/math/prof/hpsc">Michael Schlottke-Lakemper</a> &#40;University of Augsburg, Germany&#41;, <a href="https://www.mi.uni-koeln.de/NumSim/gregor-gassner">Gregor Gassner</a> &#40;University of Cologne, Germany&#41;, <a href="https://ranocha.de/">Hendrik Ranocha</a> &#40;University of Hamburg, Germany&#41;, <a href="https://liu.se/en/employee/andwi94">Andrew Winters</a> &#40;Linköping University, Sweden&#41;, and <a href="https://jlchan.github.io/">Jesse Chan</a> &#40;Rice University, US&#41; are the principal developers of <a href="https://github.com/trixi-framework/Trixi.jl">Trixi.jl</a>. <a href="https://www.math.fsu.edu/~kopriva/">David A. Kopriva</a> &#40;Florida State University, US&#41; is the principal developer of <a href="https://github.com/trixi-framework/HOHQMesh">HOHQMesh</a> and <a href="https://github.com/trixi-framework/HOHQMesh.jl">HOHQMesh.jl</a>. 
For a full list of authors, please check out the respective packages.</p> <h2 id=get_in_touch ><a href="#get_in_touch" class=header-anchor >Get in touch&#33;</a></h2> <p>There are a number of ways to reach out to us:</p> <ul> <li><p>Meet us on <a href="https://join.slack.com/t/trixi-framework/shared_invite/zt-sgkc6ppw-6OXJqZAD5SPjBYqLd8MU~g">Slack</a></p> <li><p>Create an issue in one of the repositories listed on this page</p> <li><p>Get in touch with one of the <a href="https://github.com/trixi-framework/Trixi.jl/blob/main/AUTHORS.md">Trixi Authors</a></p> </ul> <h2 id=acknowledgments ><a href="#acknowledgments" class=header-anchor >Acknowledgments</a></h2> <div style="width: 100%; text-align: center; font-size: 0;"> <div><!-- BMBF --><img src="https://github.com/trixi-framework/Trixi.jl/assets/3637659/f59af636-3098-4be6-bf80-c6be3f17cbc6" style="height: 120px; width: auto"><!-- DFG --><img src="https://github.com/trixi-framework/Trixi.jl/assets/3637659/e67b9ed3-7699-466a-bdaf-2ba070a29a8e" style="height: 120px; width: auto"><!-- --> </div> <div><!-- SRC --><img src="https://github.com/trixi-framework/Trixi.jl/assets/3637659/48f9da06-6f7a-4586-b23e-739bee3901c0" style="height: 120px; width: auto"><!-- ERC --><img src="https://github.com/trixi-framework/Trixi.jl/assets/3637659/9371e7e4-3491-4433-ac5f-b3bfb215f5ca" style="height: 120px; width: auto"><!-- --> </div> <div><!-- NSF --><img src="https://github.com/trixi-framework/Trixi.jl/assets/3637659/5325103c-ae81-4747-b87c-c6e4a1b1d7a8" style="height: 120px; width: auto"><!-- DUBS --><img src="https://github.com/trixi-framework/Trixi.jl/assets/3637659/bb021e6e-42e6-4fe1-a414-c847402e1937" style="height: 120px; width: auto"><!-- --> </div> <div><!-- NumFOCUS --><img src="https://github.com/trixi-framework/Trixi.jl/assets/3637659/8496ac9e-b586-475f-adb7-69bcfc415185" style="height: 120px; width: auto"><!-- --> </div> </div> <p>This project has benefited from funding by the <a href="https://www.dfg.de/">Deutsche 
Forschungsgemeinschaft</a> &#40;DFG, German Research Foundation&#41; through the following grants:</p> <ul> <li><p>Excellence Strategy EXC 2044-390685587, Mathematics Münster: Dynamics-Geometry-Structure.</p> <li><p>Research unit FOR 5409 &quot;Structure-Preserving Numerical Methods for Bulk- and Interface Coupling of Heterogeneous Models &#40;SNuBIC&#41;&quot; &#40;project number 463312734&#41;.</p> <li><p>Individual grant no. 528753982.</p> </ul> <p>This project has benefited from funding from the <a href="https://erc.europa.eu">European Research Council</a> through the ERC Starting Grant &quot;An Exascale aware and Un-crashable Space-Time-Adaptive Discontinuous Spectral Element Solver for Non-Linear Conservation Laws&quot; &#40;Extreme&#41;, ERC grant agreement no. 714487.</p> <p>This project has benefited from funding from <a href="https://www.vr.se">Vetenskapsrådet</a> &#40;VR, Swedish Research Council&#41;, Sweden through the VR Starting Grant &quot;Shallow water flows including sediment transport and morphodynamics&quot;, VR grant agreement 2020-03642 VR.</p> <p>This project has benefited from funding from the United States <a href="https://www.nsf.gov/">National Science Foundation</a> &#40;NSF&#41; under awards DMS-1719818 and DMS-1943186.</p> <p>This project has benefited from funding from the German <a href="https://www.bmbf.de">Federal Ministry of Education and Research</a> &#40;BMBF&#41; through the project grant &quot;Adaptive earth system modeling with significantly reduced computation time for exascale supercomputers &#40;ADAPTEX&#41;&quot; &#40;funding id: 16ME0668K&#41;.</p> <p>This project has benefited from funding by the <a href="https://www.daimler-benz-stiftung.de">Daimler und Benz Stiftung</a> &#40;Daimler and Benz Foundation&#41; through grant no. 
32-10/22.</p> <p>Trixi.jl is supported by <a href="https://numfocus.org/">NumFOCUS</a> as an Affiliated Project.</p> <div class=page-foot > <div class=copyright > &copy; <a href="https://github.com/trixi-framework/Trixi.jl/blob/main/AUTHORS.md" target=_blank  rel="noopener noreferrer">The Trixi Authors</a>. Last modified: August 22, 2024. Website built with <a href="https://github.com/tlienart/Franklin.jl">Franklin.jl</a> and the <a href="https://julialang.org">Julia programming language</a>. </div> </div> </div>
\ No newline at end of file
+<!doctype html> <html lang=en > <meta charset=UTF-8 > <meta name=viewport  content="width=device-width, initial-scale=1"> <link rel=stylesheet  href="/css/franklin.css"> <link rel=stylesheet  href="/css/basic.css"> <link rel=icon  href="/assets/favicon.ico"> <title>Trixi Framework</title> <header> <div class=blog-name ><a href="/"><img src="/assets/logo.png" width=100px ></a></div> <nav> <ul> <li><a href="/">Home</a> <li><a href="https://github.com/trixi-framework" target=_blank  rel="noopener noreferrer">Trixi on GitHub</a> </ul> </nav> </header> <div class=franklin-content ><h1 id=trixi_framework ><a href="#trixi_framework" class=header-anchor >Trixi Framework</a></h1> <p>The Trixi framework is a collaborative scientific effort to provide open source tools for adaptive high-order numerical simulations of hyperbolic PDEs in Julia. Besides the core algorithms, the framework also includes mesh and visualization tools. Moreover, it includes utilities such as Julia wrappers of mature libraries written in other programming languages.</p> <p>This page gives an overview of the different activities that, together, constitute the Trixi framework on <a href="https://github.com/orgs/trixi-framework">GitHub</a>.</p> <div class=franklin-toc ><ol><li><a href="#adaptive_high-order_numerical_simulations_of_hyperbolic_pdes">Adaptive high-order numerical simulations of hyperbolic PDEs</a><li><a href="#mesh_generation">Mesh generation</a><li><a href="#particle-based_multiphysics_simulations">Particle-based multiphysics simulations</a><li><a href="#additional_packages">Additional packages</a><li><a href="#publications">Publications</a><li><a href="#talks">Talks</a><li><a href="#outreach">Outreach</a><li><a href="#authors">Authors</a><li><a href="#get_in_touch">Get in touch&#33;</a><li><a href="#acknowledgments">Acknowledgments</a></ol></div> <h2 id=adaptive_high-order_numerical_simulations_of_hyperbolic_pdes ><a href="#adaptive_high-order_numerical_simulations_of_hyperbolic_pdes" 
class=header-anchor >Adaptive high-order numerical simulations of hyperbolic PDEs</a></h2> <ul> <li><p><a href="https://github.com/trixi-framework/Trixi.jl"><strong>Trixi.jl</strong></a></p> <p>Adaptive high-order numerical simulations of hyperbolic PDEs in Julia</p> <li><p><a href="https://github.com/trixi-framework/Trixi2Vtk.jl"><strong>Trixi2Vtk.jl</strong></a></p> <p>Convert output files generated with Trixi.jl to VTK</p> <li><p><a href="https://github.com/trixi-framework/libtrixi"><strong>libtrixi</strong></a></p> <p>Use <a href="https://github.com/trixi-framework/Trixi.jl">Trixi.jl</a> from C/C&#43;&#43;/Fortran</p> <li><p><a href="https://github.com/trixi-framework/SmartShockFinder.jl"><strong>SmartShockFinder.jl</strong></a></p> <p>Create troubled cell indicators for Trixi.jl using artificial neural networks</p> </ul> <h2 id=mesh_generation ><a href="#mesh_generation" class=header-anchor >Mesh generation</a></h2> <ul> <li><p><a href="https://github.com/trixi-framework/HOHQMesh.jl"><strong>HOHQMesh.jl</strong></a></p> <p>HOHQMesh.jl is a Julia wrapper for the HOHQMesh mesh generator, which produces curved quadrilateral and hexahedral meshes for high-order numerical simulations.</p> <li><p><a href="https://github.com/trixi-framework/HOHQMesh"><strong>HOHQMesh</strong></a></p> <p>High Order Hex-Quad Mesh &#40;HOHQMesh&#41; package to automatically generate all-quadrilateral meshes with high order boundary information.</p> <li><p><a href="https://github.com/trixi-framework/Smesh.jl"><strong>Smesh.jl</strong></a></p> <p>Smesh.jl is a Julia wrapper package for smesh, a simple Fortran package for generating and handling unstructured triangular and polygonal meshes.</p> <li><p><a href="https://github.com/trixi-framework/smesh"><strong>smesh</strong></a></p> <p>A simple Fortran package for generating and handling unstructured triangular and polygonal meshes.</p> </ul> <h2 id=particle-based_multiphysics_simulations ><a 
href="#particle-based_multiphysics_simulations" class=header-anchor >Particle-based multiphysics simulations</a></h2> <ul> <li><p><a href="https://github.com/trixi-framework/TrixiParticles.jl"><strong>TrixiParticles.jl</strong></a></p> <p>Particle-based multiphysics simulations in Julia</p> <li><p><a href="https://github.com/trixi-framework/PointNeighbors.jl"><strong>PointNeighbors.jl</strong></a></p> <p>Efficient neighborhood search in point clouds with fixed search radius</p> </ul> <h2 id=additional_packages ><a href="#additional_packages" class=header-anchor >Additional packages</a></h2> <ul> <li><p><a href="https://github.com/trixi-framework/P4est.jl"><strong>P4est.jl</strong></a></p> <p>P4est.jl is a lightweight Julia wrapper for the p4est C library.</p> <li><p><a href="https://github.com/trixi-framework/KROME.jl"><strong>KROME.jl</strong></a></p> <p>KROME.jl is a lightweight Julia wrapper for KROME, a Fortran library for including chemistry and microphysics in astrophysics simulations.</p> <li><p><a href="https://github.com/JuliaVTK/ReadVTK.jl"><strong>JuliaVTK/ReadVTK.jl</strong></a></p> <p>Julia package for reading VTK XML files &#40;maintained by the Trixi framework authors&#41;.</p> </ul> <h2 id=publications ><a href="#publications" class=header-anchor >Publications</a></h2> <p>The following publications make use of Trixi.jl or one of the other packages listed above. 
Author names of Trixi.jl&#39;s main developers are in <em>italics</em>.</p> <h3 id=2024 ><a href="#2024" class=header-anchor >2024</a></h3> <ul> <li><p><em>Doehring</em>, <em>Christmann</em>, <em>Schlottke-Lakemper</em>, <em>Gassner</em>, Torrilhon, <strong>Fourth-Order Paired-Explicit Runge-Kutta Methods</strong>, 2024.<br/> <a href="https://arxiv.org/abs/2408.05470"><img src="https://img.shields.io/badge/arXiv-2408.05470-yellow" alt="arXiv:2408.05470" /></a></p> <li><p><em>Ersing</em>, Goldberg, <em>Winters</em>, <strong>Entropy stable hydrostatic reconstruction schemes for shallow water systems</strong>, 2024.<br/> <a href="https://arxiv.org/abs/2406.14119"><img src="https://img.shields.io/badge/arXiv-2406.14119-yellow" alt="arXiv:2406.14119" /></a> <a href="https://github.com/patrickersing/paper-2024-es_hydrostatic_reconstruction"><img src="https://img.shields.io/badge/reproduce-me&#33;-brightgreen" alt="reproduce me&#33;" /></a></p> <li><p>Glaubitz, <em>Ranocha</em>, <em>Winters</em>, <em>Schlottke-Lakemper</em>, Öffner, <em>Gassner</em>, <strong>Generalized upwind summation-by-parts operators and their application to nodal discontinuous Galerkin methods</strong>, 2024.<br/> <a href="https://arxiv.org/abs/2406.14557"><img src="https://img.shields.io/badge/arXiv-2406.14557-yellow" alt="arXiv:2406.14557" /></a> <a href="https://github.com/trixi-framework/paper-2024-generalized-upwind-sbp"><img src="https://img.shields.io/badge/reproduce-me&#33;-brightgreen" alt="reproduce me&#33;" /></a></p> <li><p>Bender, Öffner, <strong>Entropy-Conservative Discontinuous Galerkin Methods for the Shallow Water Equations with Uncertainty</strong>, 2024.<br/> <a href="https://doi.org/10.1007/s42967-024-00369-y"><img src="https://zenodo.org/badge/doi/10.1007/s42967-024-00369-y.svg" alt="doi:10.1007/s42967-024-00369-y" /></a></p> <li><p>Oblapenko, Torrilhon, <strong>Entropy-conservative high-order methods for high-enthalpy flows</strong>, 2024.<br/> <a 
href="https://arxiv.org/abs/2403.16882"><img src="https://img.shields.io/badge/arXiv-2403.16882-yellow" alt="arXiv:2403.16882" /></a> <a href="https://github.com/knstmrd/paper-ec_trixi_inte"><img src="https://img.shields.io/badge/reproduce-me&#33;-brightgreen" alt="reproduce me&#33;" /></a></p> <li><p><em>Doehring</em>, <em>Schlottke-Lakemper</em>, <em>Gassner</em>, Torrilhon, <strong>Multirate Time-Integration based on Dynamic ODE Partitioning through Adaptively Refined Meshes for Compressible Fluid Dynamics</strong>, 2024.<br/> <a href="https://arxiv.org/abs/2403.05144"><img src="https://img.shields.io/badge/arXiv-2403.05144-yellow" alt="arXiv:2403.05144" /></a> <a href="https://doi.org/10.1016/j.jcp.2024.113223"><img src="https://zenodo.org/badge/doi/10.1016/j.jcp.2024.113223.svg" alt="doi:10.1016/j.jcp.2024.113223" /></a> <a href="https://github.com/trixi-framework/paper-2024-amr-paired-rk"><img src="https://img.shields.io/badge/reproduce-me&#33;-brightgreen" alt="reproduce me&#33;" /></a></p> <li><p>Babbar, Chandrashekar, <strong>Multi-Derivative Runge-Kutta Flux Reconstruction for Hyperbolic Conservation Laws</strong>, 2024.<br/> <a href="https://arxiv.org/abs/2403.02141"><img src="https://img.shields.io/badge/arXiv-2403.02141-yellow" alt="arXiv:2403.02141" /></a></p> <li><p><em>Lampert</em>, <em>Ranocha</em>, <strong>Structure-Preserving Numerical Methods for Two Nonlinear Systems of Dispersive Wave Equations</strong>, 2024.<br/> <a href="https://arxiv.org/abs/2402.16669"><img src="https://img.shields.io/badge/arXiv-2402.16669-yellow" alt="arXiv:2402.16669" /></a> <a href="https://github.com/JoshuaLampert/2024_dispersive_shallow_water"><img src="https://img.shields.io/badge/reproduce-me&#33;-brightgreen" alt="reproduce me&#33;" /></a></p> <li><p>Rueda-Ramírez, Sikstel, <em>Gassner</em>, <strong>An Entropy-Stable Discontinuous Galerkin Discretization of the Ideal Multi-Ion Magnetohydrodynamics System</strong>, 2024.<br/> <a 
href="https://arxiv.org/abs/2402.14615"><img src="https://img.shields.io/badge/arXiv-2402.14615-yellow" alt="arXiv:2402.14615" /></a></p> <li><p><em>Doehring</em>, <em>Gassner</em>, Torrilhon, <strong>Many-Stage Optimal Stabilized Runge-Kutta Methods for Hyperbolic Partial Differential Equations</strong>, 2024.<br/> <a href="https://arxiv.org/abs/2402.12140"><img src="https://img.shields.io/badge/arXiv-2402.12140-yellow" alt="arXiv:2402.12140" /></a></p> <li><p>Babbar, Chandrashekar, <strong>Lax-Wendroff Flux Reconstruction for advection-diffusion equations with error-based time stepping</strong>, 2024.<br/> <a href="https://arxiv.org/abs/2402.12669"><img src="https://img.shields.io/badge/arXiv-2402.12669-yellow" alt="arXiv:2402.12669" /></a></p> <li><p>Babbar, Chandrashekar, <strong>Lax-Wendroff Flux Reconstruction on adaptive curvilinear meshes with error based time stepping for hyperbolic conservation laws</strong>, 2024.<br/> <a href="https://arxiv.org/abs/2402.11926"><img src="https://img.shields.io/badge/arXiv-2402.11926-yellow" alt="arXiv:2402.11926" /></a></p> </ul> <h3 id=2023 ><a href="#2023" class=header-anchor >2023</a></h3> <ul> <li><p>Babbar, Kenettinkara, Chandrashekar, <strong>Admissibility preserving subcell limiter for Lax-Wendroff flux reconstruction</strong>, 2023.<br/> <a href="https://arxiv.org/abs/2305.10781"><img src="https://img.shields.io/badge/arXiv-2305.10781-yellow" alt="arXiv:2305.10781" /></a></p> <li><p>Ovadia, Oommen, Kahana, Peyvan, Turkel, Karniadakis, <strong>Real-time Inference and Extrapolation via a Diffusion-inspired Temporal Transformer Operator &#40;DiTTO&#41;</strong>, 2023.<br/> <a href="https://arxiv.org/abs/2307.09072"><img src="https://img.shields.io/badge/arXiv-2307.09072-yellow" alt="arXiv:2307.09072" /></a></p> <li><p><em>Ranocha</em>, <em>Winters</em>, <em>Schlottke-Lakemper</em>, Öffner, Glaubitz, <em>Gassner</em>, <strong>High-order upwind summation-by-parts methods for nonlinear conservation laws</strong>, 
2023.<br/> <a href="https://arxiv.org/abs/2311.13888"><img src="https://img.shields.io/badge/arXiv-2311.13888-yellow" alt="arXiv:2311.13888" /></a> <a href="https://github.com/trixi-framework/paper-2023-upwind"><img src="https://img.shields.io/badge/reproduce-me&#33;-brightgreen" alt="reproduce me&#33;" /></a></p> <li><p><em>Ranocha</em>, Schütz, <strong>Multiderivative time integration methods preserving nonlinear functionals via relaxation</strong>, 2023.<br/> <a href="https://arxiv.org/abs/2311.03883"><img src="https://img.shields.io/badge/arXiv-2311.03883-yellow" alt="arXiv:2311.03883" /></a> <a href="https://doi.org/10.2140/camcos.2024.19.27"><img src="https://zenodo.org/badge/doi/10.2140/camcos.2024.19.27.svg" alt="doi:10.2140/camcos.2024.19.27" /></a> <a href="https://github.com/ranocha/2023_multiderivative_relaxation"><img src="https://img.shields.io/badge/reproduce-me&#33;-brightgreen" alt="reproduce me&#33;" /></a></p> <li><p><em>Ranocha</em>, Giesselmann, <strong>Stability of step size control based on a posteriori error estimates</strong>, 2023.<br/> <a href="https://arxiv.org/abs/2307.12677"><img src="https://img.shields.io/badge/arXiv-2307.12677-yellow" alt="arXiv:2307.12677" /></a> <a href="https://github.com/ranocha/2023_RK_error_estimate"><img src="https://img.shields.io/badge/reproduce-me&#33;-brightgreen" alt="reproduce me&#33;" /></a></p> <li><p><em>Chan</em>, Shukla, Wu, Liu, Nalluri, <strong>High order entropy stable schemes for the quasi-one-dimensional shallow water and compressible Euler equations</strong>, 2023.<br/> <a href="https://arxiv.org/abs/2307.12089"><img src="https://img.shields.io/badge/arXiv-2307.12089-yellow" alt="arXiv:2307.12089" /></a></p> <li><p>Ersing, <em>Winters</em>, <strong>An entropy stable discontinuous Galerkin method for the two-layer shallow water equations on curvilinear meshes</strong>, 2023.<br/> <a href="https://arxiv.org/abs/2306.12699"><img src="https://img.shields.io/badge/arXiv-2306.12699-yellow" 
alt="arXiv:2306.12699" /></a> <a href="https://doi.org/10.1007/s10915-024-02451-2"><img src="https://zenodo.org/badge/doi/10.1007/s10915-024-02451-2.svg" alt="doi:10.1007/s10915-024-02451-2" /></a> <a href="https://github.com/trixi-framework/paper-2023-es_two_layer"><img src="https://img.shields.io/badge/reproduce-me&#33;-brightgreen" alt="reproduce me&#33;" /></a></p> <li><p>Rueda-Ramírez, Bolm, Kuzmin, <em>Gassner</em>, <strong>Monolithic Convex Limiting for Legendre–Gauss–Lobatto Discontinuous Galerkin Spectral Element Methods</strong>, 2023.<br/> <a href="https://arxiv.org/abs/2303.00374"><img src="https://img.shields.io/badge/arXiv-2303.00374-yellow" alt="arXiv:2303.00374" /></a> <a href="https://doi.org/10.1007/s42967-023-00321-6"><img src="https://zenodo.org/badge/doi/10.1007/s42967-023-00321-6.svg" alt="doi:10.1007/s42967-023-00321-6" /></a> <a href="https://github.com/amrueda/paper_2023_MCL_LGL-DGSEM"><img src="https://img.shields.io/badge/reproduce-me&#33;-brightgreen" alt="reproduce me&#33;" /></a></p> <li><p><em>Ranocha</em>, <strong>A discontinuous Galerkin discretization of elliptic problems with improved convergence properties using summation by parts operators</strong>, 2023.<br/> <a href="https://arxiv.org/abs/2302.12488"><img src="https://img.shields.io/badge/arXiv-2302.12488-yellow" alt="arXiv:2302.12488" /></a> <a href="https://doi.org/10.1016/j.jcp.2023.112367"><img src="https://zenodo.org/badge/doi/10.1016/j.jcp.2023.112367.svg" alt="doi:10.1016/j.jcp.2023.112367" /></a> <a href="https://github.com/ranocha/2023_elliptic"><img src="https://img.shields.io/badge/reproduce-me&#33;-brightgreen" alt="reproduce me&#33;" /></a></p> <li><p><em>Ranocha</em>, <em>Winters</em>, Castro, Dalcin, <em>Schlottke-Lakemper</em>, <em>Gassner</em>, Parsani, <strong>On error-based step size control for discontinuous Galerkin methods for compressible fluid dynamics</strong>, 2023.<br/> <a href="https://arxiv.org/abs/2209.07037"><img 
src="https://img.shields.io/badge/arXiv-2209.07037-yellow" alt="arXiv:2209.07037" /></a> <a href="https://doi.org/10.1007/s42967-023-00264-y"><img src="https://zenodo.org/badge/doi/10.1007/s42967-023-00264-y.svg" alt="doi:10.1007/s42967-023-00264-y" /></a> <a href="https://github.com/trixi-framework/paper-2022-stepsize_control"><img src="https://img.shields.io/badge/reproduce-me&#33;-brightgreen" alt="reproduce me&#33;" /></a></p> <li><p><em>Ranocha</em>, <em>Schlottke-Lakemper</em>, <em>Chan</em>, Rueda-Ramírez, <em>Winters</em>, Hindenlang, <em>Gassner</em>, <strong>Efficient implementation of modern entropy stable and kinetic energy preserving discontinuous Galerkin methods for conservation laws</strong>, ACM Transactions on Mathematical Software, 2023.<br/> <a href="https://arxiv.org/abs/2112.10517"><img src="https://img.shields.io/badge/arXiv-2112.10517-yellow" alt="arXiv:2112.10517" /></a> <a href="https://doi.org/10.1145/3625559"><img src="https://zenodo.org/badge/doi/10.1145/3625559.svg" alt="doi:10.1145/3625559" /></a> <a href="https://github.com/trixi-framework/paper-2021-EC_performance"><img src="https://img.shields.io/badge/reproduce-me&#33;-brightgreen" alt="reproduce me&#33;" /></a></p> </ul> <h3 id=2022 ><a href="#2022" class=header-anchor >2022</a></h3> <ul> <li><p><em>Chan</em>, <em>Ranocha</em>, Rueda-Ramírez, <em>Gassner</em>, Warburton, <strong>On the entropy projection and the robustness of high order entropy stable discontinuous Galerkin schemes for under-resolved flows</strong>, 2022.<br/> <a href="https://arxiv.org/abs/2203.10238"><img src="https://img.shields.io/badge/arXiv-2203.10238-yellow" alt="arXiv:2203.10238" /></a> <a href="https://doi.org/10.3389/fphy.2022.898028"><img src="https://zenodo.org/badge/doi/10.3389/fphy.2022.898028.svg" alt="doi:10.3389/fphy.2022.898028" /></a> <a href="https://github.com/trixi-framework/paper-2022-robustness-entropy-projection"><img src="https://img.shields.io/badge/reproduce-me&#33;-brightgreen" 
alt="reproduce me&#33;" /></a></p> <li><p>Rueda-Ramírez, Pazner, <em>Gassner</em>, <strong>Subcell limiting strategies for discontinuous Galerkin spectral element methods</strong>, 2022.<br/> <a href="https://arxiv.org/abs/2202.00576"><img src="https://img.shields.io/badge/arXiv-2202.00576-yellow" alt="arXiv:2202.00576" /></a> <a href="https://doi.org/10.1016/j.compfluid.2022.105627"><img src="https://zenodo.org/badge/doi/10.1016/j.compfluid.2022.105627.svg" alt="doi:10.1016/j.compfluid.2022.105627" /></a></p> <li><p>Lukáčová-Medvid’ová, Öffner, <strong>Convergence of Discontinuous Galerkin Schemes for the Euler Equations via Dissipative Weak Solutions</strong>, 2022.<br/> <a href="https://arxiv.org/abs/2202.10043"><img src="https://img.shields.io/badge/arXiv-2202.10043-yellow" alt="arXiv:2202.10043" /></a> <a href="https://doi.org/10.1016/j.amc.2022.127508"><img src="https://zenodo.org/badge/doi/10.1016/j.amc.2022.127508.svg" alt="doi:10.1016/j.amc.2022.127508" /></a></p> <li><p><em>Ranocha</em>, <strong>A Note on Numerical Fluxes Conserving Harten&#39;s Entropies for the Compressible Euler Equations</strong>, 2022.<br/> <a href="https://arxiv.org/abs/2201.03946"><img src="https://img.shields.io/badge/arXiv-2201.03946-yellow" alt="arXiv:2201.03946" /></a> <a href="https://doi.org/10.1016/j.jcp.2022.111236"><img src="https://zenodo.org/badge/doi/10.1016/j.jcp.2022.111236.svg" alt="doi:10.1016/j.jcp.2022.111236" /></a> <a href="https://github.com/ranocha/paper-2022-Euler_Harten_EC"><img src="https://img.shields.io/badge/reproduce-me&#33;-brightgreen" alt="reproduce me&#33;" /></a></p> <li><p><em>Ranocha</em>, <em>Schlottke-Lakemper</em>, <em>Winters</em>, <em>Faulhaber</em>, <em>Chan</em>, <em>Gassner</em>, <strong>Adaptive numerical simulations with Trixi.jl: A case study of Julia for scientific computing</strong>, JuliaCon Proceedings, 77, 2022.<br/> <a href="https://arxiv.org/abs/2108.06476"><img src="https://img.shields.io/badge/arXiv-2108.06476-yellow" 
alt="arXiv:2108.06476" /></a> <a href="https://doi.org/10.21105/jcon.00077"><img src="https://zenodo.org/badge/doi/10.21105/jcon.00077.svg" alt="doi:10.21105/jcon.00077" /></a> <a href="https://github.com/trixi-framework/paper-2021-juliacon"><img src="https://img.shields.io/badge/reproduce-me&#33;-brightgreen" alt="reproduce me&#33;" /></a></p> <li><p><em>Gassner</em>, Svärd, Hindenlang, <strong>Stability Issues of Entropy-Stable and/or Split-form High-order Schemes</strong>, 2022.<br/> <a href="https://arxiv.org/abs/2007.09026"><img src="https://img.shields.io/badge/arXiv-2007.09026-yellow" alt="arXiv:2007.09026" /></a> <a href="https://doi.org/10.1007/s10915-021-01720-8"><img src="https://zenodo.org/badge/doi/10.1007/s10915-021-01720-8.svg" alt="doi:10.1007/s10915-021-01720-8" /></a></p> </ul> <h3 id=2021 ><a href="#2021" class=header-anchor >2021</a></h3> <ul> <li><p>Singh, Chandrashekar, <strong>On a linear stability issue of split form schemes for compressible flows</strong>, 2021.<br/> <a href="https://arxiv.org/abs/2104.14941"><img src="https://img.shields.io/badge/arXiv-2104.14941-yellow" alt="arXiv:2104.14941" /></a></p> <li><p><em>Ranocha</em>, <em>Gassner</em>, <strong>Preventing pressure oscillations does not fix local linear stability issues of entropy-based split-form high-order schemes</strong>, Communications on Applied Mathematics and Computation, 2021.<br/> <a href="https://arxiv.org/abs/2009.13139"><img src="https://img.shields.io/badge/arXiv-2009.13139-yellow" alt="arXiv:2009.13139" /></a> <a href="https://doi.org/10.1007/s42967-021-00148-z"><img src="https://zenodo.org/badge/doi/10.1007/s42967-021-00148-z.svg" alt="doi:10.1007/s42967-021-00148-z" /></a> <a href="https://github.com/trixi-framework/paper-EC-KEP-PEP"><img src="https://img.shields.io/badge/reproduce-me&#33;-brightgreen" alt="reproduce me&#33;" /></a></p> <li><p><em>Schlottke-Lakemper</em>, <em>Winters</em>, <em>Ranocha</em>, <em>Gassner</em>, <strong>A purely hyperbolic 
discontinuous Galerkin approach for self-gravitating gas dynamics</strong>, Journal of Computational Physics &#40;442&#41;, 110467, 2021.<br/> <a href="https://arxiv.org/abs/2008.10593"><img src="https://img.shields.io/badge/arXiv-2008.10593-yellow" alt="arXiv:2008.10593" /></a> <a href="https://doi.org/10.1016/j.jcp.2021.110467"><img src="https://zenodo.org/badge/doi/10.1016/j.jcp.2021.110467.svg" alt="doi:10.1016/j.jcp.2021.110467" /></a> <a href="https://github.com/trixi-framework/paper-self-gravitating-gas-dynamics"><img src="https://img.shields.io/badge/reproduce-me&#33;-brightgreen" alt="reproduce me&#33;" /></a></p> </ul> <h2 id=talks ><a href="#talks" class=header-anchor >Talks</a></h2> <h3 id=2024__2 ><a href="#2024__2" class=header-anchor >2024</a></h3> <ul> <li><p><strong>Non-intrusive Multirate Time-Integration for High-Order accurate Compressible Fluid Dynamics with Trixi.jl</strong><br/> <em>Doehring</em><br/> 2nd July 2024, PDESoft 2024, Cambridge, UK</p> </ul> <h3 id=2023__2 ><a href="#2023__2" class=header-anchor >2023</a></h3> <ul> <li><p><strong>Challenges of sustainable research software engineering in Trixi.jl</strong><br/> <em>Schlottke-Lakemper</em><br/> 27th October 2023, MBD Colloquium, Aachen, Germany</p> <li><p><strong>Julia for scientific high-performance computing: opportunities and challenges</strong><br/> <em>Schlottke-Lakemper</em><br/> 6th October 2023, Ferrite.jl User &amp; Developer Conference, Bochum, Germany</p> <li><p><strong>Scaling Trixi.jl to more than 10,000 cores using MPI</strong><br/> <em>Schlottke-Lakemper</em>, <em>Ranocha</em><br/> 27th July 2023, JuliaCon 2023, Cambridge, US</p> <li><p><strong>Massively Parallel Computational Fluid Dynamics with Julia and Trixi.jl</strong><br/> <em>Schlottke-Lakemper</em><br/> 28th June 2023, PASC Conference, Davos, Switzerland</p> <li><p><strong>Research Software Engineering for Sustainable Scientific Computing</strong><br/> <em>Schlottke-Lakemper</em><br/> 30th January 2023, SSD 
Seminar Series, Aachen, Germany</p> <li><p><strong>Trixi.jl: High-Order Numerical Simulations of Conservation Laws in Julia</strong><br/> <em>Schlottke-Lakemper</em><br/> 19th January 2023, SNuBIC Seminar<br/> <a href="https://github.com/trixi-framework/tutorial-2023-snubic">tutorials &amp; notebooks</a></p> </ul> <h3 id=2022__2 ><a href="#2022__2" class=header-anchor >2022</a></h3> <ul> <li><p><strong>Robust and efficient high-performance computational fluid dynamics enabled by modern numerical methods and technologies</strong><br/> <em>Ranocha</em><br/> 3rd November 2022, MUSEN Colloquium, TU Braunschweig, Germany</p> <li><p><strong>Reproducibility as a service: collaborative scientific computing with Julia</strong><br/> <em>Schlottke-Lakemper</em>, <em>Ranocha</em><br/> 27th October 2022, MaRDI Workshop for Scientific Computing, Münster, Germany</p> <li><p><strong>From Mesh Generation to Adaptive Simulation: A Journey in Julia</strong><br/> <em>Winters</em><br/> 27th July 2022, JuliaCon 2022<br/> <a href="https://youtu.be/_N4ozHr-t9E">recorded talk on YouTube</a> | <a href="https://github.com/trixi-framework/talk-2022-juliacon_toolchain">presentation &amp; code</a></p> <li><p><strong>Running Julia code in parallel with MPI: Lessons learned</strong><br/> <em>Christmann</em>, Neher, <em>Schlottke-Lakemper</em><br/> 26th July 2022, Julia for HPC Minisymposium, JuliaCon 2022<br/> <a href="https://youtu.be/fog1x9rs71Q?t&#61;5172">recorded talk on YouTube</a> | <a href="https://github.com/JuliaParallel/juliacon-2022-julia-for-hpc-minisymposium">presentation</a></p> <li><p><strong>Extensible Computational Fluid Dynamics in Julia with Trixi.jl</strong><br/> <em>Schlottke-Lakemper</em>, <em>Ranocha</em>, <em>Gassner</em><br/> 25th February 2022, SIAM Conference on Parallel Processing for Scientific Computing, Seattle, US</p> </ul> <h3 id=2021__2 ><a href="#2021__2" class=header-anchor >2021</a></h3> <ul> <li><p><strong>Research software development with 
Julia</strong><br/> <em>Schlottke-Lakemper</em>, <em>Ranocha</em><br/> 27th September 2021, NFDI4Ing Conference 2021</p> <li><p><strong>Adaptive high-order numerical simulations with Trixi.jl</strong><br/> <em>Schlottke-Lakemper</em>, <em>Ranocha</em><br/> 9th September 2021, CliMA Seminar, California Institute of Technology</p> <li><p><strong>Adaptive and extendable numerical simulations with Trixi.jl</strong><br/> <em>Schlottke-Lakemper</em>, <em>Ranocha</em><br/> 30th July 2021, JuliaCon 2021<br/> <a href="https://github.com/trixi-framework/talk-2021-juliacon">presentation &amp; notebooks</a> | <a href="https://www.youtube.com/watch?v&#61;hoViWRAhCBE">recorded talk on YouTube</a></p> <li><p><strong>Trixi.jl: High-Order Numerical Simulations of Hyperbolic PDEs in Julia</strong><br/> <em>Ranocha</em>, <em>Schlottke-Lakemper</em>, <em>Winters</em><br/> 14th July 2021, ICOSAHOM 2021<br/> <a href="https://github.com/trixi-framework/tutorial-2021-icosahom">tutorials &amp; notebooks</a></p> <li><p><strong>Introduction to Julia and Trixi, a numerical simulation framework for hyperbolic PDEs</strong><br/> <em>Ranocha</em><br/> 27th April 2021, Applied Mathematics Seminar, University of Münster<br/> <a href="https://github.com/trixi-framework/talk-2021-Introduction_to_Julia_and_Trixi">presentation</a></p> <li><p><strong>Purely hyperbolic self-gravitating flow simulations in Julia</strong><br/> <em>Schlottke-Lakemper</em>, <em>Winters</em>, <em>Ranocha</em>, <em>Gassner</em><br/> 15th March 2021, GAMM Annual Meeting 2021</p> <li><p><strong>Julia for adaptive high-order multi-physics simulations</strong><br/> <em>Schlottke-Lakemper</em><br/> 27th January 2021, Numerical Analysis Seminar, Lund University<br/> <a href="https://github.com/trixi-framework/talk-2021-julia-adaptive-multi-physics-simulations">presentation &amp; notebooks</a></p> </ul> <h2 id=outreach ><a href="#outreach" class=header-anchor >Outreach</a></h2> <h3 id=google_summer_of_code_2023 ><a 
href="#google_summer_of_code_2023" class=header-anchor >Google Summer of Code 2023</a></h3> <p>Trixi.jl participated in the Google Summer of Code 2023, marking its initial steps towards running on GPUs. This project was mentored by <a href="https://ranocha.de/">Hendrik Ranocha</a> and <a href="https://www.uni-augsburg.de/fakultaet/mntf/math/prof/hpsc">Michael Schlottke-Lakemper</a>. <a href="outreach/gsoc/2023/gpu-acceleration-in-trixi-jl-using-cuda-jl">Here</a> you can find the report from our contributor <a href="https://github.com/huiyuxie">Huiyu Xie</a>.</p> <h2 id=authors ><a href="#authors" class=header-anchor >Authors</a></h2> <p><a href="https://www.uni-augsburg.de/fakultaet/mntf/math/prof/hpsc">Michael Schlottke-Lakemper</a> &#40;University of Augsburg, Germany&#41;, <a href="https://www.mi.uni-koeln.de/NumSim/gregor-gassner">Gregor Gassner</a> &#40;University of Cologne, Germany&#41;, <a href="https://ranocha.de/">Hendrik Ranocha</a> &#40;University of Hamburg, Germany&#41;, <a href="https://liu.se/en/employee/andwi94">Andrew Winters</a> &#40;Linköping University, Sweden&#41;, and <a href="https://jlchan.github.io/">Jesse Chan</a> &#40;Rice University, US&#41; are the principal developers of <a href="https://github.com/trixi-framework/Trixi.jl">Trixi.jl</a>. <a href="https://www.math.fsu.edu/~kopriva/">David A. Kopriva</a> &#40;Florida State University, US&#41; is the principal developer of <a href="https://github.com/trixi-framework/HOHQMesh">HOHQMesh</a> and <a href="https://github.com/trixi-framework/HOHQMesh.jl">HOHQMesh.jl</a>. 
For a full list of authors, please check out the respective packages.</p> <h2 id=get_in_touch ><a href="#get_in_touch" class=header-anchor >Get in touch&#33;</a></h2> <p>There are a number of ways to reach out to us:</p> <ul> <li><p>Meet us on <a href="https://join.slack.com/t/trixi-framework/shared_invite/zt-sgkc6ppw-6OXJqZAD5SPjBYqLd8MU~g">Slack</a></p> <li><p>Create an issue in one of the repositories listed on this page</p> <li><p>Get in touch with one of the <a href="https://github.com/trixi-framework/Trixi.jl/blob/main/AUTHORS.md">Trixi Authors</a></p> </ul> <h2 id=acknowledgments ><a href="#acknowledgments" class=header-anchor >Acknowledgments</a></h2> <div style="width: 100%; text-align: center; font-size: 0;"> <div><!-- BMBF --><img src="https://github.com/trixi-framework/Trixi.jl/assets/3637659/f59af636-3098-4be6-bf80-c6be3f17cbc6" style="height: 120px; width: auto"><!-- DFG --><img src="https://github.com/trixi-framework/Trixi.jl/assets/3637659/e67b9ed3-7699-466a-bdaf-2ba070a29a8e" style="height: 120px; width: auto"><!-- --> </div> <div><!-- SRC --><img src="https://github.com/trixi-framework/Trixi.jl/assets/3637659/48f9da06-6f7a-4586-b23e-739bee3901c0" style="height: 120px; width: auto"><!-- ERC --><img src="https://github.com/trixi-framework/Trixi.jl/assets/3637659/9371e7e4-3491-4433-ac5f-b3bfb215f5ca" style="height: 120px; width: auto"><!-- --> </div> <div><!-- NSF --><img src="https://github.com/trixi-framework/Trixi.jl/assets/3637659/5325103c-ae81-4747-b87c-c6e4a1b1d7a8" style="height: 120px; width: auto"><!-- DUBS --><img src="https://github.com/trixi-framework/Trixi.jl/assets/3637659/bb021e6e-42e6-4fe1-a414-c847402e1937" style="height: 120px; width: auto"><!-- --> </div> <div><!-- NumFOCUS --><img src="https://github.com/trixi-framework/Trixi.jl/assets/3637659/8496ac9e-b586-475f-adb7-69bcfc415185" style="height: 120px; width: auto"><!-- --> </div> </div> <p>This project has benefited from funding by the <a href="https://www.dfg.de/">Deutsche 
Forschungsgemeinschaft</a> &#40;DFG, German Research Foundation&#41; through the following grants:</p> <ul> <li><p>Excellence Strategy EXC 2044-390685587, Mathematics Münster: Dynamics-Geometry-Structure.</p> <li><p>Research unit FOR 5409 &quot;Structure-Preserving Numerical Methods for Bulk- and Interface Coupling of Heterogeneous Models &#40;SNuBIC&#41;&quot; &#40;project number 463312734&#41;.</p> <li><p>Individual grant no. 528753982.</p> </ul> <p>This project has benefited from funding from the <a href="https://erc.europa.eu">European Research Council</a> through the ERC Starting Grant &quot;An Exascale aware and Un-crashable Space-Time-Adaptive Discontinuous Spectral Element Solver for Non-Linear Conservation Laws&quot; &#40;Extreme&#41;, ERC grant agreement no. 714487.</p> <p>This project has benefited from funding from <a href="https://www.vr.se">Vetenskapsrådet</a> &#40;VR, Swedish Research Council&#41;, Sweden through the VR Starting Grant &quot;Shallow water flows including sediment transport and morphodynamics&quot;, VR grant agreement 2020-03642 VR.</p> <p>This project has benefited from funding from the United States <a href="https://www.nsf.gov/">National Science Foundation</a> &#40;NSF&#41; under awards DMS-1719818 and DMS-1943186.</p> <p>This project has benefited from funding from the German <a href="https://www.bmbf.de">Federal Ministry of Education and Research</a> &#40;BMBF&#41; through the project grant &quot;Adaptive earth system modeling with significantly reduced computation time for exascale supercomputers &#40;ADAPTEX&#41;&quot; &#40;funding id: 16ME0668K&#41;.</p> <p>This project has benefited from funding by the <a href="https://www.daimler-benz-stiftung.de">Daimler und Benz Stiftung</a> &#40;Daimler and Benz Foundation&#41; through grant no. 
32-10/22.</p> <p>Trixi.jl is supported by <a href="https://numfocus.org/">NumFOCUS</a> as an Affiliated Project.</p> <div class=page-foot > <div class=copyright > &copy; <a href="https://github.com/trixi-framework/Trixi.jl/blob/main/AUTHORS.md" target=_blank  rel="noopener noreferrer">The Trixi Authors</a>. Last modified: September 04, 2024. Website built with <a href="https://github.com/tlienart/Franklin.jl">Franklin.jl</a> and the <a href="https://julialang.org">Julia programming language</a>. </div> </div> </div>
\ No newline at end of file
diff --git a/outreach/gsoc/2023/gpu-acceleration-in-trixi-jl-using-cuda-jl/index.html b/outreach/gsoc/2023/gpu-acceleration-in-trixi-jl-using-cuda-jl/index.html
index fd335eb..15b78e5 100644
--- a/outreach/gsoc/2023/gpu-acceleration-in-trixi-jl-using-cuda-jl/index.html
+++ b/outreach/gsoc/2023/gpu-acceleration-in-trixi-jl-using-cuda-jl/index.html
@@ -1,6 +1,6 @@
-<!doctype html> <html lang=en > <meta charset=UTF-8 > <meta name=viewport  content="width=device-width, initial-scale=1"> <link rel=stylesheet  href="/libs/highlight/github.min.css"> <link rel=stylesheet  href="/css/franklin.css"> <link rel=stylesheet  href="/css/basic.css"> <link rel=icon  href="/assets/favicon.ico"> <title>GSoC 2023: GPU acceleration in Trixi.jl using CUDA.jl</title> <header> <div class=blog-name ><a href="/"><img src="/assets/logo.png" width=100px ></a></div> <nav> <ul> <li><a href="/">Home</a> <li><a href="https://github.com/trixi-framework" target=_blank  rel="noopener noreferrer">Trixi on GitHub</a> </ul> </nav> </header> <div class=franklin-content ><h1 id=gsoc_2023_gpu_acceleration_in_trixijl_using_cudajl ><a href="#gsoc_2023_gpu_acceleration_in_trixijl_using_cudajl" class=header-anchor >GSoC 2023: GPU acceleration in Trixi.jl using CUDA.jl</a></h1> <ul> <li><p>Mentee: <a href="https://github.com/huiyuxie">Huiyu Xie</a></p> <li><p>Mentors: <a href="https://github.com/ranocha">Hendrik Ranocha</a> and <a href="https://github.com/sloede">Michael Schlottke-Lakemper</a></p> <li><p>Project Link: <a href="https://github.com/huiyuxie/trixi_cuda">https://github.com/huiyuxie/trixi&#95;cuda</a></p> </ul> <p>The goal of this GSoC project was to accelerate Trixi.jl using GPUs.</p> <p><strong>Table of contents</strong> <div class=franklin-toc ><ol><li><a href="#project_overview">Project Overview</a><li><a href="#key_highlights">Key Highlights</a><li><a href="#performance_benchmarks">Performance Benchmarks</a><li><a href="#future_work">Future Work</a><li><a href="#acknowledgements">Acknowledgements</a></ol></div></p> <h2 id=project_overview ><a href="#project_overview" class=header-anchor >Project Overview</a></h2> <p>The project was focused on enhancing the <a href="https://github.com/trixi-framework/Trixi.jl">Trixi.jl</a> numerical simulation framework, a prominent tool used for solving hyperbolic conservation laws within the Julia programming 
language. The primary aim was to introduce GPU support through <a href="https://github.com/JuliaGPU/CUDA.jl">CUDA.jl</a> and essentially <a href="https://docs.nvidia.com/cuda">CUDA</a> to accelerate the discretization processes used in solving partial differential equations &#40;PDEs&#41;. This work was undertaken as part of the <a href="https://summerofcode.withgoogle.com/">Google Summer of Code 2023</a> program, and the progress is summarized below:</p> <ul> <li><p>GPU Implementation: The GPU implementations were prototyped using CUDA, starting with 1D equation kernels and gradually extending to more complex 2D and 3D equation kernels. These developments formed the backbone of the Discontinuous Galerkin Collocation Spectral Element Method &#40;DGSEM&#41; in the framework.</p> <li><p>Performance Benchmarks: A series of benchmarks were conducted on the developed CUDA kernels to assess their efficiency. These benchmarks demonstrated substantial performance enhancements through a strategic integration of various factors including data transfer, kernel architecture, and method characteristics.</p> <li><p>Acceleration Extension: The GPU support was not limited to basic kernels but was extended to more intricate methods within the framework. 
This included integration with the DG solver that allows meshes with simplex elements &#40;DGMulti&#41; and other summation-by-parts &#40;SBP&#41; schemes like Finite Differences &#40;FD&#41; and Continuous Galerkin Spectral Element Method &#40;CGSEM&#41; through the DGMulti solver.</p> </ul> <p>Please note that the third step was planned but remains incomplete due to time constraints and this step will be completed in the future if possible.</p> <h4 id=how_to_setup ><a href="#how_to_setup" class=header-anchor >How to Setup </a></h4> <p>This project was entirely set up and tested on Amazon Web Services &#40;AWS&#41;, and the instance type chosen was <code>p3.2xlarge</code> &#40;see <a href="https://aws.amazon.com/ec2/instance-types/#Accelerated_Computing">the link</a> for more details&#41;. Here is the link to the specific information of both <a href="https://github.com/huiyuxie/trixi_cuda/blob/main/docs/env_info.md">CPU and GPU</a> used for this project. Note that this project is reproducible by following the setup instructions provided link aboout <a href="https://github.com/huiyuxie/trixi_cuda/blob/main/docs/project_setup.md">how to set up environment</a>. Also, for individuals without an Nvidia GPU but interested in experimenting with CUDA, here is a link detailing how to <a href="https://github.com/huiyuxie/trixi_cuda/blob/main/docs/aws_gpu_setup.md">set up a cloud GPU on AWS</a>.</p> <h2 id=key_highlights ><a href="#key_highlights" class=header-anchor >Key Highlights</a></h2> <p>The overview of the project repository can be accessed through this <a href="https://github.com/huiyuxie/trixi_cuda">README</a> file. 
Here is a detailed description of the highlights of this project.</p> <h4 id=kernel_prototyping ><a href="#kernel_prototyping" class=header-anchor ><ol> <li><p>Kernel Prototyping</p> </ol> </a></h4> <p>Several function &#40;kernel&#41; naming rules were applied in the kernel prototyping process:</p> <ul> <li><p>The functions for GPU kernel parallel computing must end with <code>_kernel</code></p> <li><p>The functions for calling the GPU kernels must begin with <code>cuda_</code></p> </ul> <p>These rules make the whole structure of the GPU code consistent with the original CPU code. Also, the implementation essentially revolves around three points: </p> <ul> <li><p>Using custom kernel implementation instead of direct array &#40;and matrix&#41; operations through <code>CuArray</code> type</p> <li><p>Avoiding the use of conditional statement &#40;like <code>if/else</code> branches&#41; from the original CPU code</p> <li><p>Minimizing the number of GPU kernel calls within a certain function &#40;like <code>cuda_volume_integral&#33;</code>&#41;</p> </ul> <p>Based on these points, the work began with <code>dg_1d.jl</code>, and then extended to <code>dg_2d.jl</code> and <code>dg_3d.jl</code> under the <code>src/solvers/dgsem_tree</code> directory. The prototying mainly focused on the <code>rhs&#33;</code> functions that are called in the <code>semidiscretize</code> process. 
Besides, it is worthwhile to mention some caveats in this GPU prototyping process:</p> <ul> <li><p>CUDA.jl does not support dynamic array access and the use of other dynamic types inside kernels &#40;<a href="https://github.com/huiyuxie/trixi_cuda/issues/6">Issue #6</a>, <a href="https://github.com/huiyuxie/trixi_cuda/issues/8">Issue #8</a>, and <a href="https://github.com/huiyuxie/trixi_cuda/issues/11">Issue #11</a>&#41; </p> <li><p>GPU parallel computing can run into race conditions &#40;<a href="https://github.com/huiyuxie/trixi_cuda/issues/5">Issue #5</a>&#41;</p> <li><p>The <code>Float32</code> type can be promoted to <code>Float64</code> type in the GPU computing process &#40;<a href="https://github.com/huiyuxie/trixi_cuda/issues/3">Issue #3</a> and <a href="https://github.com/trixi-framework/Trixi.jl/pull/1604">PR #1604</a>&#41;</p> </ul> <h4 id=ol_start2_kernel_configuration ><a href="#ol_start2_kernel_configuration" class=header-anchor ><ol start=2 > <li><p>Kernel Configuration </p> </ol> </a></h4> <p>The GPU kernels were designed to be launched with the appropriate size of threads and blocks. The occupancy API <code>CUDA.launch_configuration</code> was used to create kernel configurator functions for 1D, 2D, and 3D kernels &#40;i.e., <code>configurator_1d</code>, <code>configurator_2d</code>, and <code>configurator_3d</code>&#41;. </p> <p>Specifically, in kernel configurator functions, <code>CUDA.launch_configuration</code> would first return a suggested number of threads for the compiled but not yet run kernel, and then the number of blocks would be computed through dividing the corresponding array size by the number of threads. </p> <p>Thus, in the process of calling a GPU kernel, the kernel is first compiled and then run based on the returned launching data from the configurator. 
For example, it is common to see code like </p> <pre><code class="Julia hljs">sample_kernel = <span class=hljs-meta >@cuda</span> launch = <span class=hljs-literal >false</span> sample_kernel!(arg1, arg2, arg3)
+<!doctype html> <html lang=en > <meta charset=UTF-8 > <meta name=viewport  content="width=device-width, initial-scale=1"> <link rel=stylesheet  href="/libs/highlight/github.min.css"> <link rel=stylesheet  href="/css/franklin.css"> <link rel=stylesheet  href="/css/basic.css"> <link rel=icon  href="/assets/favicon.ico"> <title>GSoC 2023: GPU acceleration in Trixi.jl using CUDA.jl</title> <header> <div class=blog-name ><a href="/"><img src="/assets/logo.png" width=100px ></a></div> <nav> <ul> <li><a href="/">Home</a> <li><a href="https://github.com/trixi-framework" target=_blank  rel="noopener noreferrer">Trixi on GitHub</a> </ul> </nav> </header> <div class=franklin-content ><h1 id=gsoc_2023_gpu_acceleration_in_trixijl_using_cudajl ><a href="#gsoc_2023_gpu_acceleration_in_trixijl_using_cudajl" class=header-anchor >GSoC 2023: GPU acceleration in Trixi.jl using CUDA.jl</a></h1> <ul> <li><p>Mentee: <a href="https://github.com/huiyuxie">Huiyu Xie</a></p> <li><p>Mentors: <a href="https://github.com/ranocha">Hendrik Ranocha</a> and <a href="https://github.com/sloede">Michael Schlottke-Lakemper</a></p> <li><p>Project Link: <a href="https://github.com/czha/TrixiGPU.jl/tree/legacy">https://github.com/huiyuxie/trixi&#95;cuda</a></p> </ul> <p>The goal of this GSoC project was to accelerate Trixi.jl using GPUs.</p> <p><strong>Table of contents</strong> <div class=franklin-toc ><ol><li><a href="#project_overview">Project Overview</a><li><a href="#key_highlights">Key Highlights</a><li><a href="#performance_benchmarks">Performance Benchmarks</a><li><a href="#future_work">Future Work</a><li><a href="#acknowledgements">Acknowledgements</a></ol></div></p> <h2 id=project_overview ><a href="#project_overview" class=header-anchor >Project Overview</a></h2> <p>The project was focused on enhancing the <a href="https://github.com/trixi-framework/Trixi.jl">Trixi.jl</a> numerical simulation framework, a prominent tool used for solving hyperbolic conservation laws within the Julia programming 
language. The primary aim was to introduce GPU support through <a href="https://github.com/JuliaGPU/CUDA.jl">CUDA.jl</a> and essentially <a href="https://docs.nvidia.com/cuda">CUDA</a> to accelerate the discretization processes used in solving partial differential equations &#40;PDEs&#41;. This work was undertaken as part of the <a href="https://summerofcode.withgoogle.com/">Google Summer of Code 2023</a> program, and the progress is summarized below:</p> <ul> <li><p>GPU Implementation: The GPU implementations were prototyped using CUDA, starting with 1D equation kernels and gradually extending to more complex 2D and 3D equation kernels. These developments formed the backbone of the Discontinuous Galerkin Collocation Spectral Element Method &#40;DGSEM&#41; in the framework.</p> <li><p>Performance Benchmarks: A series of benchmarks were conducted on the developed CUDA kernels to assess their efficiency. These benchmarks demonstrated substantial performance enhancements through a strategic integration of various factors including data transfer, kernel architecture, and method characteristics.</p> <li><p>Acceleration Extension: The GPU support was not limited to basic kernels but was extended to more intricate methods within the framework. 
This included integration with the DG solver that allows meshes with simplex elements &#40;DGMulti&#41; and other summation-by-parts &#40;SBP&#41; schemes like Finite Differences &#40;FD&#41; and Continuous Galerkin Spectral Element Method &#40;CGSEM&#41; through the DGMulti solver.</p> </ul> <p>Please note that the third step was planned but remains incomplete due to time constraints; it will be completed in the future if possible.</p> <h3 id=how_to_setup ><a href="#how_to_setup" class=header-anchor >How to Setup </a></h3> <p>This project was entirely set up and tested on Amazon Web Services &#40;AWS&#41;, and the instance type chosen was <a href="https://aws.amazon.com/ec2/instance-types/#Accelerated_Computing"><code>p3.2xlarge</code></a>. Detailed information on both the <a href="https://github.com/czha/TrixiGPU.jl/blob/legacy/docs/env_info.md">CPU and GPU</a> used for this project is available. Note that this project is reproducible by following the setup instructions in the guide on <a href="https://github.com/czha/TrixiGPU.jl/blob/legacy/docs/project_setup.md">how to set up the environment</a>. Also, for those without an Nvidia GPU who are interested in experimenting with CUDA, here is a guide detailing how to <a href="https://github.com/czha/TrixiGPU.jl/blob/legacy/docs/aws_gpu_setup.md">set up a cloud GPU on AWS</a>.</p> <h2 id=key_highlights ><a href="#key_highlights" class=header-anchor >Key Highlights</a></h2> <p>An overview of the project repository is given in this <a href="https://github.com/czha/TrixiGPU.jl/blob/legacy/README.md">README.md</a> file. 
Here is a detailed description of the highlights of this project.</p> <h3 id=kernel_prototyping ><a href="#kernel_prototyping" class=header-anchor ><ol> <li><p>Kernel Prototyping</p> </ol> </a></h3> <p>Several function &#40;kernel&#41; naming rules were applied in the kernel prototyping process:</p> <ul> <li><p>The functions for GPU kernel parallel computing must end with <code>_kernel</code></p> <li><p>The functions for calling the GPU kernels must begin with <code>cuda_</code></p> </ul> <p>These rules make the whole structure of the GPU code consistent with the original CPU code. Also, the implementation essentially revolves around three points: </p> <ul> <li><p>Using custom kernel implementations instead of direct array &#40;and matrix&#41; operations through the <code>CuArray</code> type</p> <li><p>Avoiding the use of conditional statements &#40;like <code>if/else</code> branches&#41; from the original CPU code</p> <li><p>Minimizing the number of GPU kernel calls within a certain function &#40;like <code>cuda_volume_integral&#33;</code>&#41;</p> </ul> <p>Based on these points, the work began with <code>dg_1d.jl</code>, and then extended to <code>dg_2d.jl</code> and <code>dg_3d.jl</code> under the <code>src/solvers/dgsem_tree</code> directory. The prototyping mainly focused on the <code>rhs&#33;</code> functions that are called in the <code>semidiscretize</code> process. 
Besides, it is worthwhile to mention some caveats in this GPU prototyping process:</p> <ul> <li><p>CUDA.jl does not support dynamic array access and the use of other dynamic types inside kernels &#40;<a href="https://github.com/huiyuxie/trixi_cuda/issues/6">Issue #6</a>, <a href="https://github.com/huiyuxie/trixi_cuda/issues/8">Issue #8</a>, and <a href="https://github.com/huiyuxie/trixi_cuda/issues/11">Issue #11</a>&#41; </p> <li><p>GPU parallel computing can run into race conditions &#40;<a href="https://github.com/huiyuxie/trixi_cuda/issues/5">Issue #5</a>&#41;</p> <li><p>The <code>Float32</code> type can be promoted to <code>Float64</code> type in the GPU computing process &#40;<a href="https://github.com/huiyuxie/trixi_cuda/issues/3">Issue #3</a> and <a href="https://github.com/trixi-framework/Trixi.jl/pull/1604">PR #1604</a>&#41;</p> </ul> <h3 id=ol_start2_kernel_configuration ><a href="#ol_start2_kernel_configuration" class=header-anchor ><ol start=2 > <li><p>Kernel Configuration </p> </ol> </a></h3> <p>The GPU kernels were designed to be launched with the appropriate size of threads and blocks. The occupancy API <code>CUDA.launch_configuration</code> was used to create kernel configurator functions for 1D, 2D, and 3D kernels &#40;i.e., <code>configurator_1d</code>, <code>configurator_2d</code>, and <code>configurator_3d</code>&#41;. </p> <p>Specifically, in kernel configurator functions, <code>CUDA.launch_configuration</code> would first return a suggested number of threads for the compiled but not yet run kernel, and then the number of blocks would be computed through dividing the corresponding array size by the number of threads. </p> <p>Thus, in the process of calling a GPU kernel, the kernel is first compiled and then run based on the returned launching data from the configurator. 
For example, it is common to see code like </p> <pre><code class="Julia hljs">sample_kernel = <span class=hljs-meta >@cuda</span> launch = <span class=hljs-literal >false</span> sample_kernel!(arg1, arg2, arg3)
 sample_kernel(arg1, arg2, arg3; configurator_1d(sample_kernel, size_arr)...) <span class=hljs-comment ># Similar for `configurator_2d` and configurator_3d`</span></code></pre> <p>in kernel calling functions. In this way, the GPU kernels are configured and launched with the intention of achieving maximal occupancy &#40;i.e., optimizing the utilization of GPU computing resources&#41;, potentially enhancing overall performance. However, it should be noted that the maximal occupancy cannot be reached in most cases.</p> <p>Also, this method of kernel configuration may not be optimal, considering that it may not be applicable to different GPU versions. Given that</p> <pre><code class="Julia hljs">julia&gt; attribute(device(),CUDA.DEVICE_ATTRIBUTE_MAX_GRID_DIM_X) <span class=hljs-number >2147483647</span> 
-julia&gt; attribute(device(),CUDA.DEVICE_ATTRIBUTE_MAX_THREADS_PER_BLOCK) <span class=hljs-number >1024</span></code></pre> <p>the kernel could be addressed in the crrent GPU version but may not in some other GPU versions &#40;as different GPU gives different attribute data like <code>CUDA.DEVICE_ATTRIBUTE_MAX_GRID_DIM_X</code> and <code>CUDA.DEVICE_ATTRIBUTE_MAX_THREADS_PER_BLOCK</code>&#41;. So it was suggested to introduce the use of a stride loop for the current GPU kernels.</p> <h4 id=ol_start3_kernel_optimization ><a href="#ol_start3_kernel_optimization" class=header-anchor ><ol start=3 > <li><p>Kernel Optimization</p> </ol> </a></h4> <p>Some work on kernel optimization has already been done during the process of kernel prototyping, such as avoiding the use of conditional branches and minimizing kernel calls. But the general work for kernel optimization has not yet been introdued &#40;so this part is somewhat related to the future work&#41;. </p> <p>In summary, the kernel optimization should be based on kernel benchmarks and kernel profiling, and here are some factors that can be considered to improve performance:</p> <ul> <li><p>Data Transfer: The process of data transfer from CPU to GPU &#40;and back&#41; mainly occurs in the <code>rhs&#33;</code> function when calling <code>semidiscretize</code>. Since <code>rhs&#33;</code> is called multiple times during the process of time integration, it would be more efficient to complete the data transfer before calling the <code>rhs&#33;</code> function.</p> <li><p>Stride Loop Tuning: The stride loop has not yet been introduced to the current GPU kernels. 
By applying stride loop tuning, the loop structure &#40;i.e., stride length&#41; can be modified to improve performance.</p> <li><p>Multi-GPU/Multi-Thread: The performance can be further improved if multiple GPUs or multiple threads are used.</p> </ul> <h2 id=performance_benchmarks ><a href="#performance_benchmarks" class=header-anchor >Performance Benchmarks</a></h2> <p>The performance benchmarks were conducted for both CPU and GPU on <code>Float64</code> and <code>Float32</code> types, respectively. The example files <code>elixir_advection_basic.jl</code>, <code>elixir_euler_ec.jl</code>, and <code>elixir_euler_source_terms.jl</code> were chosen from <code>tree_1d_dgsem</code>, <code>tree_2d_dgsem</code>, and <code>tree_3d_dgsem</code> under the <code>src/examples</code> directory. These examples were chosen because they are consistent in case of 1D, 2D, and 3D. Please note that all the examples have passed the accuracy tests and you can check them using this <a href="https://github.com/huiyuxie/trixi_cuda/tree/main/cuda_julia/examples">link to examples</a>.</p> <p>The benchmark results were archived in another file and please use this <a href="https://github.com/huiyuxie/trixi_cuda/blob/main/docs/cuda_benchmarks.md">link to benchmarks</a> to check them. Also note that the benchmarks were focuesd on the time integration part &#40;i.e., on <code>OrdinaryDiffEq.solve</code>&#41;, see a benchmark exmaple below </p> <pre><code class="Julia hljs"><span class=hljs-comment ># Run on CPU</span>
+julia&gt; attribute(device(),CUDA.DEVICE_ATTRIBUTE_MAX_THREADS_PER_BLOCK) <span class=hljs-number >1024</span></code></pre> <p>the kernel could be addressed in the current GPU version but may not be in some other GPU versions &#40;as different GPUs report different attribute values such as <code>CUDA.DEVICE_ATTRIBUTE_MAX_GRID_DIM_X</code> and <code>CUDA.DEVICE_ATTRIBUTE_MAX_THREADS_PER_BLOCK</code>&#41;. So it was suggested to introduce the use of a stride loop for the current GPU kernels.</p> <h3 id=ol_start3_kernel_optimization ><a href="#ol_start3_kernel_optimization" class=header-anchor ><ol start=3 > <li><p>Kernel Optimization</p> </ol> </a></h3> <p>Some work on kernel optimization has already been done during the process of kernel prototyping, such as avoiding the use of conditional branches and minimizing kernel calls. But the general work on kernel optimization has not yet been carried out &#40;so this part is somewhat related to the future work&#41;. </p> <p>In summary, the kernel optimization should be based on kernel benchmarks and kernel profiling, and here are some factors that can be considered to improve performance:</p> <ul> <li><p>Data Transfer: The process of data transfer from CPU to GPU &#40;and back&#41; mainly occurs in the <code>rhs&#33;</code> function when calling <code>semidiscretize</code>. Since <code>rhs&#33;</code> is called multiple times during the process of time integration, it would be more efficient to complete the data transfer before calling the <code>rhs&#33;</code> function.</p> <li><p>Stride Loop Tuning: The stride loop has not yet been introduced to the current GPU kernels. 
By applying stride loop tuning, the loop structure &#40;i.e., stride length&#41; can be modified to improve performance.</p> <li><p>Multi-GPU/Multi-Thread: The performance can be further improved if multiple GPUs or multiple threads are used.</p> </ul> <h2 id=performance_benchmarks ><a href="#performance_benchmarks" class=header-anchor >Performance Benchmarks</a></h2> <p>The performance benchmarks were conducted for both CPU and GPU with both <code>Float64</code> and <code>Float32</code> types. The example files <code>elixir_advection_basic.jl</code>, <code>elixir_euler_ec.jl</code>, and <code>elixir_euler_source_terms.jl</code> were chosen from <code>tree_1d_dgsem</code>, <code>tree_2d_dgsem</code>, and <code>tree_3d_dgsem</code> under the <code>src/examples</code> directory. These examples were chosen because they are consistent across the 1D, 2D, and 3D cases. Please note that all the examples have passed the accuracy tests and you can check them using this <a href="https://github.com/czha/TrixiGPU.jl/tree/legacy/src/examples">link to examples</a>.</p> <p>The benchmark results were archived in another file; please use this <a href="https://github.com/czha/TrixiGPU.jl/blob/legacy/docs/cuda_benchmark.md">link to benchmarks</a> to check them. Also note that the benchmarks were focused on the time integration part &#40;i.e., on <code>OrdinaryDiffEq.solve</code>&#41;; a benchmark example is shown below </p> <pre><code class="Julia hljs"><span class=hljs-comment ># Run on CPU</span>
 <span class=hljs-meta >@benchmark</span> <span class=hljs-keyword >begin</span>
     sol_cpu = OrdinaryDiffEq.solve(ode_cpu, BS3(), adaptive=<span class=hljs-literal >false</span>, dt=<span class=hljs-number >0.01</span>;
         abstol=<span class=hljs-number >1.0e-6</span>, reltol=<span class=hljs-number >1.0e-6</span>, ode_default_options()...)
@@ -10,4 +10,4 @@
 <span class=hljs-meta >@benchmark</span> <span class=hljs-keyword >begin</span>
     sol_gpu = OrdinaryDiffEq.solve(ode_gpu, BS3(), adaptive=<span class=hljs-literal >false</span>, dt=<span class=hljs-number >0.01</span>;
         abstol=<span class=hljs-number >1.0e-6</span>, reltol=<span class=hljs-number >1.0e-6</span>, ode_default_options()...)
-<span class=hljs-keyword >end</span></code></pre> <p>From the benchmark results, it is shown that the GPU did not perform better than the CPU in general &#40;but there were some exceptions&#41;. Furthermore, the <code>Memory estimate</code> and <code>allocs estimate</code> statistics from the GPU are much larger than those from the CPU. This is probably due to the design that all the data transfer happens in the <code>rhs&#33;</code> function and thus the memory cost is extremely high when transferring data repeatedly between the CPU and GPU. </p> <p>In addition, the results indicate that the GPU performs better with 2D and 3D examples than with 1D examples. That is because GPUs are designed to handle a large number of parallel tasks, and 2D and 3D problems usually offer more parallelism compared to 1D problems. Essentially, the more data you can process simultaneously, the more efficiently you can utilize the GPU. 1D problems may not be complex enough to take full advantage of the GPU parallel processing capability.</p> <h2 id=future_work ><a href="#future_work" class=header-anchor >Future Work</a></h2> <p>The future work is listed here, ranging from specific to more general, from top to bottom:</p> <ol> <li><p>Resolve <a href="https://github.com/huiyuxie/trixi_cuda/issues/9">Issue #9</a> and <a href="https://github.com/huiyuxie/trixi_cuda/issues/11">Issue #11</a> &#40;and any upcoming issues&#41; </p> <li><p>Complete the prototype for the remaining kernels &#40;please refer to the Kernel to be Implemented from the <a href="https://github.com/huiyuxie/trixi_cuda/blob/main/README.md">README</a> file&#41;.</p> <li><p>Update <a href="https://github.com/trixi-framework/Trixi.jl/pull/1604">PR #1604</a> and make it merged into the repository</p> <li><p>Optimize CUDA kernels to improve performance &#40;especially data transfer, please refer to the kernel optimization part&#41;</p> <li><p>Prototype the GPU kernels for other DG solvers &#40;for example, 
<code>DGMulti</code>, etc.&#41;</p> <li><p>Extend the single-GPU support to multi-GPU support &#40;similarly, from single-thread to multi-thread&#41;</p> <li><p>Broaden compatibility to other GPU types beyond Nvidia &#40;such as those from Apple, Intel, and AMD&#41;</p> </ol> <h2 id=acknowledgements ><a href="#acknowledgements" class=header-anchor >Acknowledgements</a></h2> <p>I would like to express my gratitude to Google, the Julia community, and my mentors &#40;<a href="https://github.com/ranocha">Hendrik Ranocha</a>, <a href="https://github.com/sloede">Michael Schlottke-Lakemper</a>, and <a href="https://github.com/jlchan">Jesse Chan</a>&#41; for this enriching experience during the Google Summer of Code 2023 program. This opportunity to participate, enhance my skills, and contribute to the advancement of Julia has been both challenging and rewarding.</p> <p>Special thanks go to my GSoC mentor <a href="https://github.com/ranocha">Hendrik Ranocha</a> &#40;@ranocha&#41; and another person from JuliaGPU <a href="https://github.com/maleadt"> Tim Besard</a> &#40;@maleadt, though he is not my mentor&#41;, whose guidance and support throughout our regular discussions have been instrumental in answering my questions and overcoming hurdles. The Julia community is incredibly welcoming and supportive, and I am proud to have been a part of this endeavor.</p> <p>I am filled with appreciation for this fantastic summer of learning and development, and I look forward to seeing the continued growth of Julia and the contributions of its vibrant community.</p> <div class=page-foot > <div class=copyright > &copy; <a href="https://github.com/trixi-framework/Trixi.jl/blob/main/AUTHORS.md" target=_blank  rel="noopener noreferrer">The Trixi Authors</a>. Last modified: August 22, 2024. Website built with <a href="https://github.com/tlienart/Franklin.jl">Franklin.jl</a> and the <a href="https://julialang.org">Julia programming language</a>. </div> </div> </div>
\ No newline at end of file
+<span class=hljs-keyword >end</span></code></pre> <p>The benchmark results show that, in general, the GPU did not outperform the CPU, although there were some exceptions. Furthermore, the <code>Memory estimate</code> and <code>allocs estimate</code> statistics from the GPU are much larger than those from the CPU. This is probably because all data transfer happens inside the <code>rhs&#33;</code> function, so data is repeatedly transferred between the CPU and GPU, which makes the memory cost extremely high. </p> <p>In addition, the results indicate that the GPU performs better with 2D and 3D examples than with 1D examples. That is because GPUs are designed to handle a large number of parallel tasks, and 2D and 3D problems usually offer more parallelism than 1D problems. Essentially, the more data you can process simultaneously, the more efficiently you can utilize the GPU. 1D problems may not be complex enough to take full advantage of the GPU's parallel processing capability.</p> <h2 id=future_work ><a href="#future_work" class=header-anchor >Future Work</a></h2> <p>Future work is listed below, ordered from specific to more general:</p> <ol> <li><p>Resolve <a href="https://github.com/huiyuxie/trixi_cuda/issues/9">Issue #9</a> and <a href="https://github.com/huiyuxie/trixi_cuda/issues/11">Issue #11</a> &#40;and any upcoming issues&#41;.</p> <li><p>Complete the prototype for the remaining kernels &#40;please refer to Kernel to be Implemented in the <a href="https://github.com/czha/TrixiGPU.jl/blob/legacy/README.md">README.md</a> file&#41;.</p> <li><p>Update <a href="https://github.com/trixi-framework/Trixi.jl/pull/1604">PR #1604</a> and get it merged into the repository.</p> <li><p>Optimize CUDA kernels to improve performance &#40;especially data transfer; please refer to the kernel optimization part&#41;.</p> <li><p>Prototype the GPU kernels for other DG solvers &#40;for example, 
<code>DGMulti</code>, etc.&#41;</p> <li><p>Extend the single-GPU support to multi-GPU support &#40;similarly, from single-thread to multi-thread&#41;</p> <li><p>Broaden compatibility to other GPU types beyond Nvidia &#40;such as those from Apple, Intel, and AMD&#41;</p> </ol> <h2 id=acknowledgements ><a href="#acknowledgements" class=header-anchor >Acknowledgements</a></h2> <p>I would like to express my gratitude to Google, the Julia community, and my mentors &#40;<a href="https://github.com/ranocha">Hendrik Ranocha</a>, <a href="https://github.com/sloede">Michael Schlottke-Lakemper</a>, and <a href="https://github.com/jlchan">Jesse Chan</a>&#41; for this enriching experience during the Google Summer of Code 2023 program. This opportunity to participate, enhance my skills, and contribute to the advancement of Julia has been both challenging and rewarding.</p> <p>Special thanks go to my GSoC mentor <a href="https://github.com/ranocha">Hendrik Ranocha</a> &#40;@ranocha&#41; and to <a href="https://github.com/maleadt">Tim Besard</a> from JuliaGPU &#40;@maleadt, though he is not my mentor&#41;, whose guidance and support throughout our regular discussions have been instrumental in answering my questions and overcoming hurdles. The Julia community is incredibly welcoming and supportive, and I am proud to have been a part of this endeavor.</p> <p>I am filled with appreciation for this fantastic summer of learning and development, and I look forward to seeing the continued growth of Julia and the contributions of its vibrant community.</p> <div class=page-foot > <div class=copyright > &copy; <a href="https://github.com/trixi-framework/Trixi.jl/blob/main/AUTHORS.md" target=_blank  rel="noopener noreferrer">The Trixi Authors</a>. Last modified: September 04, 2024. Website built with <a href="https://github.com/tlienart/Franklin.jl">Franklin.jl</a> and the <a href="https://julialang.org">Julia programming language</a>. </div> </div> </div>
\ No newline at end of file
diff --git a/sitemap.xml b/sitemap.xml
index c80256d..d978a23 100644
--- a/sitemap.xml
+++ b/sitemap.xml
@@ -3,13 +3,13 @@
 
 <url>
     <loc>https://trixi-framework.github.io/outreach/gsoc/2023/gpu-acceleration-in-trixi-jl-using-cuda-jl/index.html</loc>
-    <lastmod>2024-08-22</lastmod>
+    <lastmod>2024-09-04</lastmod>
     <changefreq>monthly</changefreq>
     <priority>0.5</priority>
 </url>
 <url>
     <loc>https://trixi-framework.github.io/index.html</loc>
-    <lastmod>2024-08-22</lastmod>
+    <lastmod>2024-09-04</lastmod>
     <changefreq>monthly</changefreq>
     <priority>0.5</priority>
 </url>