Initial stab at actx.trace_call #210
Conversation
# {{{ utilities

def _ary_container_key_stringifier(keys: Tuple[Any, ...]) -> str:
Don't these functions already exist in impl.pytato.compile? What has changed?
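For reference, a minimal sketch of what a key stringifier of this kind might do; the joining scheme here is an assumption for illustration, not the implementation from impl.pytato.compile:

from typing import Any, Tuple

def _ary_container_key_stringifier(keys: Tuple[Any, ...]) -> str:
    # Hypothetical sketch: flatten a tuple of (possibly nested) container
    # keys into a single string usable as part of a placeholder name.
    # A real implementation must also ensure the result is a valid
    # Python identifier.
    return "_".join(str(key) for key in keys)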
@@ -520,6 +520,12 @@ def compile(self, f: Callable[..., Any]) -> Callable[..., Any]:
        """
        return f

    # Supporting interface for function/call tracing in actx implementations
    def trace_call(self, f: Callable[..., Any],
                   *args, identifier=None, **kwargs):
Missing return type annotation. More generally, trace_call really should permit the user to do two things:
- Use the result of the traced call.
- Call the trace result using new data.

I'm not sure we'll be able to do both with just a single return value.
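A minimal sketch of one return shape that could support both, built around a hypothetical TracedCall wrapper (an assumption for illustration, not what the PR implements):

from dataclasses import dataclass
from typing import Any, Callable

@dataclass(frozen=True)
class TracedCall:
    # Hypothetical: result of evaluating f on the originally traced arguments.
    result: Any
    # Hypothetical: handle that re-invokes the trace on fresh inputs.
    retrace: Callable[..., Any]

    def __call__(self, *args: Any, **kwargs: Any) -> Any:
        # Reuse the traced function on new data.
        return self.retrace(*args, **kwargs)

# Usage (hypothetical):
#   tc = actx.trace_call(f, x)
#   out = tc.result    # use the result of the traced call
#   out2 = tc(y)       # call the trace again with new data

Returning a callable wrapper would keep the common case (just use the result) cheap while still exposing the trace for reuse.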
@@ -520,6 +520,12 @@ def compile(self, f: Callable[..., Any]) -> Callable[..., Any]:
        """
        return f

    # Supporting interface for function/call tracing in actx implementations
    def trace_call(self, f: Callable[..., Any],
                   *args, identifier=None, **kwargs):
I think you also want the interface to apply tags to the call site.
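For instance, a hedged sketch of such a signature; the tags keyword is an assumption about what the interface could look like, using pytools.tag.Tag as the tag type:

from typing import Any, Callable, FrozenSet, Optional

from pytools.tag import Tag

def trace_call(self, f: Callable[..., Any], *args: Any,
               identifier: Optional[str] = None,
               tags: FrozenSet[Tag] = frozenset(),
               **kwargs: Any) -> Any:
    # Hypothetical: attach 'tags' to the call-site node created during
    # tracing, so that later transformations (e.g. outlining decisions)
    # can find and treat this call site specially.
    ...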
Is this something that is still being worked on? If yes, I can help with some stuff here; I have some bandwidth for it this week. If not, then what's the plan for addressing illinois-ceesd/mirgecom#777? (I don't think compilation times of those orders should be tolerated.)
I pushed and updated my latest on this. My apologies for the state of it; it is very much WIP. I had a fork/branch of meshmode with a simple test of using the actx-provided trace_call.
Thanks! If you want, I can take over and maybe it will be faster that way? My ETA for this is the weekend. However, if you think you want to get accustomed to these parts of the codebase, I'm happy to step away.
The compilation times seem well under control for our current prediction, I believe. We need to run some tests at scale to make sure that this is the case, but currently the build times are hovering around less than an hour for high-order 3D with all physics and features enabled on 128 ranks. That is well into the prediction-enabling realm, and also not a giant bottleneck. The build time for serial is around 600-1000s, and 3000+s for many ranks.

I understand the current situation is brought about mostly by mitigation and not a permanent solution. One-dimensional domain decomposition currently limits our max number of MPI neighbors to 2, minimizing the impact of the splatted flux DAGs for partition boundaries. It does seem like, for our problem and geometry, we could stick with that strategy indefinitely. Solving the DAG-splat issue will, I think, help us avoid splatting the flux calculation for each unique domain boundary. That would be useful.
I would be delighted for you to take over and am very interested in following what happens here. I would get back to this after the review, but won't be anywhere near as efficient at getting it going. I believe it is close to functioning as a demo, but not close to final form.
Compilation times of 1 hour (even 20 minutes, tbh) are very bad and something that must be fixed ASAP. A related problem is that this also leads to poorly generated device code: each sub-array from a neighboring rank is assigned a small-ish buffer, and launching GPU kernels for such small problem sizes brings down the throughput.
Yep, let's do this then.
See #221.
In #221, I added a test under examples/:

(py311_env) [line@line examples]$ python how_to_outline.py
[Pre-concatenation] Number of nodes = 32701
[Post-concatenation] Number of nodes = 3710

I think we can close this PR.
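The node-count reduction comes from outlining: repeated invocations share a single traced function instead of each inlining a full copy of its expression graph. A hedged sketch of the pattern (names are illustrative, not taken from how_to_outline.py, and the re-callable return shape is the assumption sketched earlier in this thread):

# Without outlining, each call splats a copy of f's expression graph
# into the overall DAG:
results = [f(state) for state in states]

# With outlining via trace_call, the DAG holds one definition of f plus
# a small call node per invocation:
traced = actx.trace_call(f, states[0], identifier="f")
results = [traced.result] + [traced(s) for s in states[1:]]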
Augments inducer/pytato#364 by adding an array context version of trace_call that handles array container types.

cc: @lukeolson