Add documentation for DirectiveBreakdown #172
Conversation
Document the DirectiveBreakdown resource and how the WLM should interpret its fields. Signed-off-by: Matt Richerson <[email protected]>
* `colocation` specifies how two or more allocations influence the location of each other. The colocation constraint has two fields, `type` and `key`. Currently, the only value for `type` is `exlusive`. `key` can be any value. This constraint means that the allocations from an allocation set with the colocation constraint can't be placed on an NNF node with another allocation whose allocation set has a colocation constraint with the same key. Allocations from allocation sets with colocation constraints with different keys or allocation sets without the colocation constraint are okay to put on the same NNF node.
Suggested change: `exlusive` → `exclusive`.
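To make the colocation description above concrete, here is a minimal sketch of how such a constraint might appear in an allocation set. The `exclusive` type and the `lustre-mgt` key come from the documentation under review, but the exact field nesting is an assumption for illustration.

```
# Hypothetical excerpt of one entry in status.storage.allocationSets;
# the nesting of the constraints block is assumed, not authoritative.
constraints:
  colocation:
  - type: exclusive   # currently the only supported type
    key: lustre-mgt   # an allocation from this set must not land on an NNF node holding an
                      # allocation from another set whose colocation constraint uses the same key
```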
* `count` this field specified the number of allocations to make when `status.storage.allocationSets.allocationStrategy` is `AllocateAcrossServers`
Suggested change: "this field specified" → "this field specifies".
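Similarly, a minimal sketch of `count` in an allocation set, using the `count: 5` value from the documentation's own example; the surrounding field layout is assumed for illustration.

```
# Hypothetical allocation set entry; field nesting is assumed.
allocationStrategy: AllocateAcrossServers
constraints:
  count: 5   # the WLM makes exactly this many allocations for this allocation set
```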
* `scale` is a unitless value from 1-10 that is meant to guide the WLM on how many allocations to make when `status.storage.allocationSets.allocationStrategy` is `AllocateAcrossServers`. The actual number of allocations is not meant to correspond to the value of scale. Rather, 1 would indicate the minimum number of allocations to reach `status.storage.allocationSets.minimumCapacity`, and 10 would be the maximum number of allocations that make sense given the `status.storage.allocationSets.minimumCapacity` and the compute node count. The NNF software does not interpret this value, and it is up to the WLM to define its meaning.
Matt, when does an admin create a storage profile that uses 'scale' rather than 'count'? What is the admin trying to accomplish when they do that? The webhook does not allow both to be specified together.
`count` gives you consistent performance of the file system regardless of how big your job is. `scale` gives you file system performance that scales with the number of computes.
I think different workload types will benefit from setting the parameters differently.
- Single shared file jobs might only need a single MDT since metadata requirements are low.
- File per process jobs might want OSTs to scale with the number of computes.
- Creating a persistent storage allocation would benefit from setting `count` for both OSTs and MDTs since there's no compute job, so the compute node count will be 1 (or 0 if flux adds support for jobs with no compute resources).
I think `scale` will be the more used option, but `count` is good for the situations where you want to do something specific and want full control.
All of that sounds like useful info to pass on to the WLM developer.
Yeah, I think you're right. I think I'll add it to the documentation for the corresponding fields in the NnfStorageProfile, though (from the year-old PR). That's more of the admin/user facing documentation. That's who would be interested in knowing why you'd want to choose one value over another. The WLM documentation is more about how to take the information NNF provides in the DirectiveBreakdown and use that for scheduling computes and Rabbits.
I'm adding the contents of that old PR to the existing documentation for storage profiles. When that's done, I'll put a link to that section here.
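To ground the `scale` discussion above, a sketch of how `scale` might sit alongside the other allocation set fields, with the interpretation from this thread summarized in comments; the field nesting and the capacity units are assumptions for illustration.

```
# Hypothetical allocation set entry; field nesting and units are assumed.
allocationStrategy: AllocateAcrossServers
minimumCapacity: 1000000000000   # assumed to be bytes; the total capacity the allocations must reach
constraints:
  scale: 5   # unitless 1-10 hint: 1 ~ the fewest allocations that still reach minimumCapacity,
             # 10 ~ as many allocations as make sense for the compute node count.
             # The WLM, not the NNF software, decides the actual number of allocations.
```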
## Background
The `#DW` directives in a job script are not intended to be interpreted by the workload manager. The workload manager passes the `#DW` directives to the NNF software through the DWS `workflow` resource, and the NNF software determines what resources are needed to satisfy the directives. The NNF software communicates this information back to the workload manager through the DWS `DirectiveBreakdown` resource. This document describes how to interpret the information in the `DirectiveBreakdown`.
"This document describes how to interpret..." should that be "how the WLM should interpret" ?
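To illustrate the flow described in the Background section, here is a rough sketch of the DWS `workflow` resource the WLM might create to pass a `#DW` directive to the NNF software. The field names shown (`desiredState`, `dwDirectives`) and the directive text are assumptions based on typical DWS usage, and required fields such as the API version and job/user identifiers are omitted.

```
# Hypothetical, trimmed DWS Workflow; field names and values are illustrative assumptions.
kind: Workflow
metadata:
  name: my-job
spec:
  desiredState: Proposal      # the proposal phase is where the DirectiveBreakdown gets created
  dwDirectives:
  - "#DW jobdw type=lustre capacity=1TiB name=scratch"
```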
## DirectiveBreakdown Overview
The DWS `DirectiveBreakdown` contains all the information to inform the WLM how to pick storage and compute nodes for a job. The `DirectiveBreakdown` resource is created by the NNF software during the `proposal` phase of the DWS workflow. The `spec` section of the `DirectiveBreakdown` is filled in with the `#DW` directive by the NNF software, and the `status` section contains the information for the WLM. The WLM should wait until the `status.ready` field is true before interpreting the rest of the `status` fields.
Suggested change: "contains all the information to inform the WLM how to pick" → "contains all the information necessary to inform the WLM about how to pick", and "`proposal` phase" → "`Proposal` phase".
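For orientation, a heavily trimmed sketch of the shape being described, showing where `spec`, `status`, and `status.ready` sit; the field set and values are illustrative assumptions, not output from a real system.

```
# Hypothetical, trimmed DirectiveBreakdown; fields and values are illustrative only.
kind: DirectiveBreakdown
metadata:
  name: my-job-0
spec:
  directive: "#DW jobdw type=lustre capacity=1TiB name=scratch"   # the #DW line this breakdown covers
status:
  ready: true   # the WLM waits for this before reading the rest of status
  storage:
    allocationSets:
    - allocationStrategy: AllocateAcrossServers
      minimumCapacity: 1000000000000
```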