Update description of node taints and labels. (#170)
Node taints and labels are handled by the SystemConfiguration controller,
rather than `nnf-deploy init`.

Cert-manager is owned by the manifests which are deployed via ArgoCD, and
it no longer needs to be described as a manual step.

Signed-off-by: Dean Roehrich <[email protected]>
roehrich-hpe authored Jun 25, 2024
1 parent d3b2792 commit de3b327
Showing 1 changed file with 1 addition and 10 deletions.
11 changes: 1 addition & 10 deletions docs/guides/initial-setup/readme.md
@@ -47,15 +47,6 @@ Installation of Kubernetes (k8s) nodes proceeds by installing k8s components ont
- Worker: 1 or more worker nodes which run the system level controller manager (SLCM) and Data Workflow Services (DWS) pods. In production, at least 3 nodes should be dedicated to this role.
- Rabbit: 1 or more Rabbit nodes which run the node level controller manager (NLCM) code. The NLCM daemonset pods are exclusively scheduled on Rabbit nodes. All Rabbit nodes are joined to the cluster as k8s workers, and they are tainted to restrict the type of work that may be scheduled on them. The NLCM pod has a toleration that allows it to run on the tainted (i.e. Rabbit) nodes.

-### Certificate manager
-
-Webhooks require the Jetstack `cert-manager`. Installation is shown below.
-
-```bash
-export certver="v1.13.1"
-# Required for webhooks
-kubectl apply -f https://github.com/jetstack/cert-manager/releases/download/"$certver"/cert-manager.yaml
-```

### Kubernetes Node Labels

@@ -70,7 +61,7 @@ kubectl apply -f https://github.com/jetstack/cert-manager/releases/download/"$ce
| :------------------------------| :---------------------------- |
| Rabbit Node | cray.nnf.node=true:NoSchedule |

-See [Taints and Tolerations](https://kubernetes.io/docs/concepts/scheduling-eviction/taint-and-toleration/). The [`nnf-deploy init`](https://github.com/NearNodeFlash/nnf-deploy) command provides examples of labeling and tainting k8s nodes for use with Rabbit.
+See [Taints and Tolerations](https://kubernetes.io/docs/concepts/scheduling-eviction/taint-and-toleration/). The SystemConfiguration controller will handle node taints and labels for the rabbit nodes based on the contents of the SystemConfiguration resource described below.

## Rabbit System Configuration

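Reviewer note: the taint listed in the second hunk, `cray.nnf.node=true:NoSchedule`, and the NLCM toleration mentioned in the first hunk interact through the standard Kubernetes matching rule. The Python sketch below models that rule as a reading aid only; it is not code from this commit or from the SystemConfiguration controller. The `Taint` and `Toleration` classes and the `parse_taint` and `tolerates` helpers are hypothetical names, and the exact toleration carried by the NLCM pod is an assumption, since the diff does not show it.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Taint:
    key: str
    value: str
    effect: str


@dataclass(frozen=True)
class Toleration:
    key: str = ""             # empty key is only meaningful with "Exists"
    operator: str = "Equal"   # "Equal" or "Exists"
    value: str = ""
    effect: str = ""          # empty effect matches every taint effect


def parse_taint(spec: str) -> Taint:
    """Parse the 'key=value:effect' form used in the guide's taint table."""
    key_value, sep, effect = spec.rpartition(":")
    if not sep or not key_value or not effect:
        raise ValueError(f"expected key[=value]:effect, got {spec!r}")
    key, _, value = key_value.partition("=")
    return Taint(key=key, value=value, effect=effect)


def tolerates(tol: Toleration, taint: Taint) -> bool:
    """Return True if the toleration matches the taint (Kubernetes rule)."""
    # A non-empty effect on the toleration must equal the taint's effect.
    if tol.effect and tol.effect != taint.effect:
        return False
    # A non-empty key on the toleration must equal the taint's key.
    if tol.key and tol.key != taint.key:
        return False
    # "Exists" ignores the value; with an empty key it matches any taint key.
    if tol.operator == "Exists":
        return True
    # "Equal" needs a key and an identical value.
    return bool(tol.key) and tol.value == taint.value


# Taint string taken from the table in the diff.
rabbit_taint = parse_taint("cray.nnf.node=true:NoSchedule")

# Assumed toleration; the real NLCM pod spec is not part of this commit.
assumed_nlcm_toleration = Toleration(
    key="cray.nnf.node", operator="Equal", value="true", effect="NoSchedule"
)

print(tolerates(assumed_nlcm_toleration, rabbit_taint))               # True
print(tolerates(Toleration(key="other", operator="Exists"), rabbit_taint))  # False
```

A pod without a matching toleration is not scheduled onto a node carrying a `NoSchedule` taint, which is how the Rabbit nodes end up reserved for the NLCM daemonset pods.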