Fleet-agent is deployed as a StatefulSet with a single replica. In this setup there is no automatic failover when the node hosting that pod fails (this is by design for StatefulSets). An administrator has to delete the pod manually to get Fleet running again. This is unacceptable because recovery is reactive rather than automatic.
We either have to deploy the StatefulSet with a replica count greater than 1 to get fault tolerance, or we have to use a Deployment.
A Deployment with a replica count of 1 can take a long time to migrate to another node, so we should make the replica count configurable.
Business impact: High, as it causes downtime.
Repro steps:
Deploy fleet-agent
Power off the node hosting the fleet-agent
Acceptance Criteria
The Fleet controller replica count and the fleet-agent replica count are configurable via the Helm chart. Both default to one.
The fleet-agent init container and containers use leader election.
Optional: merge clusterstatus ticker into controller container, migrate it to c-r. That way we have one leader election loop less. Also, clusterstatus is tiny nowadays.