
Ensure re-sync is triggered #773

Merged: 1 commit merged into rancher:main from the ensure_resync_is_triggered branch on Jun 21, 2024

Conversation

davidcassany (Contributor):

No description provided.

@davidcassany davidcassany requested a review from a team as a code owner June 21, 2024 09:56
@github-actions github-actions bot added the area/tests test related changes label Jun 21, 2024
@davidcassany davidcassany marked this pull request as draft June 21, 2024 10:35
Signed-off-by: David Cassany <[email protected]>
@davidcassany davidcassany force-pushed the ensure_resync_is_triggered branch from 9b4c3bc to 60f4b27 on June 21, 2024 10:51
@davidcassany davidcassany marked this pull request as ready for review June 21, 2024 10:52
davidcassany (Contributor, Author):

Sets the sync interval to 10 seconds for this test.

  patchBase := client.MergeFrom(ch.DeepCopy())
- ch.Spec.SyncInterval = "10m"
+ ch.Spec.SyncInterval = "10s"
  Expect(cl.Patch(ctx, ch, patchBase)).To(Succeed())

  // Pod is created
davidcassany (Contributor, Author):

A pod is created immediately after patching the channel, as a channel resource update triggers a new sync.
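
As an illustration, a minimal Gomega sketch of that assertion (the namespace-wide pod listing is an assumption about how the test locates the sync pod, not the PR's actual test code):

    Eventually(func() int {
    	// Hypothetical check: a sync pod should show up in the channel's
    	// namespace right after the patch, without waiting for the interval.
    	pods := &corev1.PodList{}
    	if err := cl.List(ctx, pods, client.InNamespace(ch.Namespace)); err != nil {
    		return 0
    	}
    	return len(pods.Items)
    }).Should(BeNumerically(">", 0))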

controllers/managedosversionchannel_controller_test.go (outdated review comment, resolved)

// After channel update already existing versions were patched
Expect(cl.Get(ctx, client.ObjectKey{
Name: "v0.1.0",
Namespace: ch.Namespace,
}, managedOSVersion)).To(Succeed())
Expect(managedOSVersion.Spec.Version).To(Equal("v0.1.0-patched"))

// Simulate another channel content change
syncerProvider.SetJSON(deprecatingJSON)
davidcassany (Contributor, Author):

This changes the channel content but not the channel resource, hence it does not trigger an immediate re-sync; we have to wait for the interval (10s) to elapse.
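
A sketch of what the corresponding wait could look like (the "v0.2.0" version introduced by deprecatingJSON and the 15s timeout are assumptions for illustration):

    Eventually(func() error {
    	// Hypothetical wait: only the channel content changed, so the new
    	// state becomes observable after the periodic re-sync fires; the
    	// polling timeout must therefore exceed the 10s sync interval.
    	return cl.Get(ctx, client.ObjectKey{
    		Name:      "v0.2.0", // hypothetical version added by deprecatingJSON
    		Namespace: ch.Namespace,
    	}, managedOSVersion)
    }, "15s").Should(Succeed())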

  }

  if managedOSVersionChannel.Status.FailedSynchronizationAttempts > maxConscutiveFailures {
  	logger.Error(fmt.Errorf("stop retrying"), "sychronization failed consecutively too many times", "failed attempts", managedOSVersionChannel.Status.FailedSynchronizationAttempts)
- 	return ctrl.Result{}, nil
+ 	return ctrl.Result{RequeueAfter: time.Until(lastSync.Add(interval))}, nil
davidcassany (Contributor, Author):

I think this was an actual bug or leftover

fgiudici (Member):

yep, nice fix!
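
To illustrate why the new return value fixes this: time.Until(lastSync.Add(interval)) asks controller-runtime to requeue exactly when the sync interval since the last sync elapses, whereas the old ctrl.Result{}, nil never requeued at all. A self-contained sketch with hypothetical values:

    package main

    import (
    	"fmt"
    	"time"
    )

    func main() {
    	// Hypothetical values: a 10m sync interval, last sync 4m ago.
    	interval := 10 * time.Minute
    	lastSync := time.Now().Add(-4 * time.Minute)

    	// Same expression as in the fix: requeue once the interval since
    	// the last sync has elapsed, i.e. roughly 6m from now.
    	fmt.Println(time.Until(lastSync.Add(interval)))
    }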

@@ -187,12 +187,12 @@ func (r *ManagedOSVersionChannelReconciler) reconcile(ctx context.Context, manag

  if readyCondition.Status == metav1.ConditionTrue {
  	logger.Info("synchronization already done", "lastSync", lastSync)
- 	return ctrl.Result{}, nil
+ 	return ctrl.Result{RequeueAfter: time.Until(lastSync.Add(interval))}, nil
davidcassany (Contributor, Author):

IMHO this shouldn't be needed, but it certainly does not hurt and helps make the logic more robust. The unit test verifying the automatic re-sync after the interval passes even without this change.

fgiudici (Member):

Yeah, I was wondering if this could queue extra, unneeded reconcile loops, but in practice it seems that never happens. Moreover, even if we did reconcile once more, nothing bad would happen 👍🏼

@@ -525,5 +530,16 @@ func filterChannelEvents() predicate.Funcs {
  		logger.V(log.DebugDepth).Info("Processing generic event", "Obj", e.Object.GetName())
  		return true
  	},
+ 	// Ignore pods creation
+ 	CreateFunc: func(e event.CreateEvent) bool {
davidcassany (Contributor, Author):

This is to prevent reconciling again immediately after creating the pod resource. We should only re-reconcile on pod status changes.

fgiudici (Member):

well done, this was the extra reconcile loop we saw
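
The predicate body is cut off in the excerpt above; a minimal sketch of the idea (an assumption about its shape, not the PR's actual code) could look like this:

    import (
    	corev1 "k8s.io/api/core/v1"
    	"sigs.k8s.io/controller-runtime/pkg/event"
    	"sigs.k8s.io/controller-runtime/pkg/predicate"
    )

    // filterPodCreation is a hypothetical illustration: drop create events
    // for Pods so the sync pod created by the controller itself does not
    // immediately re-trigger reconciliation, while other create events
    // (e.g. for channels) still pass through.
    func filterPodCreation() predicate.Funcs {
    	return predicate.Funcs{
    		CreateFunc: func(e event.CreateEvent) bool {
    			if _, isPod := e.Object.(*corev1.Pod); isPod {
    				return false
    			}
    			return true
    		},
    	}
    }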

@fgiudici fgiudici (Member) left a review:

Well done, it seems the channel sync is in pretty good shape now! (Tested and checked on a test deployment.)

@davidcassany davidcassany merged commit 61e76e2 into rancher:main Jun 21, 2024
22 checks passed
@davidcassany davidcassany deleted the ensure_resync_is_triggered branch June 21, 2024 14:32
Labels: area/tests (test related changes)
2 participants