You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In the case of gateway groups, all gateways within the group should have identical NVMf target configurations. Upon starting, the management daemon on a gateway within the group should check for ability to restore from an established NVMf target configuration to match the group. Currently, the restore is aborted and the daemon exits if there is an issue restoring some component of the target. This ensures the gateway isn't available while having a mismatched target to the others within the group.
As @trociny points out in #9, in a case where an image in ceph is accidentally deleted, a gateway attempting to restore the bdev will always fail. This could lead to all gateways within the group dying.
Should the gateway continuously check for availability of images to maintain the correctness of the OMAP specification?
As @trociny suggests, is it better to output an error message instead of aborting on restore failure? In this case, should the gateway keep track of errors to avoid attempting to restore components that have dependencies on ones that have already failed? Ex: Restoring subsystem-A fails, the gateway logs an error, continues restoring other subsystems and components but skips attempting to restore any namespaces, hosts, listeners associated with subsystem-A.
The text was updated successfully, but these errors were encountered:
In the case of gateway groups, all gateways within the group should have identical NVMf target configurations. Upon starting, the management daemon on a gateway within the group should check for ability to restore from an established NVMf target configuration to match the group. Currently, the restore is aborted and the daemon exits if there is an issue restoring some component of the target. This ensures the gateway isn't available while having a mismatched target to the others within the group.
As @trociny points out in #9, in a case where an image in ceph is accidentally deleted, a gateway attempting to restore the bdev will always fail. This could lead to all gateways within the group dying.
Should the gateway continuously check for availability of images to maintain the correctness of the OMAP specification?
As @trociny suggests, is it better to output an error message instead of aborting on restore failure? In this case, should the gateway keep track of errors to avoid attempting to restore components that have dependencies on ones that have already failed? Ex: Restoring subsystem-A fails, the gateway logs an error, continues restoring other subsystems and components but skips attempting to restore any namespaces, hosts, listeners associated with subsystem-A.
The text was updated successfully, but these errors were encountered: