Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add server-manager session management #547

Closed
4 tasks
marcelstanley opened this issue Jul 30, 2024 · 1 comment
Closed
4 tasks

Add server-manager session management #547

marcelstanley opened this issue Jul 30, 2024 · 1 comment
Milestone

Comments

@marcelstanley
Copy link
Contributor

marcelstanley commented Jul 30, 2024

📚 Context

Currently, the node cannot recover form server-manager failures on the fly. When such failures happen, the node has to be restarted and all existing input must be processed again in order to restore the state and resume operation.

✔️ Solution

As a further step to #541, we shall add support for session management on the node, based on snapshots saved by leveraging an existing server-manager feature which saves state between epochs.

This would allow the node to implement proper error handling, and the node would:

  • keep operating after a failure
  • recover its state from the most recent snapshot, avoid unnecessary reprocessing of inputs

📈 Subtasks

  • Implement stop on inspect if it detects a tainted session
  • Add Redis connection to advance and inspect
  • Advance must receive path to snapshot and start from it, if it exists
  • Implement inspect restart when new session is received from Redis
@vfusco
Copy link
Collaborator

vfusco commented Feb 21, 2025

1.6 will not be released

@vfusco vfusco closed this as completed Feb 21, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants