You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This is "concepts of a plan", so I don't know specifics but...
We should have some kind of automated core dumping when processes behave badly to do postmortem analysis.
Roughly:
configure coredumpctl
Setup some monitoring worker which keeps checking health of all services
If service isn't healthy for a long time - coredump (and eventually restart)
Why:
No need to debug under fire, investigate things peacefully.
If not coredumps, we can also do process-specific "dump state" too, like py-spy or sigusr1 for python processes. This doesn't reveal the whole picture though.
The text was updated successfully, but these errors were encountered:
This is "concepts of a plan", so I don't know specifics but...
We should have some kind of automated core dumping when processes behave badly to do postmortem analysis.
Roughly:
coredumpctl
Why:
If not coredumps, we can also do process-specific "dump state" too, like py-spy or sigusr1 for python processes. This doesn't reveal the whole picture though.
The text was updated successfully, but these errors were encountered: