You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Describe the bug
I have a 3 node vault cluster using raft storage, in Kubernetes. If I restart one of the pods, it fails immediately and continuously with the following error:
The only way to solve this right now is to completely remove the persistent volume for the pod, and restart. This means it's impossible to update the Vault cluster without doing a full restore.
To Reproduce
Steps to reproduce the behavior:
Run a 3 node cluster
Restart one of the nodes
Expected behavior
Vault is able to recover from restarts.
Environment:
Vault Server Version (retrieve with vault status): 1.17.5
Vault CLI Version (retrieve with vault version): Vault v1.17.5 (4d0c53e), built 2024-08-30T15:54:57Z
Server Operating System/Architecture: Kubernetes, bare metal
It looks like your vault.db was somehow corrupted. It's hard to determine what exactly the cause of the corruption is without the bbolt file.
I'd suggest using the bbolt command line utility(https://developer.hashicorp.com/vault/tutorials/monitoring/inspect-data-boltdb) to inspect the contents of a copy of the vault.db file, particularly page 4190 that is throwing this error. You could also consider performing a bbolt compact operation on the copy of the database, to see if the free list compaction is able to resolve the corruption.
Unfortunately I have resorted to switching the storage backend to postgres, so I won't be able to try those commands. It was too unstable for our production requirements.
What seemed odd to me is that it occurred on every pod restart.
Describe the bug
I have a 3 node vault cluster using raft storage, in Kubernetes. If I restart one of the pods, it fails immediately and continuously with the following error:
The only way to solve this right now is to completely remove the persistent volume for the pod, and restart. This means it's impossible to update the Vault cluster without doing a full restore.
To Reproduce
Steps to reproduce the behavior:
Expected behavior
Vault is able to recover from restarts.
Environment:
vault status
): 1.17.5vault version
): Vault v1.17.5 (4d0c53e), built 2024-08-30T15:54:57ZVault server configuration file(s):
Additional context
raft_wal
to fix the issue, but it looks as though boltdb is still used. This issue occurs regradless of settingraft_wal
The text was updated successfully, but these errors were encountered: