-
Notifications
You must be signed in to change notification settings - Fork 107
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
SetCursor: context deadline exceeded #390
Comments
It could be related to pausing. Any chance you could try disabling auto pausing for cursor partitions? cursors.stream.auto.pause.time: 0 https://liftbridge.io/docs/configuration.html#cursors-configuration-settings I'd be curious if the issue persists with that disabled. |
I disabled auto pausing, but I'm not sure if this has any effect. I still see these in the logs:
There are no SetCursor errors so far, but I suspect these will appear sometime. I will update the issue if this happens. |
The server requires a restart to take hold of the changed config, no hot reloading yet. That might be why auto pausing didn't disable? |
So scaling down to 0 and up to 3 won't work either? |
I would think that would work since the server would be restarting. But I'm not sure why it doesn't appear to be taking effect for you based on the logs you posted. |
Here is the config straight from the pod (nats connection string omitted):
I did scale down/up thing and I still get this:
Is there any chance this setting is behaving just like a cursors.stream.partitions? That one can not be changed once it is set. |
I did some local testing and debugging. The setting cursors.stream.auto.pause.time is indeed used only during the cursors stream creation in cursorManager.Initialize(). |
You're right. For some reason I was thinking that config could be changed after the cursors stream was created. I can probably include that as part of the forthcoming consumer groups release. |
The This will be included in the release forthcoming today. |
@abunjevac Are you able to test this with Liftbridge 1.8.0? This supports modifying |
Yes, I've been monitoring this for the last few days. Everything is fine now. The log is nice and clean. You can close this issue. Thanks. |
I'm inclined to keep the issue open since resuming streams should really not cause a deadline exceeded error. I'm thinking there may be a race condition between the cursor partition resume operation and the cursor publish that Liftbridge does internally. Basically, disabling I'd like to investigate this further. |
Sure. That makes sense. I'll be happy to test with pause enabled when you release a patch. |
Every now and then SetCursor API fails with "context deadline exceeded". Could it be something related to a paused cursors stream?
Liftbridge v1.7.1 on kubernetes.
liftbridge-0:
liftbridge-1:
liftbridge-2:
The text was updated successfully, but these errors were encountered: