-
Notifications
You must be signed in to change notification settings - Fork 73
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
EventConsumer stops consuming after a certain period #77
Comments
A couple of things you could look at: Is there anything suspicious in the broker/zookeeper logs? |
Thanks : ) Could you please tell me how to enable logging of RdKafka? I turned on config["debug"] = "cgrp", but couldn't find where the log file is... |
It goes to stdout by default, you can override that via https://github.com/ah-/rdkafka-dotnet/blob/master/src/RdKafka/Config.cs#L66 |
Thanks! The logs of Brokers and ZK look good. And I captured the log when this "stop consuming issue" happens, in the following file: in which, the group seems did a series of [CGRPSTATE].. Any idea of why stop consuming? |
There's a couple of connection issues in that log:
How many brokers do you have running? Is it possible that there are network issues? |
We are running 18 Brokers. Quick question: is it possible to enable more detailed logging than config["debug"] = "cgrp"? Seems that current logging didn't tell much after consuming stops.. |
Wow, that's a lot of brokers! Yes, try one of these: See https://github.com/edenhill/librdkafka/blob/master/CONFIGURATION.md. Also @edenhill suggests trying out RdKafka 0.9.2-ci-177 which has a couple of edge case fixes around cgroups. It's based on the final 0.9.2 librdkafka and should be fairly stable. |
Hi, we observed an issue that EventConsumer stops consuming after a random amount of time (e.g., 30 min, 2 hr, 5 hr, 1 day, etc).
We tried to imitate the EventConsumer example of RdKafka, initializing our EventConsumer with follow code:
Then we use a new Thread to execute the following method, which simply starts consuming and commits every 6,000 messages:
We have 16 machines for the EventConsumer cluster, and they could consume messages fine at start (with incoming qps = 3,000 for each machine). But after a certain period, all 16 machines stop consuming at the same time with no error message.
Any idea for this issue? Thanks.
The text was updated successfully, but these errors were encountered: