Invalid prometheus configuration, "lint error 45 duplicate rule(s) found." after deploying hardware-observer #152
Comments
Removing the hw-obs <-> g-agent <-> COS prometheus/loki relations and recreating the loki and prometheus pods brings prometheus and loki back to active/idle status. After removing all relations between g-agent/hw-observer and COS, switching to a single hw-observer application, and re-adding the relations, I now have this again:
with the same errors in the debug-log. This might be caused by old alert rules not being removed or updated when relations are removed and re-added.
This may be related to this issue, which we are handling. We will take a look.
Thanks, it sounds like this is the case. I managed to "workaround" by removing duplicated rules manually from |
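For anyone attempting the same manual cleanup, here is a minimal sketch for spotting the duplicates first. It assumes a duplicate means two rules with the same alert/record name and the same labels, and that the rule groups have already been loaded into Python dicts (for example from a YAML rules file). The function name `find_duplicate_rules` and the sample rules are hypothetical and are not taken from the charms.

```python
from collections import Counter


def find_duplicate_rules(groups):
    """Return (name, labels) keys that appear more than once across all groups.

    `groups` is a list of rule-group dicts shaped like the `groups:` section
    of a Prometheus rules file. Assumption: a rule is a duplicate when both
    its alert/record name and its labels match another rule.
    """
    counts = Counter()
    for group in groups:
        for rule in group.get("rules", []):
            name = rule.get("alert") or rule.get("record")
            # Sort label items so the key does not depend on dict ordering.
            labels = tuple(sorted((rule.get("labels") or {}).items()))
            counts[(name, labels)] += 1
    return [key for key, seen in counts.items() if seen > 1]


# Hypothetical sample data: the same alert forwarded by two agents.
sample_groups = [
    {
        "name": "hw_observer_via_agent_a",
        "rules": [
            {"alert": "HighTemp", "expr": "temp > 90",
             "labels": {"severity": "critical"}},
        ],
    },
    {
        "name": "hw_observer_via_agent_b",
        "rules": [
            {"alert": "HighTemp", "expr": "temp > 90",
             "labels": {"severity": "critical"}},
            {"alert": "FanFailure", "expr": "fan_ok == 0",
             "labels": {"severity": "warning"}},
        ],
    },
]

duplicates = find_duplicate_rules(sample_groups)
for name, labels in duplicates:
    print(f"duplicate rule: {name} {dict(labels)}")
```

This only reports candidates; which copy to delete still has to be decided by hand.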
Update: I don't see any alert rules in Loki specific to hw-observer, so I believe this is an issue with multiple grafana-agent instances.
Running multiple hardware-observer applications in the same cluster is not supported. Would you mind sharing why you need it? I don't think it's supported in grafana-agent either.
Examples of why multiple hw-observer instances might need to be deployed:
After discussing with @przemeklal, this issue seems to be between grafana-agent and prometheus.
Versions:
hardware-observer rev 27
prometheus-k8s rev 159
grafana-agent rev 16
Relating hw-observer to a grafana-agent that is related to COS Prometheus left Prometheus in a blocked state:
with the following errors in the debug-log output:
We have multiple hardware-observer and grafana-agent applications running on this cluster, which may be the cause, though I believe such deployments should be supported.