Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

WIS2 IMS: Alert Management Mechanism with Prometheus and Alertmanager #23

Open
kurt-hectic opened this issue Sep 10, 2024 · 0 comments
Open

Comments

@kurt-hectic
Copy link

In WIS2 operations issues with the system are detected by Global Brokers, Global Caches or Global Monitoring. This may lead to the creation of an incident in the WIS2 Incident Management System (IMS). Incidents are mapped to a country, classified and then assigned to either the GISC responsible for the country, or directly to the country. Incidents can be created automatically via an API, or manually.
The WIS2 IMS is operated by the WMO Secretariat and integrated with the WMO Experts database. IMS users are created for designated WIS2 Global Services operator contacts, Expert Team for WIS Operations members, and National Focal Points for WIS, with group membership determining their role in the IMS.

Due to redundant infrastructure in WIS2, issues can potentially be duplicated, for example if two Global Monitoring centers, or two subscribed Global Brokers detect an issue in a country.

The exact workflow that determines which issues lead to the creation of an incident in the IMS and by who, whether new issues are validated, and to who they are assigned will be tracked in the discussion of this issue.

Morocco has validated automatic creation of incidents, based on a Grafana, Prometheus and Alertmanager architecture, whereby once a custom alerting threshold on a metric is reached, a custom integration script leads to the creation of an incident in the WIS2 IMS via the Atlassian JIRA API.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant