Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Retreive containers metrics per applications #71

Open
ashangit opened this issue Dec 31, 2018 · 0 comments
Open

Retreive containers metrics per applications #71

ashangit opened this issue Dec 31, 2018 · 0 comments
Labels
enhancement New feature or request

Comments

@ashangit
Copy link
Contributor

Time to times we are seeing some applications requested lots of containers (up to 20 millions) from different frameworks tez, flink.
This leads to lots of pending containers on the cluster and are usually due to bad request or bugs.
It is not very easy to find which application is the root cause of this high containers request, only debug log level on org.apache.hadoop.yarn.server.resourcemanager.scheduler package helps to find the application.
It will be much easier to have garmadon reporting different metrics about containers (running, pending...) from each app and then display aa top 10 of app with pending containers in compute grafana dashboards

@ashangit ashangit added the enhancement New feature or request label Dec 31, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant