{WIP} To run some scale tests to understand the recent failures #16296
hakuna-matatah
commented
Jan 29, 2024
- Investigate AWS 5K test failures that started happening in last 10 days test-infra#31755
/test presubmit-kops-aws-small-scale-amazonvpc-using-cl2
Testing on small scale to see if the Prometheus stack comes up properly.
/test presubmit-kops-aws-scale-amazonvpc-using-cl2
/retest
It looks like Prometheus comes up fine in the small-scale tests.
That suggests resource constraints at large scale, such as 5k, may be preventing Prometheus from coming up. But we schedule the Prometheus pod on
From the latest periodic run, it appears that the Prometheus pod is getting scheduled as expected on
Has expected tolerations to schedule on
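For anyone re-checking the scheduling side, here is a minimal Python sketch of how a pod's tolerations are matched against a node's taints. It is an illustration of the general Kubernetes matching rule, not kops or scheduler code. The class and function names and the taint key `example.com/dedicated` are hypothetical and do not come from this PR.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Taint:
    key: str
    effect: str  # "NoSchedule", "PreferNoSchedule", or "NoExecute"
    value: str = ""


@dataclass(frozen=True)
class Toleration:
    key: str = ""            # empty key is only valid with operator "Exists"
    operator: str = "Equal"  # "Equal" (default) or "Exists"
    value: str = ""
    effect: str = ""         # empty effect matches every taint effect


def tolerates(tol: Toleration, taint: Taint) -> bool:
    """Return True if this single toleration tolerates this single taint."""
    if tol.effect and tol.effect != taint.effect:
        return False
    if tol.key and tol.key != taint.key:
        return False
    if tol.operator == "Exists":
        return True
    # "Equal" needs a key; this sketch rejects an empty key with "Equal".
    if not tol.key:
        return False
    return tol.value == taint.value


def untolerated_taints(taints, tolerations):
    """List the taints that would keep the pod off the node.

    PreferNoSchedule is a soft preference, so it is not treated as blocking.
    """
    return [
        t for t in taints
        if t.effect in ("NoSchedule", "NoExecute")
        and not any(tolerates(tol, t) for tol in tolerations)
    ]


# Hypothetical example: a node tainted for dedicated monitoring workloads.
node_taints = [Taint("example.com/dedicated", "NoSchedule", "monitoring")]
pod_tolerations = [Toleration(key="example.com/dedicated", operator="Exists")]
blocking = untolerated_taints(node_taints, pod_tolerations)
```

An empty `blocking` list means taints are not what keeps the pod off the node, so the next things to look at would be resource requests and node selectors.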
This looks suspicious, why this happens because of
May be related to #16300.
/test presubmit-kops-aws-scale-amazonvpc-using-cl2
Looks like the above fix around
Closing this as it has served its purpose, summarized here: kubernetes/test-infra#31755 (comment). Periodics are now succeeding.