Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

emojivoto-demo: vote-bot deployment not coming up, image takes very long to pull #588

Closed
blenessy opened this issue Jun 17, 2024 · 7 comments
Assignees

Comments

@blenessy
Copy link
Contributor

blenessy commented Jun 17, 2024

Using contrast v0.7.0 I tried the emojivoto example again. Everything went smooth (according to the documentation) except that the vote-bot pod never started (confirmed with k9s).

A quick check in emojivoto-demo.yaml (attached) reveals that the io.katacontainers.config.agent.policy is missing from the vote-bot Deployment. Is the vote-bot Deployment suppose to be part of the TCB or not? In the documentation it is rendered in a different color (yellow), which suggests that it is NOT suppose to be trusted. Either way it is not starting properly.

emojivoto-demo.yml.gz

@katexochen katexochen self-assigned this Jun 17, 2024
@katexochen
Copy link
Member

The vote-bot in the demo simulates a user of the public service, and is not part of the confidential deployment. As such, it doesn't need a policy, and it also doesn't have a runtimeClass set.

Previous to v0.7.0, the policy generation tool hadn't had the ability to filter for runtime class when annotating policies, and thus would even add a policy to the bot, even when it didn't have the Contrast runtimeClass. This is now fixed.

Now trying to understand why the bot isn't starting for you. Can you provide any pod logs/ pods description?

@blenessy
Copy link
Contributor Author

Hold off please @katexochen . It seems to have started now. It was lagging (minutes) vs the other nodes.
I'll try to investigate why the lag occurred ...

@blenessy
Copy link
Contributor Author

blenessy commented Jun 17, 2024

Managed to dump all logs.

logs.tgz

For some reason kubectl describe did not include this event. Fortunately I noted i before:

Name:             vote-bot-64598fd555-z4h4l
...
Events:
  Type    Reason   Age   From     Message
  ----    ------   ----  ----     -------
  Normal  Pulled   51m   kubelet  Successfully pulled image "ghcr.io/3u13r/emojivoto-web:coco-1" in 20m23.926s (20m23.926s including waiting)
  Normal  Created  51m   kubelet  Created container vote-bot
  Normal  Started  51m   kubelet  Started container vote-bot

It looks like it took ~20m to pull ghcr.io/3u13r/emojivoto-web:coco-1, which definitely explains the problem @katexochen .

Quickly checked GitHub Status and there were no issues at the time.

My best guess is that it might be related to the CentralIndia Azure region, which I'm currently testing.

@blenessy
Copy link
Contributor Author

When trying to reproduce in a fresh AKS cluster, the vote-bot got deployed in nodegroup2 - like the rest of the services and it worked immediately. In my first attempt it got deployed to nodegroup1.

@katexochen
Copy link
Member

When trying to reproduce in a fresh AKS cluster, the vote-bot got deployed in nodegroup2 - like the rest of the services and it worked immediately. In my first attempt it got deployed to nodegroup1.

As the pod does not have any specific requirements, it's fine for Kubernetes to schedule it on either node group 1 or 2.

@blenessy
Copy link
Contributor Author

As nothing points to a problem in contrast - I'm closing this issue.

@katexochen
Copy link
Member

Thanks for the report anyway @blenessy. :)

@katexochen katexochen changed the title 0.7.0: contrast generate deployment/ ignores the vote-bot Deployment in the emojivoto example emojivoto-demo: vote-bot deployment not coming up, image takes very long to pull Jun 18, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants