-
Notifications
You must be signed in to change notification settings - Fork 297
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Installation with agent installer or assisted installer with UPI on baremetal fails for v4.16.0-0.okd-scos-2024-08-21-155613 #2018
Comments
Hi @titou10titou10, I tried the workaround but I think rhel is missing zincati
and installation stuck at |
I spoke too soon, It took some hours to get reflected in the console. It turns out the And the bootkube commands take a while and while running doesn't create any logs in systemctl or change status while in running. There was one issue though had to run this code to fix the network I am setting up single node installation cat << EOF | tee /etc/kubernetes/cni/net.d/10-containerd-net.conflist
{
"cniVersion": "1.0.0",
"name": "containerd-net",
"plugins": [
{
"type": "bridge",
"bridge": "cni0",
"isGateway": true,
"ipMasq": true,
"promiscMode": true,
"ipam": {
"type": "host-local",
"ranges": [
[{
"subnet": "10.128.0.0/14"
}]
],
"routes": [
{ "dst": "0.0.0.0/0" },
{ "dst": "::/0" }
]
}
},
{
"type": "portmap",
"capabilities": {"portMappings": true},
"externalSetMarkChain": "KUBE-MARK-MASQ"
}
]
}
EOF Ref: #1966 |
I'm not sure what exactly your code is doing but maybe you are not aware that "extra" manifests can be added before the creation of the iso image. Inside the directory where you set the install-config and agent-config files, create an "openshift" directory and create additional manifests: Refs: This page seems related to what you are doing, and maybe you can create a manifest with it and put in under the install/openshift directory? In my install, I have this extra "network-03-config.yaml" manifest file in install/openshift:
|
When I boot the OKD control for first time the network plugin was not configured in journalctl I had log saying After some time I restarted the server actually couple of time after that ovn was not working at all. So I am trying to reinstall. I had some issues in my network I resolved them let's see if it works or not. |
I was being too desperate it took some time and then the But @titou10titou10 thanks a lot for the investigation it was really big help saved a ton of time. |
Context
Trying to install a cluster (3 masters + 2 workers):
It is important to note that the install works perfectly well with the exact same agent and install config files for
Summary
It fails with the following error from the "release-image-pivot" service:
The cause of the problem is the OS image used as bootstrap: fedora-coreos-39.20231101.3.0-live.x86_64.iso
Details
All the details with debug info and configuration files are described in this discussion. The logs there etc are for v4.16.0-0.okd-scos-2024-08-01-132038 but they are the same for v4.16.0-0.okd-scos-2024-08-21-155613
Workarounds
Overriding the bootstrap OS image with a RHCOS image make the installation succeed
I did not choose a random bootstrap OS image, this is the one for v4.16 specified for an OCP installation via the ABI as specified here: https://github.com/openshift/assisted-service/blob/d3324b06a7c7772f4619c3ab13dd8c0706e55fd9/deploy/podman/configmap.yml#L25
It's probably possible to use another rhcos image as during the install process, the nodes upgrades to v418.9.202408211033-0
Workaround for an Agent Installer (ABI) successful install:
Before building the ISO image, override the bootstrap OS image like this:
Workaround for an Assisted Installer successfull install:
The procedure is described here: https://github.com/openshift/assisted-service/tree/master/deploy/podman
In the okd-configmap.yml file, replace (at least) the following variables:
The text was updated successfully, but these errors were encountered: