Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

qemu job failed at "rpc error: code = Unknown desc = unable to configur e cgroups: no such file or directory" #23250

Closed
joechchen opened this issue Jun 6, 2024 · 9 comments · Fixed by #23466

Comments

@joechchen
Copy link

joechchen commented Jun 6, 2024

Nomad version

Nomad v1.8.0
BuildDate 2024-05-28T17:38:17Z
Revision 28b82e4

Operating system and Environment details

Ubuntu 22.04.4 LTS

Issue

qemu job failed at:

...
2024-06-06T13:45:20.490-0400 [ERROR] client.alloc_runner.task_runner: running driver failed: alloc_id=40bd6120-39b6-5f19-7416-322de0c20ad5 task=freedos error="rpc error: code = Unknown desc = unable to configur
e cgroups: no such file or directory"
...

The same job runs fine on v1.7.1 and prior.

Reproduction steps

nomad run freedos.nomad
nomad monitor -log-level=DEBUG

Expected Result

The job should run.

Actual Result

The job failed.

Job file (if appropriate)

job "freedos" {
  datacenters = ["dc1"]

  group "g1" {
    task "freedos" {
      resources {
        cpu    = 500
        memory = 256
      }

      artifact {
        source      = "https://github.com/angrycub/nomad_example_jobs/raw/main/applications/vms/freedos/freedos.img.tgz"
        destination = "local"
        options {
          checksum  = "sha256:8d2817126bf46ba2b4fca0b0c49eed2cc208c6f6448651e82c6d973fcba36569"
        }
      }

      driver = "qemu"

      config {
        image_path  = "local/freedos.img"
        accelerator = "kvm"
        args = []
      }
    }
  }
}

Nomad Server logs (if appropriate)

Nomad Client logs (if appropriate)

2024-06-06T13:45:20.436-0400 [DEBUG] client.artifact: sub-process: OUTPUT="artifact download was a success"
2024-06-06T13:45:20.436-0400 [TRACE] client.alloc_runner.task_runner.task_hook.procisolation: setting up client process management: alloc_id=40bd6120-39b6-5f19-7416-322de0c20ad5 task=freedos task=40bd6120/freed
os
2024-06-06T13:45:20.437-0400 [TRACE] client.proclib: setup client process management: task=40bd6120/freedos
2024-06-06T13:45:20.437-0400 [TRACE] client.proclib: initialize cgroup: task=40bd6120/freedos
2024-06-06T13:45:20.438-0400 [DEBUG] client.driver_mgr.qemu: starting QEMU VM command : driver=qemu args="/usr/bin/qemu-system-x86_64 -machine type=pc,accel=kvm -name freedos.img -m 256M -drive file=local/freed
os.img,if=ide -nographic -vnc 127.0.0.1:1 -enable-kvm -cpu host"
2024-06-06T13:45:20.438-0400 [DEBUG] client.driver_mgr.qemu.executor: starting plugin: alloc_id=40bd6120-39b6-5f19-7416-322de0c20ad5 driver=qemu task_name=freedos path=/usr/bin/nomad args=["/usr/bin/nomad", "ex
ecutor", "{\"LogFile\":\"/opt/nomad/data/alloc/40bd6120-39b6-5f19-7416-322de0c20ad5/freedos/freedos-executor.out\",\"LogLevel\":\"debug\",\"FSIsolation\":false,\"Compute\":{\"tc\":15600,\"nc\":4}}"]
2024-06-06T13:45:20.439-0400 [DEBUG] client.driver_mgr.qemu.executor: plugin started: alloc_id=40bd6120-39b6-5f19-7416-322de0c20ad5 driver=qemu task_name=freedos path=/usr/bin/nomad pid=2667699
2024-06-06T13:45:20.439-0400 [DEBUG] client.driver_mgr.qemu.executor: waiting for RPC address: alloc_id=40bd6120-39b6-5f19-7416-322de0c20ad5 driver=qemu task_name=freedos plugin=/usr/bin/nomad
2024-06-06T13:45:20.485-0400 [DEBUG] client.driver_mgr.qemu.executor: using plugin: alloc_id=40bd6120-39b6-5f19-7416-322de0c20ad5 driver=qemu task_name=freedos version=2
2024-06-06T13:45:20.486-0400 [TRACE] client.driver_mgr.qemu.executor.stdio: waiting for stdio data: alloc_id=40bd6120-39b6-5f19-7416-322de0c20ad5 driver=qemu task_name=freedos
2024-06-06T13:45:20.488-0400 [DEBUG] client.driver_mgr.qemu.executor.stdio: received EOF, stopping recv loop: alloc_id=40bd6120-39b6-5f19-7416-322de0c20ad5 driver=qemu task_name=freedos err="rpc error: code = U
navailable desc = error reading from server: EOF"
2024-06-06T13:45:20.490-0400 [INFO]  client.driver_mgr.qemu.executor: plugin process exited: alloc_id=40bd6120-39b6-5f19-7416-322de0c20ad5 driver=qemu task_name=freedos plugin=/usr/bin/nomad id=2667699
2024-06-06T13:45:20.490-0400 [DEBUG] client.driver_mgr.qemu.executor: plugin exited: alloc_id=40bd6120-39b6-5f19-7416-322de0c20ad5 driver=qemu task_name=freedos
2024-06-06T13:45:20.490-0400 [INFO]  client.alloc_runner.task_runner: Task event: alloc_id=40bd6120-39b6-5f19-7416-322de0c20ad5 task=freedos type="Driver Failure" msg="rpc error: code = Unknown desc = unable to
 configure cgroups: no such file or directory" failed=false
2024-06-06T13:45:20.490-0400 [ERROR] client.alloc_runner.task_runner: running driver failed: alloc_id=40bd6120-39b6-5f19-7416-322de0c20ad5 task=freedos error="rpc error: code = Unknown desc = unable to configur
e cgroups: no such file or directory"
2024-06-06T13:45:20.490-0400 [INFO]  client.alloc_runner.task_runner: not restarting task: alloc_id=40bd6120-39b6-5f19-7416-322de0c20ad5 task=freedos reason="Error was unrecoverable"
2024-06-06T13:45:20.490-0400 [TRACE] client.alloc_runner.task_runner: setting task state: alloc_id=40bd6120-39b6-5f19-7416-322de0c20ad5 task=freedos state=dead
@fjoenichols
Copy link

fjoenichols commented Jun 10, 2024

I am also seeing the same behavior on nomad v1.8.0 attempting to use the qemu example at https://github.com/angrycub/nomad_example_jobs/blob/main/qemu/tc_ssh.nomad

Job file (if appropriate)

job "j1" {
  datacenters = ["dc1"]

  group "g1" {

    network {
      port "http" { 
        to = -1
      }
      port "ssh" {
        to = -1
      }
    }

    service {
      tags = ["tag1"]
      port = "http"

      check {
        type     = "http"
        port     = "http"
        path     = "/index.html"
        interval = "10s"
        timeout  = "2s"
      }
    }

    task "t1" {
      template {
        data = <<EOH
      Guest System
      EOH

        destination = "local/index.html"
      }

      artifact {
        source = "https://github.com/angrycub/nomad_example_jobs/raw/main/qemu/tinycore.qcow2"
        destination = "local/images"
      }

      driver = "qemu"

      config {
        image_path = "local/images/tinycore.qcow2"

        ## Uncomment if KVM is available on your system
        accelerator = "kvm"

        args = [
          "-device",
          "e1000,netdev=user.0",
          "-netdev",
          "user,id=user.0,hostfwd=tcp::${NOMAD_PORT_http}-:80,hostfwd=tcp::${NOMAD_PORT_ssh}-:22",
          "-drive", "file=fat:rw:./local,format=raw,media=disk"
        ]
      }
    }
  }
}

Nomad Client logs (if appropriate)

2024-06-10T15:11:30.137Z [DEBUG] client.alloc_runner.task_runner.task_hook.logmon: plugin started: alloc_id=5c5b53f6-cee4-e164-760f-fdfbdf46683f task=t1 path=/usr/bin/nomad pid=2976713
2024-06-10T15:11:30.137Z [DEBUG] client.alloc_runner.task_runner.task_hook.logmon: waiting for RPC address: alloc_id=5c5b53f6-cee4-e164-760f-fdfbdf46683f task=t1 plugin=/usr/bin/nomad
2024-06-10T15:11:30.141Z [DEBUG] consul.sync: sync complete: registered_services=1 deregistered_services=0 registered_checks=0 deregistered_checks=0
2024-06-10T15:11:30.158Z [DEBUG] client.alloc_runner.task_runner.task_hook.logmon.nomad: plugin address: alloc_id=5c5b53f6-cee4-e164-760f-fdfbdf46683f task=t1 @module=logmon address=/tmp/plugin379596103 network=unix timestamp=2024-06-10T15:11:30.158Z
2024-06-10T15:11:30.158Z [DEBUG] client.alloc_runner.task_runner.task_hook.logmon: using plugin: alloc_id=5c5b53f6-cee4-e164-760f-fdfbdf46683f task=t1 version=2
2024-06-10T15:11:30.159Z [DEBUG] client.alloc_runner.task_runner.task_hook.logmon.nomad: opening fifo: alloc_id=5c5b53f6-cee4-e164-760f-fdfbdf46683f task=t1 @module=logmon path=/opt/nomad/alloc/5c5b53f6-cee4-e164-760f-fdfbdf46683f/alloc/logs/.t1.stdout.fifo timestamp=2024-06-10T15:11:30.159Z
2024-06-10T15:11:30.159Z [DEBUG] client.alloc_runner.task_runner.task_hook.logmon.nomad: opening fifo: alloc_id=5c5b53f6-cee4-e164-760f-fdfbdf46683f task=t1 path=/opt/nomad/alloc/5c5b53f6-cee4-e164-760f-fdfbdf46683f/alloc/logs/.t1.stderr.fifo @module=logmon timestamp=2024-06-10T15:11:30.159Z
2024-06-10T15:11:30.161Z [INFO]  client.alloc_runner.task_runner: Task event: alloc_id=5c5b53f6-cee4-e164-760f-fdfbdf46683f task=t1 type="Downloading Artifacts" msg="Client is downloading artifacts" failed=false
2024-06-10T15:11:30.161Z [DEBUG] client.alloc_runner.task_runner.task_hook.artifacts: downloading artifact: alloc_id=5c5b53f6-cee4-e164-760f-fdfbdf46683f task=t1 artifact=https://github.com/angrycub/nomad_example_jobs/raw/main/qemu/tinycore.qcow2 aid="ndrnqA8yoTa8moykwN3ROW1+1fW4HMrui9iaat+T0Ag"
2024-06-10T15:11:30.161Z [DEBUG] client.artifact: get: source=https://github.com/angrycub/nomad_example_jobs/raw/main/qemu/tinycore.qcow2 destination=local/images
2024-06-10T15:11:30.652Z [DEBUG] client.artifact: sub-process: OUTPUT="artifact download was a success"
2024-06-10T15:11:30.656Z [INFO]  agent: (runner) creating new runner (dry: false, once: false)
2024-06-10T15:11:30.656Z [DEBUG] agent: (runner) final config: {"Consul":{"Address":"localhost:8501","Namespace":"","Auth":{"Enabled":false,"Username":""},"Retry":{"Attempts":0,"Backoff":250000000,"MaxBackoff":60000000000,"Enabled":true},"SSL":{"CaCert":"/etc/vault.d/tls/consul-ca.pem","CaCertBytes":"","CaPath":"","Cert":"","Enabled":true,"Key":"","ServerName":"","Verify":true},"Token":"8b7345b9-8e26-35f4-1452-2eb5b729ae39","TokenFile":"","Transport":{"CustomDialer":null,"DialKeepAlive":30000000000,"DialTimeout":30000000000,"DisableKeepAlives":false,"IdleConnTimeout":5000000000,"MaxIdleConns":0,"MaxIdleConnsPerHost":100,"MaxConnsPerHost":0,"TLSHandshakeTimeout":10000000000}},"Dedup":{"Enabled":false,"MaxStale":2000000000,"Prefix":"consul-template/dedup/","TTL":15000000000,"BlockQueryWaitTime":60000000000},"DefaultDelims":{"Left":null,"Right":null},"Exec":{"Command":[],"Enabled":false,"Env":{"Denylist":[],"Custom":[],"Pristine":false,"Allowlist":[]},"KillSignal":2,"KillTimeout":30000000000,"ReloadSignal":null,"Splay":0,"Timeout":0},"KillSignal":2,"LogLevel":"WARN","FileLog":{"LogFilePath":"","LogRotateBytes":0,"LogRotateDuration":86400000000000,"LogRotateMaxFiles":0},"MaxStale":315360000000000000,"PidFile":"","ReloadSignal":1,"Syslog":{"Enabled":false,"Facility":"LOCAL0","Name":"consul-template"},"Templates":[{"Backup":false,"Command":[],"CommandTimeout":30000000000,"Contents":"      Guest System\n","CreateDestDirs":true,"Destination":"/opt/nomad/alloc/5c5b53f6-cee4-e164-760f-fdfbdf46683f/t1/local/index.html","ErrMissingKey":false,"ErrFatal":true,"Exec":{"Command":[],"Enabled":false,"Env":{"Denylist":[],"Custom":[],"Pristine":false,"Allowlist":[]},"KillSignal":2,"KillTimeout":30000000000,"ReloadSignal":null,"Splay":0,"Timeout":30000000000},"Perms":420,"User":null,"Uid":null,"Group":null,"Gid":null,"Source":"","Wait":{"Enabled":false,"Min":0,"Max":0},"LeftDelim":"{{","RightDelim":"}}","FunctionDenylist":["plugin","writeToFile"],"SandboxPath":"/opt/nomad/alloc/5c5b53f6-cee4-e164-760f-fdfbdf46683f/t1","MapToEnvironmentVariable":""}],"TemplateErrFatal":null,"Vault":{"Address":"","Enabled":false,"Namespace":"","RenewToken":false,"Retry":{"Attempts":12,"Backoff":250000000,"MaxBackoff":60000000000,"Enabled":true},"SSL":{"CaCert":"","CaCertBytes":"","CaPath":"","Cert":"","Enabled":true,"Key":"","ServerName":"","Verify":true},"Transport":{"CustomDialer":null,"DialKeepAlive":30000000000,"DialTimeout":30000000000,"DisableKeepAlives":false,"IdleConnTimeout":5000000000,"MaxIdleConns":0,"MaxIdleConnsPerHost":100,"MaxConnsPerHost":0,"TLSHandshakeTimeout":10000000000},"UnwrapToken":false,"ClientUserAgent":null,"DefaultLeaseDuration":300000000000,"LeaseRenewalThreshold":0.9,"K8SAuthRoleName":"","K8SServiceAccountTokenPath":"/run/secrets/kubernetes.io/serviceaccount/token","K8SServiceAccountToken":"","K8SServiceMountPath":"kubernetes"},"Nomad":{"Address":"","Enabled":true,"Namespace":"default","SSL":{"CaCert":"","CaCertBytes":"","CaPath":"","Cert":"","Enabled":false,"Key":"","ServerName":"","Verify":true},"AuthUsername":"","AuthPassword":"","Transport":{"CustomDialer":{},"DialKeepAlive":30000000000,"DialTimeout":30000000000,"DisableKeepAlives":false,"IdleConnTimeout":5000000000,"MaxIdleConns":0,"MaxIdleConnsPerHost":100,"MaxConnsPerHost":0,"TLSHandshakeTimeout":10000000000},"Retry":{"Attempts":0,"Backoff":250000000,"MaxBackoff":60000000000,"Enabled":true}},"Wait":{"Enabled":true,"Min":5000000000,"Max":240000000000},"Once":false,"ParseOnly":false,"BlockQueryWaitTime":300000000000,"ErrOnFailedLookup":false}
2024-06-10T15:11:30.656Z [INFO]  agent: (runner) creating watcher
2024-06-10T15:11:30.656Z [INFO]  agent: (runner) starting
2024-06-10T15:11:30.656Z [DEBUG] agent: (runner) running initial templates
2024-06-10T15:11:30.656Z [DEBUG] agent: (runner) initiating run
2024-06-10T15:11:30.656Z [DEBUG] agent: (runner) checking template 7427b2c5513c1697de161296b216c3c2
2024-06-10T15:11:30.656Z [DEBUG] agent: (runner) rendering "(dynamic)" => "/opt/nomad/alloc/5c5b53f6-cee4-e164-760f-fdfbdf46683f/t1/local/index.html"
2024-06-10T15:11:30.667Z [INFO]  agent: (runner) rendered "(dynamic)" => "/opt/nomad/alloc/5c5b53f6-cee4-e164-760f-fdfbdf46683f/t1/local/index.html"
2024-06-10T15:11:30.667Z [DEBUG] agent: (runner) diffing and updating dependencies
2024-06-10T15:11:30.667Z [DEBUG] agent: (runner) watching 0 dependencies
2024-06-10T15:11:30.667Z [DEBUG] agent: (runner) all templates rendered
2024-06-10T15:11:30.667Z [DEBUG] agent: (runner) enabling global quiescence for "7427b2c5513c1697de161296b216c3c2"
2024-06-10T15:11:30.668Z [DEBUG] client.driver_mgr.qemu: starting QEMU VM command : driver=qemu args="/usr/bin/qemu-system-x86_64 -machine type=pc,accel=kvm -name tinycore.qcow2 -m 300M -drive file=local/images/tinycore.qcow2,if=ide -nographic -device e1000,netdev=user.0 -netdev user,id=user.0,hostfwd=tcp::21172-:80,hostfwd=tcp::21849-:22 -drive file=fat:rw:./local,format=raw,media=disk -enable-kvm -cpu host"
2024-06-10T15:11:30.669Z [DEBUG] client.driver_mgr.qemu.executor: starting plugin: alloc_id=5c5b53f6-cee4-e164-760f-fdfbdf46683f driver=qemu task_name=t1 path=/usr/bin/nomad args=["/usr/bin/nomad", "executor", "{\"LogFile\":\"/opt/nomad/alloc/5c5b53f6-cee4-e164-760f-fdfbdf46683f/t1/t1-executor.out\",\"LogLevel\":\"debug\",\"FSIsolation\":false,\"Compute\":{\"tc\":136800,\"nc\":48}}"]
2024-06-10T15:11:30.669Z [DEBUG] client.driver_mgr.qemu.executor: plugin started: alloc_id=5c5b53f6-cee4-e164-760f-fdfbdf46683f driver=qemu task_name=t1 path=/usr/bin/nomad pid=2976765
2024-06-10T15:11:30.669Z [DEBUG] client.driver_mgr.qemu.executor: waiting for RPC address: alloc_id=5c5b53f6-cee4-e164-760f-fdfbdf46683f driver=qemu task_name=t1 plugin=/usr/bin/nomad
2024-06-10T15:11:30.689Z [DEBUG] client.driver_mgr.qemu.executor: using plugin: alloc_id=5c5b53f6-cee4-e164-760f-fdfbdf46683f driver=qemu task_name=t1 version=2
2024-06-10T15:11:30.691Z [DEBUG] client.driver_mgr.qemu.executor.stdio: received EOF, stopping recv loop: alloc_id=5c5b53f6-cee4-e164-760f-fdfbdf46683f driver=qemu task_name=t1 err="rpc error: code = Unavailable desc = error reading from server: EOF"
2024-06-10T15:11:30.693Z [INFO]  client.driver_mgr.qemu.executor: plugin process exited: alloc_id=5c5b53f6-cee4-e164-760f-fdfbdf46683f driver=qemu task_name=t1 plugin=/usr/bin/nomad id=2976765
2024-06-10T15:11:30.693Z [DEBUG] client.driver_mgr.qemu.executor: plugin exited: alloc_id=5c5b53f6-cee4-e164-760f-fdfbdf46683f driver=qemu task_name=t1
2024-06-10T15:11:30.693Z [INFO]  client.alloc_runner.task_runner: Task event: alloc_id=5c5b53f6-cee4-e164-760f-fdfbdf46683f task=t1 type="Driver Failure" msg="rpc error: code = Unknown desc = unable to configure cgroups: no such file or directory" failed=false
2024-06-10T15:11:30.694Z [ERROR] client.alloc_runner.task_runner: running driver failed: alloc_id=5c5b53f6-cee4-e164-760f-fdfbdf46683f task=t1 error="rpc error: code = Unknown desc = unable to configure cgroups: no such file or directory"
2024-06-10T15:11:30.694Z [INFO]  client.alloc_runner.task_runner: not restarting task: alloc_id=5c5b53f6-cee4-e164-760f-fdfbdf46683f task=t1 reason="Error was unrecoverable"
2024-06-10T15:11:30.694Z [INFO]  client.alloc_runner.task_runner: Task event: alloc_id=5c5b53f6-cee4-e164-760f-fdfbdf46683f task=t1 type="Not Restarting" msg="Error was unrecoverable" failed=true
2024-06-10T15:11:30.974Z [DEBUG] client: updated allocations: index=10455 total=5 pulled=0 filtered=5
2024-06-10T15:11:30.974Z [DEBUG] client: allocation updates: added=0 removed=0 updated=0 ignored=5
2024-06-10T15:11:30.974Z [DEBUG] client: allocation updates applied: added=0 removed=0 updated=0 ignored=5 errors=0
2024-06-10T15:11:34.696Z [WARN]  client.alloc_runner.task_runner.task_hook.logmon.nomad: timed out waiting for read-side of process output pipe to close: alloc_id=5c5b53f6-cee4-e164-760f-fdfbdf46683f task=t1 @module=logmon timestamp=2024-06-10T15:11:34.696Z
2024-06-10T15:11:34.696Z [WARN]  client.alloc_runner.task_runner.task_hook.logmon.nomad: timed out waiting for read-side of process output pipe to close: alloc_id=5c5b53f6-cee4-e164-760f-fdfbdf46683f task=t1 @module=logmon timestamp=2024-06-10T15:11:34.696Z

@tgross
Copy link
Member

tgross commented Jun 24, 2024

Hi @joechchen! It looks like the task driver is trying to configure cgroups for the process we use to manage QEMU, but that's failing for some reason related to the cgroup setup on the host.

A couple of questions:

  • Are you running the Nomad client as root?
  • Are you running Nomad in a custom cgroup?
  • Can you provide the logs for the executor process? Those can be found in the allocation directory after a failure, in a location like: /opt/nomad/alloc/5c5b53f6-cee4-e164-760f-fdfbdf46683f/t1/t1-executor.out (this is from your client logs).

@tgross tgross self-assigned this Jun 24, 2024
@tgross tgross moved this from Needs Triage to In Progress in Nomad - Community Issues Triage Jun 24, 2024
@tgross tgross moved this from In Progress to Triaging in Nomad - Community Issues Triage Jun 24, 2024
@mwright-pivotal
Copy link

hello @tgross - I am seeing this in my homelab as well. Nomad on Ubuntu 22.04 bare metal and KVM support

I see this log output in the alloc directory:

root@host2:/opt/nomad/client/client_datadir/alloc/24e03b91-5d09-d7bc-4149-71eafdc4fa89/virtual# ls
local private secrets tmp virtual-executor.out
root@host2:/opt/nomad/client/client_datadir/alloc/24e03b91-5d09-d7bc-4149-71eafdc4fa89/virtual# cat virtual-executor.out
{"@Level":"debug","@message":"plugin address","@timestamp":"2024-06-25T20:03:45.927311Z","address":"/tmp/plugin2125446520","network":"unix"}
{"@Level":"error","@message":"failed to configure container, process isolation will not work","@module":"executor","@timestamp":"2024-06-25T20:03:45.930127Z","error":"no such file or directory"}

my job definition is here: https://github.com/mwright-pivotal/learn-terraform-cloud-agents/blob/main/windows2022vm-job.hcl

@mwright-pivotal
Copy link

I am running the client as root but have not done anything to setup a custom cgroup

@tgross
Copy link
Member

tgross commented Jun 27, 2024

Ok, thanks folks. I did what I should have done a few days ago when I first touched this issue, which is to run the example job @fjoenichols mentioned here: #23250 (comment). 🤦 I can reproduce the problem exactly now in my local development environment on Linux. I'll dig into this and report back when I know more.

@tgross
Copy link
Member

tgross commented Jun 28, 2024

Draft PR is up here: #23466

@tgross tgross added this to the 1.8.2 milestone Jun 28, 2024
@tgross tgross moved this from Triaging to In Progress in Nomad - Community Issues Triage Jun 28, 2024
tgross added a commit that referenced this issue Jul 1, 2024
As part of the work for 1.7.0 we moved portions of the task cgroup setup down
into the executor. This requires that the executor constructor get the
`TaskConfig.Resources` struct, and this was missing from the `qemu` driver. We
fixed a panic caused by this change in #19089 before we shipped, but this fix
was effectively undo after we added plumbing for custom cgroups for `raw_exec`
in 1.8.0. As a result, running `qemu` tasks always fail on Linux.

This was undetected in testing because our CI environment doesn't have QEMU
installed. I've got all the unit tests running locally again and have added QEMU
installation when we're running the drivers tests.

Fixes: #23250
@tgross tgross closed this as completed in eedbd36 Jul 1, 2024
@github-project-automation github-project-automation bot moved this from In Progress to Done in Nomad - Community Issues Triage Jul 1, 2024
@tgross
Copy link
Member

tgross commented Jul 1, 2024

#23466 has been merged and will ship in Nomad 1.8.2 (with backports to Nomad Enterprise 1.7.x and 1.6.x )

@djthorpe
Copy link

Sorry - I created a new issue here:
#23595

Copy link

I'm going to lock this issue because it has been closed for 120 days ⏳. This helps our maintainers find and focus on the active issues.
If you have found a problem that seems similar to this, please open a new issue and complete the issue template so we can capture all the details necessary to investigate further.

@github-actions github-actions bot locked as resolved and limited conversation to collaborators Dec 20, 2024
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
Development

Successfully merging a pull request may close this issue.

5 participants