Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

bin/helpers: check for unexpected LXD warnings/errors during cleanup() #135

Closed
wants to merge 1 commit into from

Conversation

simondeziel
Copy link
Member

This brings the question of should we keep warning about this:

time="2024-04-09T19:46:28-04:00" level=warning msg=" - Couldn't find the CGroup network priority controller, per-instance network priority will be ignored. Please use per-device limits.priority instead"

I have this warning locally despite not making conscious use of any network priority. I've also ACK'ed the corresponding lxc warning but it doesn't silence that message.

@simondeziel simondeziel marked this pull request as ready for review April 10, 2024 14:14
@simondeziel
Copy link
Member Author

FYI, this is expected to fail on many tests until canonical/lxd#13299 is merged.

Copy link
Member

@mihalicyn mihalicyn left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM! Thanks!

@simondeziel
Copy link
Member Author

This turned up many more errors than I expected, some of which I think shouldn't be errors like those:

time="2024-04-10T14:14:50Z" level=error msg="SFTP server stopped with error" err="signal: terminated" instance=cluster-2783-1 instanceType=container project=default stderr=
time="2024-04-10T14:10:39Z" level=error msg="Failed to statfs" err="no such file or directory" instance=c1 instanceType=container path=tmpfs project=default

Should I postpone this PR and open bugs in LXD to have those silenced or reported at a debug/info level maybe?

Also, I'll have to rethink how problems are ignored because many of them are legitimate. What would you think of having a list of messages regex to ignore at the top of scripts where some errors are to be expected? This way, if we take the storage-vm btrfs test for example, we'd probably want to ignore those:

 time="2024-04-10T14:22:28Z" level=warning msg="Unable to use virtio-fs for config drive, using 9p as a fallback" err="Stateful migration unsupported" instance=v1 instanceType=virtual-machine project=default
time="2024-04-10T14:22:28Z" level=warning msg="Using writeback cache I/O" devPath=/var/snap/lxd/common/lxd/storage-pools/vmpool-btrfs-3267/virtual-machines/v1/root.img device=root fsType=btrfs instance=v1 instanceType=virtual-machine project=default

Some other errors like in https://github.com/canonical/lxd-ci/actions/runs/8632671674/job/23663945355?pr=135 are a bit more unclear to me. Maybe they are just noise due to opportunistic cleanup done by LXD.

@simondeziel simondeziel marked this pull request as draft April 10, 2024 14:51
@tomponline
Copy link
Member

Should I postpone this PR and open bugs in LXD to have those silenced or reported at a debug/info level maybe?

I think WARN level would be better for these

tomponline added a commit to canonical/lxd that referenced this pull request Nov 5, 2024
@tomponline
Copy link
Member

@simondeziel do you still want this to stay open?

@simondeziel simondeziel deleted the lxd-warnings-errors branch December 20, 2024 16:41
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants