24.02.0 Failed to verify multipath device serial. #941

Open
grubjack opened this issue Oct 24, 2024 · 3 comments
grubjack commented Oct 24, 2024

Describe the bug
Can't mount PV after trident-operator upgrade

Environment
Provide accurate information about the environment to help us reproduce the issue.

  • Trident-operator chart: [100.2402.0]
  • Trident version: [24.02.0]
  • Trident installation flags used:
  • Container runtime: [containerd 1.5.9]
  • Kubernetes version: [v1.23.17]
  • OS: [Ubuntu 20.04]
  • NetApp backend types: [ONTAP AFF 9.5]

Additional context

$ kubectl events
...
20s (x5 over 98s)      Warning   FailedMount              Pod/test-65689b588-kspfv    MountVolume.MountDevice failed for volume "pvc-47c47d19-c076-4b12-84ab-b35cacad7774": rpc error: code = Internal desc = rpc error: code = Internal desc = failed to stage volume: multipath device 'dm-76' serial check failed

$ kubectl -n trident logs trident-node-linux-g7p5w
...
time="2024-10-24T10:21:52Z" level=error msg="GRPC error: rpc error: code = Internal desc = rpc error: code = Internal desc = failed to stage volume: multipath device 'dm-76' serial check failed" logLayer=csi_frontend requestID=42c4607d-1845-4f64-974e-b7886594e323 requestSource=CSI
time="2024-10-24T10:22:32Z" level=error msg="Failed to verify multipath device serial." logLayer=csi_frontend lunSerialNumber="81LgM$V4waSp" lunSerialNumberHex=38314c674d24563477615370 multipathDevice=dm-76 multipathDeviceUUID="mpath-3600a098038314c674d2456347761534b\n" requestID=28be1fb9-aada-456f-b344-7d715190b9cd requestSource=CSI workflow="node_server=stage"
time="2024-10-24T10:22:36Z" level=error msg="Failed to verify multipath device serial." logLayer=csi_frontend lunSerialNumber="81LgM$V4waSp" lunSerialNumberHex=38314c674d24563477615370 multipathDevice=dm-76 multipathDeviceUUID="mpath-3600a098038314c674d2456347761534b\n" requestID=28be1fb9-aada-456f-b344-7d715190b9cd requestSource=CSI workflow="node_server=stage"
time="2024-10-24T10:22:42Z" level=error msg="Failed to verify multipath device serial." logLayer=csi_frontend lunSerialNumber="81LgM$V4waSp" lunSerialNumberHex=38314c674d24563477615370 multipathDevice=dm-76 multipathDeviceUUID="mpath-3600a098038314c674d2456347761534b\n" requestID=28be1fb9-aada-456f-b344-7d715190b9cd requestSource=CSI workflow="node_server=stage"
time="2024-10-24T10:22:42Z" level=error msg="GRPC error: rpc error: code = Internal desc = rpc error: code = Internal desc = failed to stage volume: multipath device 'dm-76' serial check failed" logLayer=csi_frontend requestID=28be1fb9-aada-456f-b344-7d715190b9cd requestSource=CSI
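
Reading the two fields in the log above side by side: lunSerialNumberHex ends in ...5370, while multipathDeviceUUID ends in ...534b, so the hex-encoded serial does not appear in the UUID of dm-76. The sketch below reproduces that comparison by hand. It assumes the check amounts to a substring test of the hex-encoded serial against the UUID, and that the last 24 hex digits of the UUID carry the 12-character serial; both are assumptions for illustration, not Trident's confirmed implementation, and the function names are hypothetical.

```python
def serial_to_hex(serial: str) -> str:
    """Hex-encode the ASCII bytes of a LUN serial number."""
    return serial.encode("ascii").hex()


def uuid_matches_serial(uuid: str, serial: str) -> bool:
    """Assumed check: the hex-encoded serial is a substring of the UUID."""
    return serial_to_hex(serial) in uuid.strip().lower()


def serial_from_uuid(uuid: str, hex_len: int = 24) -> str:
    """Decode the trailing hex digits of the UUID back to ASCII.

    Assumes the serial occupies the last `hex_len` hex digits
    (12 ASCII characters), as it appears to in the log above.
    """
    wwid = uuid.strip()
    if wwid.startswith("mpath-"):
        wwid = wwid[len("mpath-"):]
    return bytes.fromhex(wwid[-hex_len:]).decode("ascii")


# Values copied from the log lines above.
expected_serial = "81LgM$V4waSp"
device_uuid = "mpath-3600a098038314c674d2456347761534b\n"

print(serial_to_hex(expected_serial))                    # 38314c674d24563477615370
print(uuid_matches_serial(device_uuid, expected_serial)) # False
print(serial_from_uuid(device_uuid))                     # 81LgM$V4waSK
```

Under those assumptions, dm-76 carries a serial ending in "SK" while the volume being staged expects one ending in "Sp", which would be consistent with dm-76 being a leftover map for a different LUN.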

$ cat /etc/multipath.conf
defaults {
    find_multipaths no
    user_friendly_names yes
}

$ sudo multipath -ll | grep -A 5 dm-76
3600a098038314c674d2456347761534b dm-76 NETAPP,LUN C-Mode
size=8.0G features='3 queue_if_no_path pg_init_retries 50' hwhandler='1 alua' wp=rw
|-+- policy='service-time 0' prio=0 status=active
| `- 15:0:0:20  sdcx 70:80   failed faulty running
`-+- policy='service-time 0' prio=0 status=enabled
  `- 16:0:0:20  sddq 71:128  failed faulty running
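
Both paths under dm-76 are reported as "failed faulty running" in the output above. A minimal sketch for flagging such maps from captured `multipath -ll` text follows. The regular expression and the field names (dm_state, checker, online) are illustrative assumptions about the path-line layout shown here, not an official parser.

```python
import re

# Matches path lines such as:
#   | `- 15:0:0:20  sdcx 70:80   failed faulty running
PATH_LINE = re.compile(
    r"(?P<hctl>\d+:\d+:\d+:\d+)\s+(?P<dev>\S+)\s+(?P<majmin>\d+:\d+)\s+"
    r"(?P<dm_state>\S+)\s+(?P<checker>\S+)\s+(?P<online>\S+)"
)


def parse_paths(output: str) -> list:
    """Return one dict per path line found in `multipath -ll` output."""
    paths = []
    for line in output.splitlines():
        match = PATH_LINE.search(line)
        if match:
            paths.append(match.groupdict())
    return paths


def all_paths_failed(paths: list) -> bool:
    """True when there is at least one path and every path is 'failed'."""
    return bool(paths) and all(p["dm_state"] == "failed" for p in paths)


# Output copied from the issue.
SAMPLE = """3600a098038314c674d2456347761534b dm-76 NETAPP,LUN C-Mode
size=8.0G features='3 queue_if_no_path pg_init_retries 50' hwhandler='1 alua' wp=rw
|-+- policy='service-time 0' prio=0 status=active
| `- 15:0:0:20  sdcx 70:80   failed faulty running
`-+- policy='service-time 0' prio=0 status=enabled
  `- 16:0:0:20  sddq 71:128  failed faulty running
"""

paths = parse_paths(SAMPLE)
for p in paths:
    print(p["dev"], p["dm_state"], p["checker"], p["online"])
print("all paths failed:", all_paths_failed(paths))
```

A map whose paths have all failed cannot serve I/O, so it is worth checking whether the LUN behind dm-76 still exists on the backend before treating the serial mismatch as the root cause.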

$ sudo cat /etc/iscsi/iscsid.conf
iscsid.startup = /bin/systemctl start iscsid.socket
node.startup = manual
node.leading_login = No
node.session.timeo.replacement_timeout = 120
node.conn[0].timeo.login_timeout = 15
node.conn[0].timeo.logout_timeout = 15
node.conn[0].timeo.noop_out_interval = 5
node.conn[0].timeo.noop_out_timeout = 5
node.session.err_timeo.abort_timeout = 15
node.session.err_timeo.lu_reset_timeout = 30
node.session.err_timeo.tgt_reset_timeout = 30
node.session.initial_login_retry_max = 8
node.session.cmds_max = 128
node.session.queue_depth = 32
node.session.xmit_thread_priority = -20
node.session.iscsi.InitialR2T = No
node.session.iscsi.ImmediateData = Yes
node.session.iscsi.FirstBurstLength = 262144
node.session.iscsi.MaxBurstLength = 16776192
node.conn[0].iscsi.MaxRecvDataSegmentLength = 262144
node.conn[0].iscsi.MaxXmitDataSegmentLength = 0
discovery.sendtargets.iscsi.MaxRecvDataSegmentLength = 32768
node.session.nr_sessions = 1
node.session.iscsi.FastAbort = Yes
node.session.scan = manual

grubjack added the bug label Oct 24, 2024

@bryantidd

Seeing the same error on a fresh install. @grubjack Did you find a workaround?

@grubjack
Author

grubjack commented Nov 7, 2024

@sjpeeris
Collaborator

sjpeeris commented Nov 13, 2024

@grubjack Can you reproduce the issue with the latest version of Trident (v24.10.0)? Several bug fixes went into 24.10.0. If the issue still reproduces on 24.10.0, please open a NetApp Support case so that our support team can investigate further.

3 participants