Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

SIGTERM no reach to rbd-nbd process when csi-rbdplugin pod restart #5068

Open
YiteGu opened this issue Jan 10, 2025 · 0 comments
Open

SIGTERM no reach to rbd-nbd process when csi-rbdplugin pod restart #5068

YiteGu opened this issue Jan 10, 2025 · 0 comments

Comments

@YiteGu
Copy link
Member

YiteGu commented Jan 10, 2025

Describe the bug

rbd-nbd process running in csi-rbdplugin pod if use rbd-nbd mounter. SIGTERM no reach to rbd-nbd process when csi-rbdplugin pod restart, this issues cause to rbd watcher no graceful release. This leads to healerStageTransaction can't no fast completed, further leads to I/O interrupt 30s.

Environment details

  • Image/version of Ceph CSI driver : v3.9.0
  • Helm chart version :
  • Kernel version :
  • Mounter used for mounting PVC (for cephFS its fuse or kernel. for rbd its
    krbd or rbd-nbd) : rbd-nbd
  • Kubernetes cluster version :
  • Ceph cluster version :

Steps to reproduce

Steps to reproduce the behavior:

  1. create a test pod with a pvc, and SC use rbd-nbd mounter
  2. use fio test filesystem of pvc mount
  3. restart csi-rbdplugin pod

Actual results

Image

Expected behavior

IO interrupt quick recovery

Logs

If the issue is in PVC creation, deletion, cloning please attach complete logs
of below containers.

  • csi-provisioner and csi-rbdplugin/csi-cephfsplugin container logs from the
    provisioner pod.

If the issue is in PVC resize please attach complete logs of below containers.

  • csi-resizer and csi-rbdplugin/csi-cephfsplugin container logs from the
    provisioner pod.

If the issue is in snapshot creation and deletion please attach complete logs
of below containers.

  • csi-snapshotter and csi-rbdplugin/csi-cephfsplugin container logs from the
    provisioner pod.

If the issue is in PVC mounting please attach complete logs of below containers.

  • csi-rbdplugin/csi-cephfsplugin and driver-registrar container logs from
    plugin pod from the node where the mount is failing.

  • if required attach dmesg logs.

Note:- If its a rbd issue please provide only rbd related logs, if its a
cephFS issue please provide cephFS logs.

Additional context

Add any other context about the problem here.

For example:

Any existing bug report which describe about the similar issue/behavior

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant