CSI driver crashes immediately on Kubernetes 1.34 (controller pod) #1224

@angustohrallegrinski

Description

The Hetzner CSI driver (hcloud-csi-driver:v2.18.3) starts in the controller pod, creates the socket /run/csi/socket, but then exits immediately without producing any logs. The sidecar containers (csi-attacher, csi-provisioner, csi-resizer) report:

Lost connection to CSI driver, exiting
grpc: addrConn.createTransport failed to connect to {Addr: "/run/csi/socket"}.
error reading server preface: read unix @->/run/csi/socket: use of closed network connection

Environment

  • Kubernetes version: v1.34 (latest)
  • CSI driver version: v2.18.3
  • Sidecar versions:
    • csi-attacher: v4.10.0
    • csi-provisioner: v6.1.0
    • csi-resizer: v2.0.0
    • liveness-probe: v2.17.0
  • Manifest: Based on the official Hetzner manifest (includes RBAC, CSIDriver, StorageClass).
  • Secret: Present (hcloud with key token), token is valid.
  • Network: Pod has connectivity to https://api.hetzner.cloud.

Expected Behavior

The CSI driver should remain running and respond to sidecar requests.


Observed Behavior

  • The hcloud-csi-driver container in the controller:
    • Starts.
    • Creates the socket /run/csi/socket.
    • Exits immediately (Exit Code: 2) without any logs.
  • No RBAC errors in logs.
  • NodePlugin (hcloud-csi-node) works correctly.

Steps to Reproduce

  1. Deploy the official Hetzner CSI manifest on Kubernetes 1.34.
  2. Create the Secret hcloud with a valid token.
  3. Observe the hcloud-csi-controller pod:
    kubectl -n kube-system get pods
    kubectl -n kube-system logs hcloud-csi-controller-xxx -c hcloud-csi-driver
  4. Verify that the container exits immediately without logs.
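Since the container restarts in a crash loop, the plain `logs` call in step 3 may only show the current (empty) instance. A minimal sketch for also checking the previous instance and the pod events follows. `collect_crash_info` is a hypothetical helper name, and the pod name is the same placeholder used in the steps above.

```shell
# Hypothetical helper: gather what the crashed container left behind.
# Arguments: namespace, pod name, container name.
collect_crash_info() {
  ns="$1"; pod="$2"; container="$3"
  # Output of the previous (terminated) container instance, if it wrote any.
  kubectl -n "$ns" logs "$pod" -c "$container" --previous
  # Container states, exit codes and recent events for the pod.
  kubectl -n "$ns" describe pod "$pod"
}
```

Example call: `collect_crash_info kube-system hcloud-csi-controller-xxx hcloud-csi-driver`.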

Pod Events

State: Waiting
  Reason: CrashLoopBackOff
Last State: Terminated
  Reason: Error
  Exit Code: 2
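The exit code shown above can also be read directly from the pod status, which is convenient when comparing restarts. This is a sketch only: `last_exit_code` is a hypothetical helper name, and it assumes the container status is reported under `.status.containerStatuses` as usual for regular (non-init) containers.

```shell
# Hypothetical helper: print the exit code of the last terminated instance
# of one container in a pod. Arguments: namespace, pod name, container name.
last_exit_code() {
  ns="$1"; pod="$2"; container="$3"
  # The JSONPath filter selects the status entry whose name matches the container.
  kubectl -n "$ns" get pod "$pod" \
    -o jsonpath="{.status.containerStatuses[?(@.name==\"$container\")].lastState.terminated.exitCode}"
}
```

Example call: `last_exit_code kube-system hcloud-csi-controller-xxx hcloud-csi-driver`.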

Questions

  • Is this a known issue with Kubernetes 1.34?
  • Is there any additional configuration required for the controller?
  • How can we enable more detailed logging to diagnose the crash?
