longhorn: [BUG][v1.5.x] Recurring job fails to create backup when volume detached

Describe the bug

Running test case test_recurring_jobs_when_volume_detached_unexpectedly[s3], it randomly fails to create backup with error message:

proxyServer=10.42.3.8:8501 destination=10.42.2.10:10010: 
failed to backup snapshot recurrin-532a51eb-2fda-4efb-8772-b4c41472eabb to backup-2bde369ad7864927: 
rpc error: code = Unknown desc = failed to get volume 10.42.2.10:10010: rpc error: code = Unavailable 
desc = error reading from server: EOF

https://ci.longhorn.io/job/private/job/longhorn-tests-regression/6238/consoleFull

backup

The reproducibility is about 1/10.

To Reproduce

Running test case test_recurring_jobs_when_volume_detached_unexpectedly[s3] repeatedly.

Expected behavior

Support bundle for troubleshooting

supportbundle_df8064e0-2445-44fd-9af4-63ed76159e29_2024-02-16T02-11-07Z.zip

Environment

  • Longhorn version: v1.5.x-head
  • Impacted volume (PV):
  • Installation method (e.g. Rancher Catalog App/Helm/Kubectl): kubectl
  • Kubernetes distro (e.g. RKE/K3s/EKS/OpenShift) and version: v1.28.4+k3s1
    • Number of control plane nodes in the cluster: 1
    • Number of worker nodes in the cluster: 3
  • Node config
    • OS type and version: sles 15-sp5
    • Kernel version:
    • CPU per node:
    • Memory per node:
    • Disk type (e.g. SSD/NVMe/HDD):
    • Network bandwidth between the nodes (Gbps):
  • Underlying Infrastructure (e.g. on AWS/GCE, EKS/GKE, VMWare/KVM, Baremetal):
  • Number of Longhorn volumes in the cluster:

Additional context

About this issue

  • Original URL
  • State: closed
  • Created 5 months ago
  • Reactions: 1
  • Comments: 22 (21 by maintainers)

Most upvoted comments

Looks like @PhanLe1010 proposed a better fix. I will also test it.

I still need more investigation but I think I kind of see the root cause of race condition that @c3y1huang also saw at https://github.com/longhorn/longhorn/issues/7937#issuecomment-1956182452:

The race condition is that we sometimes mark the volumeattachment ticket as satisfied even though the volume has empty spec.NodeID and has started the detaching process.

This is the flow when I watch the changes of the volume and volumeattachment object when the issue happen:

First, snapshot controller create a VA ticket for the snapshot:

spec:
  attachmentTickets:
    snapshot-controller-recurrin-dd970ef8-4431-49b6-9d6c-fabef7d86180:
    nodeID: phan-v620-recurring-job-pool2-517146b3-rd427
  volume: longhorn-testvol-plb3ix
status:
  attachmentTicketStatuses: {}

Then, volume.spec is set to the corresponding node:

  nodeID: phan-v620-recurring-job-pool2-517146b3-rd427

The snapshot VA ticket initially not satisfied:

spec:
  attachmentTickets:
    snapshot-controller-recurrin-dd970ef8-4431-49b6-9d6c-fabef7d86180:
      nodeID: phan-v620-recurring-job-pool2-517146b3-rd427
  volume: longhorn-testvol-plb3ix
status:
  attachmentTicketStatuses:
    snapshot-controller-recurrin-dd970ef8-4431-49b6-9d6c-fabef7d86180:
      satisfied: false

Then, once the volume is attached

  shareState: ""
  state: attached

Snapshot VA ticket has satisfied true

spec:
  attachmentTickets:
    snapshot-controller-recurrin-dd970ef8-4431-49b6-9d6c-fabef7d86180:
      nodeID: phan-v620-recurring-job-pool2-517146b3-rd427
  volume: longhorn-testvol-plb3ix
status:
  attachmentTicketStatuses:
    snapshot-controller-recurrin-dd970ef8-4431-49b6-9d6c-fabef7d86180:
      satisfied: true

Once the snapshot is created, snapshot controller delete the snapshot VA

spec:
  attachmentTickets: {}
  volume: longhorn-testvol-plb3ix
status:
  attachmentTicketStatuses:
    snapshot-controller-recurrin-dd970ef8-4431-49b6-9d6c-fabef7d86180:
      satisfied: true

This lead to volume.Spec.NodeID being reset

nodeID: ""

Note that at this moment the volume still have status.state as attached because the volume has not started to detach the volume yet:

  currentNodeID: phan-v620-recurring-job-pool2-517146b3-rd427
  robustness: unknown
  shareEndpoint: ""
  shareState: ""
  state: attached

Then a backup VA ticket is created

spec:
  attachmentTickets:
    backup-controller-backup-d5dbab0da00b422c:
      nodeID: phan-v620-recurring-job-pool2-517146b3-rd427
  volume: longhorn-testvol-plb3ix
status:
  attachmentTicketStatuses: {}

Then backup VA ticket is immediate set to have satisfied: true even though the volume is being detached with the spec.NodeID empty

======= Volume =======
spec:
  nodeID: ""
======= VA =========
spec:
  attachmentTickets:
    backup-controller-backup-d5dbab0da00b422c:
      nodeID: phan-v620-recurring-job-pool2-517146b3-rd427
  volume: longhorn-testvol-plb3ix
status:
  attachmentTicketStatuses:
    backup-controller-backup-d5dbab0da00b422c:
      satisfied: true

At this moment, backup controller starts to initiate the backup:

  773: 2024-02-22T03:14:11.675121943Z time="2024-02-22T03:14:11Z" level=info msg="Start creating backup backup-d5dbab0da00b422c" func=backup.DoBackupCreate file="main.go:136" pkg=backup

The backup later failed because the volume is in detaching process and it start killing engine process:

2024-02-22T03:14:11.759819409Z time="2024-02-22T03:14:11Z" level=info msg="Deleting instance longhorn-testvol-plb3ix-e-0" func="controller.(*InstanceHandler).deleteInstance" file="instance_handler.go:497"

This is the error:

  4417: 2024-02-22T03:14:12.399654154Z time="2024-02-22T03:14:12Z" level=warning msg="Failed to enable the backup monitor for backup backup-d5dbab0da00b422c" func="controller.(*BackupController).reconcile" file="backup_controller.go:384" backup=backup-d5dbab0da00b422c controller=longhorn-backup error="proxyServer=10.42.184.144:8501 destination=10.42.213.11:10010: failed to backup snapshot recurrin-dd970ef8-4431-49b6-9d6c-fabef7d86180 to backup-d5dbab0da00b422c: rpc error: code = Internal desc = failed to create backup: failed to create backup to s3://backupbucket@us-east-1/backupstore for volume longhorn-testvol-plb3ix: rpc error: code = Unavailable desc = error reading from server: EOF" node=phan-v620-recurring-job-pool2-517146b3-69z6d

Looks like I got different error on Longhorn v1.5.4-rc3: proxyServer=10.42.191.99:8501 destination=10.42.191.99:10010: failed to backup snapshot c-00cx94-c844d1f7-a38b-4151-8058-e77e9b2f422e to backup-dfeb5dad14aa4e6d: rpc error: code = Unknown desc = failed to create backup to for volume testvol: rpc error: code = Unknown desc = missing input parameter

Error logs from instance manager: 2024-02-21T23:04:13.127508386Z time="2024-02-21T23:04:13Z" level=error msg="Failed to initialize backup backup-55ef929c06664af9" func="rpc.(*SyncAgentServer).BackupCreate" file="server.go:714" error="missing input parameter"

We’ve also discussed this in the local meeting. Maybe we can check again after the fix.

On observation, the same issue can also be seen before the engine crash.

Test exit before engine crash:

        recurring_jobs = {
            RECURRING_JOB_NAME: {
                TASK: BACKUP,
                GROUPS: [],
                CRON: "*/2 * * * *",
                RETAIN: 1,
                CONCURRENCY: 1,
                LABELS: {},
            },
        }
        create_recurring_jobs(client, recurring_jobs)
        check_recurring_jobs(client, recurring_jobs)
    
        volume.recurringJobAdd(name=RECURRING_JOB_NAME, isGroup=False)
        wait_for_volume_recurring_job_update(volume,
                                             jobs=[RECURRING_JOB_NAME],
                                             groups=[DEFAULT])
    
        time.sleep(60)
>       wait_for_recurring_backup_to_start(client, core_api, volume_name,
                                           expected_snapshot_count=1,
                                           minimum_progress=50)

Analyse the log:

=== Created recurring job ===

[longhorn-manager-4hdqm] time="2024-02-21T06:12:00Z" level=info msg="Created recurring job recurring-test" func="manager.(*VolumeManager).CreateRecurringJob" file="recurringjob.go:52"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:12:00 +0000] "POST /v1/recurringjobs HTTP/1.1" 200 221 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:12:00 +0000] "GET /v1/recurringjobs/recurring-test HTTP/1.1" 200 221 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] time="2024-02-21T06:12:00Z" level=info msg="Recurring Job got new owner ip-10-0-2-62" func="controller.(*RecurringJobController).syncRecurringJob" file="recurring_job_controller.go:217" controller=longhorn-recurring-job node=ip-10-0-2-62 recurringJob=recurring-test
[longhorn-manager-4hdqm] time="2024-02-21T06:12:00Z" level=info msg="Request (user: system:serviceaccount:longhorn-system:longhorn-service-account, longhorn.io/v1beta2, Kind=Volume, namespace: longhorn-system, name: longhorn-testvol-llisw1, operation: UPDATE) patchOps: [{\"op\": \"replace\", \"path\": \"/metadata/finalizers\", \"value\": [\"longhorn.io\"]},{\"op\": \"replace\", \"path\": \"/metadata/labels\", \"value\": {\"longhornvolume\":\"longhorn-testvol-llisw1\",\"recurring-job-group.longhorn.io/default\":\"enabled\",\"recurring-job.longhorn.io/recurring-test\":\"enabled\",\"setting.longhorn.io/remove-snapshots-during-filesystem-trim\":\"ignored\",\"setting.longhorn.io/replica-auto-balance\":\"ignored\",\"setting.longhorn.io/snapshot-data-integrity\":\"ignored\"}}]" func="admission.(*Handler).admit" file="admission.go:115" service=admissionWebhook
[longhorn-manager-4hdqm] time="2024-02-21T06:12:00Z" level=debug msg="admit result: UPDATE longhorn.io/v1beta2, Kind=Volume longhorn-system/longhorn-testvol-llisw1 user=system:serviceaccount:longhorn-system:longhorn-service-account allowed=true err=<nil>" func="webhook.(*Router).admit" file="router.go:89"
[longhorn-manager-6rjkv] time="2024-02-21T06:12:00Z" level=debug msg="admit result: UPDATE longhorn.io/v1beta2, Kind=Volume longhorn-system/longhorn-testvol-llisw1 user=system:serviceaccount:longhorn-system:longhorn-service-account allowed=true err=<nil>" func="webhook.(*Router).admit" file="router.go:89"
[longhorn-manager-4hdqm] time="2024-02-21T06:12:00Z" level=warning msg="Failed to reconcile recurring job longhorn-system/recurring-test" func="controller.(*RecurringJobController).syncRecurringJob" file="recurring_job_controller.go:242" controller=longhorn-recurring-job error="failed to create cron job: failed to create cron job: cronjobs.batch \"recurring-test\" already exists" node=ip-10-0-2-62 recurringJob=recurring-test
[longhorn-manager-6rjkv] time="2024-02-21T06:12:00Z" level=debug msg="Ignoring recurring job labels on Volume longhorn-testvol-llisw1 PVC longhorn-testvol-llisw1-pvc due to missing source label" func="controller.(*VolumeController).syncPVCRecurringJobLabels" file="volume_controller.go:3668" controller=longhorn-volume node=ip-10-0-2-36

=== Adding recurring job to volume ===

[longhorn-manager-4hdqm] time="2024-02-21T06:12:00Z" level=info msg="Added volume longhorn-testvol-llisw1 recurring job label recurring-job.longhorn.io/recurring-test" func="datastore.(*DataStore).AddRecurringJobLabelToVolume" file="longhorn.go:897"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:12:00 +0000] "POST /v1/volumes/longhorn-testvol-llisw1?action=recurringJobAdd HTTP/1.1" 200 492 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:12:00 +0000] "POST /v1/volumes/longhorn-testvol-llisw1?action=recurringJobList HTTP/1.1" 200 492 "" "python-requests/2.25.1"
[longhorn-manager-6rjkv] time="2024-02-21T06:12:06Z" level=debug msg="Ignoring recurring job labels on Volume longhorn-testvol-llisw1 PVC longhorn-testvol-llisw1-pvc due to missing source label" func="controller.(*VolumeController).syncPVCRecurringJobLabels" file="volume_controller.go:3668" controller=longhorn-volume node=ip-10-0-2-36
[longhorn-manager-nj822] time="2024-02-21T06:12:10Z" level=debug msg="Triggering sync backup target" func="controller.(*BackupStoreTimer).Start.func1" file="setting_controller.go:1061" component=backup-store-timer controller=longhorn-setting interval=30s node=ip-10-0-2-231
[csi-snapshotter-959b69d4b-8hqt8] W0221 06:12:15.309883       1 reflector.go:535] github.com/kubernetes-csi/external-snapshotter/client/v6/informers/externalversions/factory.go:133: failed to list *v1.VolumeSnapshotContent: the server could not find the requested resource (get volumesnapshotcontents.snapshot.storage.k8s.io)
[csi-snapshotter-959b69d4b-8hqt8] E0221 06:12:15.309934       1 reflector.go:147] github.com/kubernetes-csi/external-snapshotter/client/v6/informers/externalversions/factory.go:133: Failed to watch *v1.VolumeSnapshotContent: failed to list *v1.VolumeSnapshotContent: the server could not find the requested resource (get volumesnapshotcontents.snapshot.storage.k8s.io)
[csi-snapshotter-959b69d4b-8hqt8] W0221 06:12:15.735323       1 reflector.go:535] github.com/kubernetes-csi/external-snapshotter/client/v6/informers/externalversions/factory.go:133: failed to list *v1.VolumeSnapshotClass: the server could not find the requested resource (get volumesnapshotclasses.snapshot.storage.k8s.io)
[csi-snapshotter-959b69d4b-8hqt8] E0221 06:12:15.735372       1 reflector.go:147] github.com/kubernetes-csi/external-snapshotter/client/v6/informers/externalversions/factory.go:133: Failed to watch *v1.VolumeSnapshotClass: failed to list *v1.VolumeSnapshotClass: the server could not find the requested resource (get volumesnapshotclasses.snapshot.storage.k8s.io)
[longhorn-manager-6rjkv] time="2024-02-21T06:12:36Z" level=debug msg="Ignoring recurring job labels on Volume longhorn-testvol-llisw1 PVC longhorn-testvol-llisw1-pvc due to missing source label" func="controller.(*VolumeController).syncPVCRecurringJobLabels" file="volume_controller.go:3668" controller=longhorn-volume node=ip-10-0-2-36
[longhorn-manager-nj822] time="2024-02-21T06:12:40Z" level=debug msg="Triggering sync backup target" func="controller.(*BackupStoreTimer).Start.func1" file="setting_controller.go:1061" component=backup-store-timer controller=longhorn-setting interval=30s node=ip-10-0-2-231
[csi-snapshotter-959b69d4b-8hqt8] W0221 06:12:57.306683       1 reflector.go:535] github.com/kubernetes-csi/external-snapshotter/client/v6/informers/externalversions/factory.go:133: failed to list *v1.VolumeSnapshotClass: the server could not find the requested resource (get volumesnapshotclasses.snapshot.storage.k8s.io)
[csi-snapshotter-959b69d4b-8hqt8] E0221 06:12:57.306731       1 reflector.go:147] github.com/kubernetes-csi/external-snapshotter/client/v6/informers/externalversions/factory.go:133: Failed to watch *v1.VolumeSnapshotClass: failed to list *v1.VolumeSnapshotClass: the server could not find the requested resource (get volumesnapshotclasses.snapshot.storage.k8s.io)
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:00 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:01 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:02 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[csi-snapshotter-959b69d4b-8hqt8] W0221 06:13:03.656739       1 reflector.go:535] github.com/kubernetes-csi/external-snapshotter/client/v6/informers/externalversions/factory.go:133: failed to list *v1.VolumeSnapshotContent: the server could not find the requested resource (get volumesnapshotcontents.snapshot.storage.k8s.io)
[csi-snapshotter-959b69d4b-8hqt8] E0221 06:13:03.656783       1 reflector.go:147] github.com/kubernetes-csi/external-snapshotter/client/v6/informers/externalversions/factory.go:133: Failed to watch *v1.VolumeSnapshotContent: failed to list *v1.VolumeSnapshotContent: the server could not find the requested resource (get volumesnapshotcontents.snapshot.storage.k8s.io)
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:03 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:04 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:05 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-6rjkv] time="2024-02-21T06:13:06Z" level=debug msg="Ignoring recurring job labels on Volume longhorn-testvol-llisw1 PVC longhorn-testvol-llisw1-pvc due to missing source label" func="controller.(*VolumeController).syncPVCRecurringJobLabels" file="volume_controller.go:3668" controller=longhorn-volume node=ip-10-0-2-36
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:06 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "pyt
hon-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:07 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "pyt
hon-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:08 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "pyt
hon-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:09 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "pyt
hon-requests/2.25.1"
[longhorn-manager-nj822] time="2024-02-21T06:13:10Z" level=debug msg="Triggering sync backup target" func="controller.(*BackupStoreTimer
).Start.func1" file="setting_controller.go:1061" component=backup-store-timer controller=longhorn-setting interval=30s node=ip-10-0-2-231
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:10 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:11 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:12 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:13 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:14 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:15 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:16 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:17 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:18 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:19 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:20 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:21 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:22 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:23 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:24 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:25 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:26 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:27 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:28 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:29 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:30 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:31 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:32 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:33 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:34 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:35 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-6rjkv] time="2024-02-21T06:13:36Z" level=debug msg="Ignoring recurring job labels on Volume longhorn-testvol-llisw1 PVC longhorn-testvol-llisw1-pvc due to missing source label" func="controller.(*VolumeController).syncPVCRecurringJobLabels" file="volume_controller.go:3668" controller=longhorn-volume node=ip-10-0-2-36
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:36 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:37 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:38 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:39 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-nj822] time="2024-02-21T06:13:40Z" level=debug msg="Triggering sync backup target" func="controller.(*BackupStoreTimer).Start.func1" file="setting_controller.go:1061" component=backup-store-timer controller=longhorn-setting interval=30s node=ip-10-0-2-231
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:40 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:41 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:42 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:43 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:44 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:45 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:46 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[csi-snapshotter-959b69d4b-8hqt8] W0221 06:13:47.235493       1 reflector.go:535] github.com/kubernetes-csi/external-snapshotter/client/v6/informers/externalversions/factory.go:133: failed to list *v1.VolumeSnapshotClass: the server could not find the requested resource (get volumesnapshotclasses.snapshot.storage.k8s.io)
[csi-snapshotter-959b69d4b-8hqt8] E0221 06:13:47.235541       1 reflector.go:147] github.com/kubernetes-csi/external-snapshotter/client/v6/informers/externalversions/factory.go:133: Failed to watch *v1.VolumeSnapshotClass: failed to list *v1.VolumeSnapshotClass: the server could not find the requested resource (get volumesnapshotclasses.snapshot.storage.k8s.io)
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:47 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:48 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:49 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:50 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:51 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:52 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:53 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:54 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:55 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[csi-snapshotter-959b69d4b-8hqt8] W0221 06:13:56.175016       1 reflector.go:535] github.com/kubernetes-csi/external-snapshotter/client/v6/informers/externalversions/factory.go:133: failed to list *v1.VolumeSnapshotContent: the server could not find the requested resource (get volumesnapshotcontents.snapshot.storage.k8s.io)
[csi-snapshotter-959b69d4b-8hqt8] E0221 06:13:56.175070       1 reflector.go:147] github.com/kubernetes-csi/external-snapshotter/client/v6/informers/externalversions/factory.go:133: Failed to watch *v1.VolumeSnapshotContent: failed to list *v1.VolumeSnapshotContent: the server could not find the requested resource (get volumesnapshotcontents.snapshot.storage.k8s.io)
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:57 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:58 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:13:59 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:14:00 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7164 "" "python-requests/2.25.1"
[longhorn-manager-6rjkv] 10.42.2.90 - - [21/Feb/2024:06:14:00 +0000] "GET /v1 HTTP/1.1" 200 5467 "" "Go-http-client/1.1"
[longhorn-manager-nj822] 10.42.2.90 - - [21/Feb/2024:06:14:00 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7344 "" "Go-http-client/1.1"
[longhorn-manager-nj822] 10.42.2.90 - - [21/Feb/2024:06:14:00 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7344 "" "Go-http-client/1.1"
[longhorn-manager-nj822] 10.42.2.90 - - [21/Feb/2024:06:14:00 +0000] "POST /v1/volumes/longhorn-testvol-llisw1?action=snapshotCRList HTTP/1.1" 200 143 "" "Go-http-client/1.1"
[longhorn-manager-nj822] 10.42.2.90 - - [21/Feb/2024:06:14:00 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7344 "" "Go-http-client/1.1"
[longhorn-manager-nj822] 10.42.2.90 - - [21/Feb/2024:06:14:00 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7344 "" "Go-http-client/1.1"
[longhorn-manager-nj822] 10.42.2.90 - - [21/Feb/2024:06:14:00 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 7344 "" "Go-http-client/1.1"
[longhorn-manager-nj822] 10.42.2.90 - - [21/Feb/2024:06:14:00 +0000] "POST /v1/volumes/longhorn-testvol-llisw1?action=snapshotCRList HTTP/1.1" 200 143 "" "Go-http-client/1.1"
[longhorn-manager-4hdqm] time="2024-02-21T06:14:00Z" level=info msg="Request (user: system:serviceaccount:longhorn-system:longhorn-service-account, longhorn.io/v1beta2, Kind=Snapshot, namespace: longhorn-system, name: recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a, operation: CREATE) patchOps: [{\"op\": \"replace\", \"path\": \"/metadata/finalizers\", \"value\": [\"longhorn.io\"]},{\"op\": \"replace\", \"path\": \"/spec/dataEngine\", \"value\": \"v1\"},{\"op\": \"replace\", \"path\": \"/metadata/labels\", \"value\": {\"longhornvolume\":\"longhorn-testvol-llisw1\"}},{\"op\": \"replace\", \"path\": \"/metadata/ownerReferences\", \"value\": [{\"apiVersion\":\"longhorn.io/v1beta2\",\"kind\":\"Volume\",\"name\":\"longhorn-testvol-llisw1\",\"uid\":\"e67523bf-18ea-458f-af6d-5a2160970c50\"}]}]" func="admission.(*Handler).admit" file="admission.go:115" service=admissionWebhook

=== RecurringJob creating snapshot ===

[longhorn-manager-4hdqm] time="2024-02-21T06:14:00Z" level=debug msg="admit result: CREATE longhorn.io/v1beta2, Kind=Snapshot longhorn-system/recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a user=system:serviceaccount:longhorn-system:longhorn-service-account allowed=true err=<nil>" func="webhook.(*Router).admit" file="router.go:89"
[longhorn-manager-4hdqm] time="2024-02-21T06:14:00Z" level=debug msg="admit result: CREATE longhorn.io/v1beta2, Kind=Snapshot longhorn-system/recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a user=system:serviceaccount:longhorn-system:longhorn-service-account allowed=true err=<nil>" func="webhook.(*Router).admit" file="router.go:89"


=== SnapshotController updates volume attach ===

[longhorn-manager-6rjkv] time="2024-02-21T06:14:00Z" level=debug msg="[DEBUG] handleAttachmentTicketCreation/Update for snapshot recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a" func="controller.(*SnapshotController).reconcile" file="snapshot_controller.go:453"
[longhorn-manager-6rjkv] time="2024-02-21T06:14:00Z" level=debug msg="[DEBUG] Updating volume attachment longhorn-testvol-llisw1" func="datastore.(*DataStore).UpdateLHVolumeAttachment" file="longhorn.go:4802" controller=ip-10-0-2-36
[longhorn-manager-4hdqm] time="2024-02-21T06:14:00Z" level=info msg="Request (user: system:serviceaccount:longhorn-system:longhorn-service-account, longhorn.io/v1beta2, Kind=VolumeAttachment, namespace: longhorn-system, name: longhorn-testvol-llisw1, operation: UPDATE) patchOps: [{\"op\": \"replace\", \"path\": \"/metadata/finalizers\", \"value\": [\"longhorn.io\"]}]" func="admission.(*Handler).admit" file="admission.go:115" service=admissionWebhook
[longhorn-manager-4hdqm] time="2024-02-21T06:14:00Z" level=debug msg="admit result: UPDATE longhorn.io/v1beta2, Kind=VolumeAttachment longhorn-system/longhorn-testvol-llisw1 user=system:serviceaccount:longhorn-system:longhorn-service-account allowed=true err=<nil>" func="webhook.(*Router).admit" file="router.go:89"
[longhorn-manager-4hdqm] time="2024-02-21T06:14:00Z" level=debug msg="admit result: UPDATE longhorn.io/v1beta2, Kind=VolumeAttachment longhorn-system/longhorn-testvol-llisw1 user=system:serviceaccount:longhorn-system:longhorn-service-account allowed=true err=<nil>" func="webhook.(*Router).admit" file="router.go:89"

=== VolumeAttachementController attach volume to node ===

[longhorn-manager-6rjkv] time="2024-02-21T06:14:00Z" level=info msg="Volume longhorn-testvol-llisw1 is selected to attach to node ip-10-0-2-36, ticket +&{snapshot-controller-recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a snapshot-controller ip-10-0-2-36 map[disableFrontend:any] 0}" func="controller.(*VolumeAttachmentController).handleVolumeAttachment" file="volume_attachment_controller.go:653" controller=longhorn-volume-attachment longhornVolumeAttachment=longhorn-testvol-llisw1 node=ip-10-0-2-36
[longhorn-manager-4hdqm] time="2024-02-21T06:14:00Z" level=info msg="Request (user: system:serviceaccount:longhorn-system:longhorn-service-account, longhorn.io/v1beta2, Kind=Volume, namespace: longhorn-system, name: longhorn-testvol-llisw1, operation: UPDATE) patchOps: [{\"op\": \"replace\", \"path\": \"/metadata/finalizers\", \"value\": [\"longhorn.io\"]},{\"op\": \"replace\", \"path\": \"/metadata/labels\", \"value\": {\"longhornvolume\":\"longhorn-testvol-llisw1\",\"recurring-job-group.longhorn.io/default\":\"enabled\",\"recurring-job.longhorn.io/recurring-test\":\"enabled\",\"setting.longhorn.io/remove-snapshots-during-filesystem-trim\":\"ignored\",\"setting.longhorn.io/replica-auto-balance\":\"ignored\",\"setting.longhorn.io/snapshot-data-integrity\":\"ignored\"}}]" func="admission.(*Handler).admit" file="admission.go:115" service=admissionWebhook
[longhorn-manager-4hdqm] time="2024-02-21T06:14:00Z" level=debug msg="admit result: UPDATE longhorn.io/v1beta2, Kind=Volume longhorn-system/longhorn-testvol-llisw1 user=system:serviceaccount:longhorn-system:longhorn-service-account allowed=true err=<nil>" func="webhook.(*Router).admit" file="router.go:89"
[longhorn-manager-4hdqm] time="2024-02-21T06:14:00Z" level=debug msg="admit result: UPDATE longhorn.io/v1beta2, Kind=Volume longhorn-system/longhorn-testvol-llisw1 user=system:serviceaccount:longhorn-system:longhorn-service-account allowed=true err=<nil>" func="webhook.(*Router).admit" file="router.go:89"
[longhorn-manager-6rjkv] time="2024-02-21T06:14:00Z" level=debug msg="Ignoring recurring job labels on Volume longhorn-testvol-llisw1 PVC longhorn-testvol-llisw1-pvc due to missing source label" func="controller.(*VolumeController).syncPVCRecurringJobLabels" file="volume_controller.go:3668" controller=longhorn-volume node=ip-10-0-2-36

=== VolumeAttachementController updates volume attachement status ===

[longhorn-manager-6rjkv] time="2024-02-21T06:14:00Z" level=debug msg="[DEBUG] Updating volume attachment longhorn-testvol-llisw1 status" func="datastore.(*DataStore).UpdateLHVolumeAttachmentStatus" file="longhorn.go:4815" controller=volume-attachment-controller
[longhorn-manager-4hdqm] time="2024-02-21T06:14:00Z" level=info msg="Request (user: system:serviceaccount:longhorn-system:longhorn-service-account, longhorn.io/v1beta2, Kind=Replica, namespace: longhorn-system, name: longhorn-testvol-llisw1-r-67c2043c, operation: UPDATE) patchOps: [{\"op\": \"replace\", \"path\": \"/metadata/labels\", \"value\": {\"longhorn.io/backing-image\":\"\",\"longhorndiskuuid\":\"4ea76150-406f-424a-b1e0-c6f499a7d608\",\"longhornnode\":\"ip-10-0-2-36\",\"longhornvolume\":\"longhorn-testvol-llisw1\"}},{\"op\": \"replace\", \"path\": \"/metadata/finalizers\", \"value\": [\"longhorn.io\"]}]" func="admission.(*Handler).admit" file="admission.go:115" service=admissionWebhook

=== Controllers attaching volume ===

[longhorn-manager-4hdqm] time="2024-02-21T06:14:00Z" level=debug msg="admit result: UPDATE longhorn.io/v1beta2, Kind=Replica longhorn-system/longhorn-testvol-llisw1-r-67c2043c user=system:serviceaccount:longhorn-system:longhorn-service-account allowed=true err=<nil>" func="webhook.(*Router).admit" file="router.go:89"
[longhorn-manager-4hdqm] time="2024-02-21T06:14:00Z" level=debug msg="admit result: UPDATE longhorn.io/v1beta2, Kind=Replica longhorn-system/longhorn-testvol-llisw1-r-67c2043c user=system:serviceaccount:longhorn-system:longhorn-service-account allowed=true err=<nil>" func="webhook.(*Router).admit" file="router.go:89"
[longhorn-manager-4hdqm] time="2024-02-21T06:14:00Z" level=info msg="Request (user: system:serviceaccount:longhorn-system:longhorn-service-account, longhorn.io/v1beta2, Kind=Replica, namespace: longhorn-system, name: longhorn-testvol-llisw1-r-2c35a938, operation: UPDATE) patchOps: [{\"op\": \"replace\", \"path\": \"/metadata/labels\", \"value\": {\"longhorn.io/backing-image\":\"\",\"longhorndiskuuid\":\"d24b38fa-0608-425b-b08f-0c6d921952c4\",\"longhornnode\":\"ip-10-0-2-62\",\"longhornvolume\":\"longhorn-testvol-llisw1\"}},{\"op\": \"replace\", \"path\": \"/metadata/finalizers\", \"value\": [\"longhorn.io\"]}]" func="admission.(*Handler).admit" file="admission.go:115" service=admissionWebhook
[longhorn-manager-6rjkv] time="2024-02-21T06:14:00Z" level=info msg="Creating instance longhorn-testvol-llisw1-r-67c2043c" func="controller.(*InstanceHandler).createInstance" file="instance_handler.go:481"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:00Z" level=info msg="Creating instance" func="instance.(*Server).InstanceCreate" file="instance.go:113" dataEngine=DATA_ENGINE_V1 name=longhorn-testvol-llisw1-r-67c2043c type=replica
[longhorn-manager-4hdqm] time="2024-02-21T06:14:00Z" level=debug msg="admit result: UPDATE longhorn.io/v1beta2, Kind=Replica longhorn-system/longhorn-testvol-llisw1-r-2c35a938 user=system:serviceaccount:longhorn-system:longhorn-service-account allowed=true err=<nil>" func="webhook.(*Router).admit" file="router.go:89"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:00Z" level=info msg="Creating process" func="client.(*ProcessManagerClient).ProcessCreate" file="process_manager.go:96" args="[--volume-name longhorn-testvol-llisw1 replica /host/var/lib/longhorn/replicas/longhorn-testvol-llisw1-64b5a279 --size 2147483648 --replica-instance-name longhorn-testvol-llisw1-r-67c2043c --snapshot-max-count 250 --snapshot-max-size 0 --sync-agent-port-count 7]" binary=/host/var/lib/longhorn/engine-binaries/longhornio-longhorn-engine-master-head/longhorn name=longhorn-testvol-llisw1-r-67c2043c portArgs="[--listen,0.0.0.0:]" portCount=10
[longhorn-manager-4hdqm] time="2024-02-21T06:14:00Z" level=debug msg="admit result: UPDATE longhorn.io/v1beta2, Kind=Replica longhorn-system/longhorn-testvol-llisw1-r-2c35a938 user=system:serviceaccount:longhorn-system:longhorn-service-account allowed=true err=<nil>" func="webhook.(*Router).admit" file="router.go:89"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:00Z" level=info msg="Process Manager: prepare to create process longhorn-testvol-llisw1-r-67c2043c" func="process.(*Manager).ProcessCreate" file="process_manager.go:222"
[longhorn-manager-6rjkv] time="2024-02-21T06:14:00Z" level=info msg="Event(v1.ObjectReference{Kind:\"Replica\", Namespace:\"longhorn-system\", Name:\"longhorn-testvol-llisw1-r-67c2043c\", UID:\"3495b15f-d79c-49b1-9133-ae6ecaf21e6a\", APIVersion:\"longhorn.io/v1beta2\", ResourceVersion:\"46303\", FieldPath:\"\"}): type: 'Normal' reason: 'Start' Starts longhorn-testvol-llisw1-r-67c2043c" func="record.(*eventBroadcasterImpl).StartLogging.func1" file="event.go:298"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:00Z" level=debug msg="Process Manager: validate process path: /host/var/lib/longhorn/engine-binaries/longhornio-longhorn-engine-master-head/longhorn dir: /host/var/lib/longhorn/engine-binaries/ image: longhornio-longhorn-engine-master-head binary: longhorn" func=process.ensureValidProcessPath file="process_manager.go:203"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:00Z" level=info msg="Process Manager: created process longhorn-testvol-llisw1-r-67c2043c" func="process.(*Manager).ProcessCreate" file="process_manager.go:264"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-r-67c2043c] warning: GOCOVERDIR not set, no coverage data emitted
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-r-67c2043c] time="2024-02-21T06:14:00Z" level=info msg="Listening on data server 0.0.0.0:10001" func=cmd.startReplica.func2 file="replica.go:164"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] time="2024-02-21T06:14:00Z" level=info msg="Listening on sync agent server 0.0.0.0:10002" func=cmd.startReplica.func3 file="replica.go:193"
[longhorn-manager-4hdqm] time="2024-02-21T06:14:00Z" level=info msg="Creating instance longhorn-testvol-llisw1-r-2c35a938" func="controller.(*InstanceHandler).createInstance" file="instance_handler.go:481"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-r-67c2043c] time="2024-02-21T06:14:00Z" level=info msg="Listening on gRPC Replica server 0.0.0.0:10000" func=cmd.startReplica.func1 file="replica.go:156"
[longhorn-manager-4hdqm] time="2024-02-21T06:14:00Z" level=info msg="Request (user: system:serviceaccount:longhorn-system:longhorn-service-account, longhorn.io/v1beta2, Kind=Replica, namespace: longhorn-system, name: longhorn-testvol-llisw1-r-b1867404, operation: UPDATE) patchOps: [{\"op\": \"replace\", \"path\": \"/metadata/labels\", \"value\": {\"longhorn.io/backing-image\":\"\",\"longhorndiskuuid\":\"f48bf0ec-2725-4cc8-802e-c502c53c8d61\",\"longhornnode\":\"ip-10-0-2-231\",\"longhornvolume\":\"longhorn-testvol-llisw1\"}},{\"op\": \"replace\", \"path\": \"/metadata/finalizers\", \"value\": [\"longhorn.io\"]}]" func="admission.(*Handler).admit" file="admission.go:115" service=admissionWebhook
[instance-manager-16d650f38cc11d9112b544b831ce8876] [longhorn-instance-manager] time="2024-02-21T06:14:00Z" level=info msg="Creating instance" func="instance.(*Server).InstanceCreate" file="instance.go:113" dataEngine=DATA_ENGINE_V1 name=longhorn-testvol-llisw1-r-2c35a938 type=replica
[longhorn-manager-4hdqm] time="2024-02-21T06:14:00Z" level=debug msg="admit result: UPDATE longhorn.io/v1beta2, Kind=Replica longhorn-system/longhorn-testvol-llisw1-r-b1867404 user=system:serviceaccount:longhorn-system:longhorn-service-account allowed=true err=<nil>" func="webhook.(*Router).admit" file="router.go:89"
[instance-manager-16d650f38cc11d9112b544b831ce8876] [longhorn-instance-manager] time="2024-02-21T06:14:00Z" level=info msg="Creating process" func="client.(*ProcessManagerClient).ProcessCreate" file="process_manager.go:96" args="[--volume-name longhorn-testvol-llisw1 replica /host/var/lib/longhorn/replicas/longhorn-testvol-llisw1-b60ff363 --size 2147483648 --replica-instance-name longhorn-testvol-llisw1-r-2c35a938 --snapshot-max-count 250 --snapshot-max-size 0 --sync-agent-port-count 7]" binary=/host/var/lib/longhorn/engine-binaries/longhornio-longhorn-engine-master-head/longhorn name=longhorn-testvol-llisw1-r-2c35a938 portArgs="[--listen,0.0.0.0:]" portCount=10
[longhorn-manager-4hdqm] time="2024-02-21T06:14:00Z" level=info msg="Event(v1.ObjectReference{Kind:\"Replica\", Namespace:\"longhorn-system\", Name:\"longhorn-testvol-llisw1-r-2c35a938\", UID:\"b9ce97aa-0d23-49fa-83ac-5217cb6c205f\", APIVersion:\"longhorn.io/v1beta2\", ResourceVersion:\"46304\", FieldPath:\"\"}): type: 'Normal' reason: 'Start' Starts longhorn-testvol-llisw1-r-2c35a938" func="record.(*eventBroadcasterImpl).StartLogging.func1" file="event.go:298"
[instance-manager-16d650f38cc11d9112b544b831ce8876] [longhorn-instance-manager] time="2024-02-21T06:14:00Z" level=info msg="Process Manager: prepare to create process longhorn-testvol-llisw1-r-2c35a938" func="process.(*Manager).ProcessCreate" file="process_manager.go:222"
[longhorn-manager-4hdqm] time="2024-02-21T06:14:00Z" level=debug msg="admit result: UPDATE longhorn.io/v1beta2, Kind=Replica longhorn-system/longhorn-testvol-llisw1-r-b1867404 user=system:serviceaccount:longhorn-system:longhorn-service-account allowed=true err=<nil>" func="webhook.(*Router).admit" file="router.go:89"
[instance-manager-16d650f38cc11d9112b544b831ce8876] [longhorn-instance-manager] time="2024-02-21T06:14:00Z" level=debug msg="Process Manager: validate process path: /host/var/lib/longhorn/engine-binaries/longhornio-longhorn-engine-master-head/longhorn dir: /host/var/lib/longhorn/engine-binaries/ image: longhornio-longhorn-engine-master-head binary: longhorn" func=process.ensureValidProcessPath file="process_manager.go:203"
[instance-manager-16d650f38cc11d9112b544b831ce8876] [longhorn-instance-manager] time="2024-02-21T06:14:00Z" level=info msg="Process Manager: created process longhorn-testvol-llisw1-r-2c35a938" func="process.(*Manager).ProcessCreate" file="process_manager.go:264"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-r-67c2043c] warning: GOCOVERDIR not set, no coverage data emitted
[instance-manager-16d650f38cc11d9112b544b831ce8876] [longhorn-testvol-llisw1-r-2c35a938] warning: GOCOVERDIR not set, no coverage data emitted
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-r-67c2043c] time="2024-02-21T06:14:00Z" level=info msg="Listening on sync 0.0.0.0:10002" func=cmd.startSyncAgent file="sync_agent.go:89"
[instance-manager-16d650f38cc11d9112b544b831ce8876] [longhorn-testvol-llisw1-r-2c35a938] time="2024-02-21T06:14:00Z" level=info msg="Listening on sync agent server 0.0.0.0:10002" func=cmd.startReplica.func3 file="replica.go:193"
[longhorn-manager-nj822] time="2024-02-21T06:14:00Z" level=info msg="Creating instance longhorn-testvol-llisw1-r-b1867404" func="controller.(*InstanceHandler).createInstance" file="instance_handler.go:481"
[instance-manager-16d650f38cc11d9112b544b831ce8876] time="2024-02-21T06:14:00Z" level=info msg="Listening on data server 0.0.0.0:10001" func=cmd.startReplica.func2 file="replica.go:164"
[instance-manager-16d650f38cc11d9112b544b831ce8876] time="2024-02-21T06:14:00Z" level=info msg="Listening on gRPC Replica server 0.0.0.0:10000" func=cmd.startReplica.func1 file="replica.go:156"
[instance-manager-6f6372acb5d67390c3b9274dd68b2ab7] [longhorn-instance-manager] time="2024-02-21T06:14:00Z" level=info msg="Creating instance" func="instance.(*Server).InstanceCreate" file="instance.go:113" dataEngine=DATA_ENGINE_V1 name=longhorn-testvol-llisw1-r-b1867404 type=replica
[longhorn-manager-4hdqm] time="2024-02-21T06:14:00Z" level=info msg="Creating PDB csi-attacher" func="controller.(*KubernetesPDBController).reconcile" file="kubernetes_pdb_controller.go:177" controller=kubernetes-pdb node=ip-10-0-2-62
[longhorn-manager-nj822] time="2024-02-21T06:14:00Z" level=info msg="Event(v1.ObjectReference{Kind:\"Replica\", Namespace:\"longhorn-system\", Name:\"longhorn-testvol-llisw1-r-b1867404\", UID:\"f2b627d9-d3f1-4a76-a27f-31297b61d394\", APIVersion:\"longhorn.io/v1beta2\", ResourceVersion:\"46306\", FieldPath:\"\"}): type: 'Normal' reason: 'Start' Starts longhorn-testvol-llisw1-r-b1867404" func="record.(*eventBroadcasterImpl).StartLogging.func1" file="event.go:298"
[longhorn-manager-4hdqm] time="2024-02-21T06:14:00Z" level=info msg="Creating PDB csi-provisioner" func="controller.(*KubernetesPDBController).reconcile" file="kubernetes_pdb_controller.go:177" controller=kubernetes-pdb node=ip-10-0-2-62
[longhorn-manager-6rjkv] time="2024-02-21T06:14:00Z" level=info msg="Creating PDB csi-attacher" func="controller.(*KubernetesPDBController).reconcile" file="kubernetes_pdb_controller.go:177" controller=kubernetes-pdb node=ip-10-0-2-36
[instance-manager-16d650f38cc11d9112b544b831ce8876] [longhorn-testvol-llisw1-r-2c35a938] warning: GOCOVERDIR not set, no coverage data emitted
[instance-manager-16d650f38cc11d9112b544b831ce8876] [longhorn-testvol-llisw1-r-2c35a938] time="2024-02-21T06:14:00Z" level=info msg="Listening on sync 0.0.0.0:10002" func=cmd.startSyncAgent file="sync_agent.go:89"
[instance-manager-6f6372acb5d67390c3b9274dd68b2ab7] [longhorn-instance-manager] time="2024-02-21T06:14:00Z" level=info msg="Creating process" func="client.(*ProcessManagerClient).ProcessCreate" file="process_manager.go:96" args="[--volume-name longhorn-testvol-llisw1 replica /host/var/lib/longhorn/replicas/longhorn-testvol-llisw1-9d006473 --size 2147483648 --replica-instance-name longhorn-testvol-llisw1-r-b1867404 --snapshot-max-count 250 --snapshot-max-size 0 --sync-agent-port-count 7]" binary=/host/var/lib/longhorn/engine-binaries/longhornio-longhorn-engine-master-head/longhorn name=longhorn-testvol-llisw1-r-b1867404 portArgs="[--listen,0.0.0.0:]" portCount=10
[longhorn-manager-6rjkv] time="2024-02-21T06:14:00Z" level=info msg="Creating PDB csi-provisioner" func="controller.(*KubernetesPDBController).reconcile" file="kubernetes_pdb_controller.go:177" controller=kubernetes-pdb node=ip-10-0-2-36
[longhorn-manager-nj822] time="2024-02-21T06:14:00Z" level=info msg="Creating PDB csi-attacher" func="controller.(*KubernetesPDBController).reconcile" file="kubernetes_pdb_controller.go:177" controller=kubernetes-pdb node=ip-10-0-2-231
[instance-manager-6f6372acb5d67390c3b9274dd68b2ab7] [longhorn-instance-manager] time="2024-02-21T06:14:00Z" level=info msg="Process Manager: prepare to create process longhorn-testvol-llisw1-r-b1867404" func="process.(*Manager).ProcessCreate" file="process_manager.go:222"
[longhorn-manager-6rjkv] time="2024-02-21T06:14:00Z" level=debug msg="Ignoring recurring job labels on Volume longhorn-testvol-llisw1 PVC longhorn-testvol-llisw1-pvc due to missing source label" func="controller.(*VolumeController).syncPVCRecurringJobLabels" file="volume_controller.go:3668" controller=longhorn-volume node=ip-10-0-2-36
[longhorn-manager-nj822] time="2024-02-21T06:14:00Z" level=info msg="Creating PDB csi-provisioner" func="controller.(*KubernetesPDBController).reconcile" file="kubernetes_pdb_controller.go:177" controller=kubernetes-pdb node=ip-10-0-2-231
[instance-manager-6f6372acb5d67390c3b9274dd68b2ab7] [longhorn-instance-manager] time="2024-02-21T06:14:00Z" level=debug msg="Process Manager: validate process path: /host/var/lib/longhorn/engine-binaries/longhornio-longhorn-engine-master-head/longhorn dir: /host/var/lib/longhorn/engine-binaries/ image: longhornio-longhorn-engine-master-head binary: longhorn" func=process.ensureValidProcessPath file="process_manager.go:203"
[instance-manager-6f6372acb5d67390c3b9274dd68b2ab7] [longhorn-instance-manager] time="2024-02-21T06:14:00Z" level=info msg="Process Manager: created process longhorn-testvol-llisw1-r-b1867404" func="process.(*Manager).ProcessCreate" file="process_manager.go:264"
[instance-manager-6f6372acb5d67390c3b9274dd68b2ab7] [longhorn-testvol-llisw1-r-b1867404] warning: GOCOVERDIR not set, no coverage data emitted
[instance-manager-6f6372acb5d67390c3b9274dd68b2ab7] [longhorn-testvol-llisw1-r-b1867404] time="2024-02-21T06:14:00Z" level=info msg="Listening on gRPC Replica server 0.0.0.0:10000" func=cmd.startReplica.func1 file="replica.go:156"
[instance-manager-6f6372acb5d67390c3b9274dd68b2ab7] time="2024-02-21T06:14:00Z" level=info msg="Listening on sync agent server 0.0.0.0:10002" func=cmd.startReplica.func3 file="replica.go:193"
[instance-manager-6f6372acb5d67390c3b9274dd68b2ab7] [longhorn-testvol-llisw1-r-b1867404] time="2024-02-21T06:14:00Z" level=info msg="Listening on data server 0.0.0.0:10001" func=cmd.startReplica.func2 file="replica.go:164"
[instance-manager-6f6372acb5d67390c3b9274dd68b2ab7] [longhorn-testvol-llisw1-r-b1867404] warning: GOCOVERDIR not set, no coverage data emitted

=== VolumeManager finished createing snapshot CR ===

[longhorn-manager-nj822] time="2024-02-21T06:14:00Z" level=info msg="Created snapshot CR recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a with labels map[RecurringJob:recurring-test] for volume longhorn-testvol-llisw1" func="manager.(*VolumeManager).CreateSnapshotCR" file="snapshot.go:55"
[longhorn-manager-nj822] 10.42.2.90 - - [21/Feb/2024:06:14:00 +0000] "POST /v1/volumes/longhorn-testvol-llisw1?action=snapshotCRCreate HTTP/1.1" 200 549 "" "Go-http-client/1.1"
[instance-manager-6f6372acb5d67390c3b9274dd68b2ab7] [longhorn-testvol-llisw1-r-b1867404] time="2024-02-21T06:14:00Z" level=info msg="Listening on sync 0.0.0.0:10002" func=cmd.startSyncAgent file="sync_agent.go:89"
[longhorn-manager-nj822] 10.42.2.90 - - [21/Feb/2024:06:14:00 +0000] "POST /v1/volumes/longhorn-testvol-llisw1?action=snapshotCRGet HTTP/1.1" 200 549 "" "Go-http-client/1.1"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:00Z" level=info msg="Process longhorn-testvol-llisw1-r-67c2043c has started at localhost:10000" func="process.(*GRPCHealthChecker).WaitForRunning" file="healthchecker.go:33"
[instance-manager-16d650f38cc11d9112b544b831ce8876] [longhorn-instance-manager] time="2024-02-21T06:14:01Z" level=info msg="Process longhorn-testvol-llisw1-r-2c35a938 has started at localhost:10000" func="process.(*GRPCHealthChecker).WaitForRunning" file="healthchecker.go:33"
[instance-manager-6f6372acb5d67390c3b9274dd68b2ab7] [longhorn-instance-manager] time="2024-02-21T06:14:01Z" level=info msg="Process longhorn-testvol-llisw1-r-b1867404 has started at localhost:10000" func="process.(*GRPCHealthChecker).WaitForRunning" file="healthchecker.go:33"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:14:01 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 5560 "" "python-requests/2.25.1"
[longhorn-manager-6rjkv] time="2024-02-21T06:14:01Z" level=debug msg="[DEBUG] handleAttachmentTicketCreation/Update for snapshot recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a" func="controller.(*SnapshotController).reconcile" file="snapshot_controller.go:453"
[longhorn-manager-6rjkv] time="2024-02-21T06:14:01Z" level=info msg="Event(v1.ObjectReference{Kind:\"Snapshot\", Namespace:\"longhorn-system\", Name:\"recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a\", UID:\"7a69c4bc-1baa-45f7-a8a1-cdac3f95e585\", APIVersion:\"longhorn.io/v1beta2\", ResourceVersion:\"46314\", FieldPath:\"\"}): type: 'Warning' reason: 'SnapshotError' failed to take snapshot because the volume engine longhorn-testvol-llisw1-e-0 is not running. Waiting for the volume to be attached" func="record.(*eventBroadcasterImpl).StartLogging.func1" file="event.go:298"
[longhorn-manager-4hdqm] time="2024-02-21T06:14:01Z" level=warning msg="Instance longhorn-testvol-llisw1-r-2c35a938 starts running, Storage IP 10.42.3.6" func="controller.(*InstanceHandler).syncStatusWithInstanceManager" file="instance_handler.go:164"
[longhorn-manager-4hdqm] time="2024-02-21T06:14:01Z" level=warning msg="Instance longhorn-testvol-llisw1-r-2c35a938 starts running, IP 10.42.3.6" func="controller.(*InstanceHandler).syncStatusWithInstanceManager" file="instance_handler.go:169"
[longhorn-manager-4hdqm] time="2024-02-21T06:14:01Z" level=warning msg="Instance longhorn-testvol-llisw1-r-2c35a938 starts running, Port 10000" func="controller.(*InstanceHandler).syncStatusWithInstanceManager" file="instance_handler.go:173"
[longhorn-manager-6rjkv] time="2024-02-21T06:14:01Z" level=debug msg="Ignoring recurring job labels on Volume longhorn-testvol-llisw1 PVC longhorn-testvol-llisw1-pvc due to missing source label" func="controller.(*VolumeController).syncPVCRecurringJobLabels" file="volume_controller.go:3668" controller=longhorn-volume node=ip-10-0-2-36
[longhorn-manager-nj822] time="2024-02-21T06:14:01Z" level=warning msg="Instance longhorn-testvol-llisw1-r-b1867404 starts running, Storage IP 10.42.1.10" func="controller.(*InstanceHandler).syncStatusWithInstanceManager" file="instance_handler.go:164"
[longhorn-manager-nj822] time="2024-02-21T06:14:01Z" level=warning msg="Instance longhorn-testvol-llisw1-r-b1867404 starts running, IP 10.42.1.10" func="controller.(*InstanceHandler).syncStatusWithInstanceManager" file="instance_handler.go:169"
[longhorn-manager-nj822] time="2024-02-21T06:14:01Z" level=warning msg="Instance longhorn-testvol-llisw1-r-b1867404 starts running, Port 10000" func="controller.(*InstanceHandler).syncStatusWithInstanceManager" file="instance_handler.go:173"
[longhorn-manager-6rjkv] time="2024-02-21T06:14:01Z" level=debug msg="Ignoring recurring job labels on Volume longhorn-testvol-llisw1 PVC longhorn-testvol-llisw1-pvc due to missing source label" func="controller.(*VolumeController).syncPVCRecurringJobLabels" file="volume_controller.go:3668" controller=longhorn-volume node=ip-10-0-2-36
[longhorn-manager-6rjkv] time="2024-02-21T06:14:01Z" level=warning msg="Instance longhorn-testvol-llisw1-r-67c2043c starts running, Storage IP 10.42.2.7" func="controller.(*InstanceHandler).syncStatusWithInstanceManager" file="instance_handler.go:164"
[longhorn-manager-6rjkv] time="2024-02-21T06:14:01Z" level=warning msg="Instance longhorn-testvol-llisw1-r-67c2043c starts running, IP 10.42.2.7" func="controller.(*InstanceHandler).syncStatusWithInstanceManager" file="instance_handler.go:169"
[longhorn-manager-6rjkv] time="2024-02-21T06:14:01Z" level=warning msg="Instance longhorn-testvol-llisw1-r-67c2043c starts running, Port 10000" func="controller.(*InstanceHandler).syncStatusWithInstanceManager" file="instance_handler.go:173"
[longhorn-manager-6rjkv] time="2024-02-21T06:14:01Z" level=debug msg="Ignoring recurring job labels on Volume longhorn-testvol-llisw1 PVC longhorn-testvol-llisw1-pvc due to missing source label" func="controller.(*VolumeController).syncPVCRecurringJobLabels" file="volume_controller.go:3668" controller=longhorn-volume node=ip-10-0-2-36
[longhorn-manager-4hdqm] time="2024-02-21T06:14:01Z" level=info msg="Request (user: system:serviceaccount:longhorn-system:longhorn-service-account, longhorn.io/v1beta2, Kind=Engine, namespace: longhorn-system, name: longhorn-testvol-llisw1-e-0, operation: UPDATE) patchOps: [{\"op\": \"replace\", \"path\": \"/metadata/labels\", \"value\": {\"longhornnode\":\"ip-10-0-2-36\",\"longhornvolume\":\"longhorn-testvol-llisw1\"}},{\"op\": \"replace\", \"path\": \"/metadata/finalizers\", \"value\": [\"longhorn.io\"]}]" func="admission.(*Handler).admit" file="admission.go:115" service=admissionWebhook
[longhorn-manager-4hdqm] time="2024-02-21T06:14:01Z" level=debug msg="admit result: UPDATE longhorn.io/v1beta2, Kind=Engine longhorn-system/longhorn-testvol-llisw1-e-0 user=system:serviceaccount:longhorn-system:longhorn-service-account allowed=true err=<nil>" func="webhook.(*Router).admit" file="router.go:89"
[longhorn-manager-4hdqm] time="2024-02-21T06:14:01Z" level=debug msg="admit result: UPDATE longhorn.io/v1beta2, Kind=Engine longhorn-system/longhorn-testvol-llisw1-e-0 user=system:serviceaccount:longhorn-system:longhorn-service-account allowed=true err=<nil>" func="webhook.(*Router).admit" file="router.go:89"
[longhorn-manager-6rjkv] time="2024-02-21T06:14:01Z" level=info msg="Creating instance longhorn-testvol-llisw1-e-0" func="controller.(*InstanceHandler).createInstance" file="instance_handler.go:481"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:01Z" level=info msg="Creating instance" func="instance.(*Server).InstanceCreate" file="instance.go:113" dataEngine=DATA_ENGINE_V1 name=longhorn-testvol-llisw1-e-0 type=engine
[longhorn-manager-6rjkv] time="2024-02-21T06:14:01Z" level=info msg="Event(v1.ObjectReference{Kind:\"Engine\", Namespace:\"longhorn-system\", Name:\"longhorn-testvol-llisw1-e-0\", UID:\"ba0eb73e-97c5-411a-a239-28373e87edf5\", APIVersion:\"longhorn.io/v1beta2\", ResourceVersion:\"46325\", FieldPath:\"\"}): type: 'Normal' reason: 'Start' Starts longhorn-testvol-llisw1-e-0" func="record.(*eventBroadcasterImpl).StartLogging.func1" file="event.go:298"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:01Z" level=info msg="Creating process" func="client.(*ProcessManagerClient).ProcessCreate" file="process_manager.go:96" args="[--engine-instance-name longhorn-testvol-llisw1-e-0 controller longhorn-testvol-llisw1 --frontend tgt-blockdev --size 2147483648 --current-size 2147483648 --engine-replica-timeout 8 --file-sync-http-client-timeout 30 --snapshot-max-count 250 --snapshot-max-size 0 --replica tcp://10.42.3.6:10000 --replica tcp://10.42.2.7:10000 --replica tcp://10.42.1.10:10000]" binary=/engine-binaries/longhornio-longhorn-engine-master-head/longhorn name=longhorn-testvol-llisw1-e-0 portArgs="[--listen,0.0.0.0:]" portCount=1
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:01Z" level=info msg="Process Manager: prepare to create process longhorn-testvol-llisw1-e-0" func="process.(*Manager).ProcessCreate" file="process_manager.go:222"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:01Z" level=debug msg="Process Manager: validate process path: /engine-binaries/longhornio-longhorn-engine-master-head/longhorn dir: /engine-binaries/ image: longhornio-longhorn-engine-master-head binary: longhorn" func=process.ensureValidProcessPath file="process_manager.go:203"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:01Z" level=info msg="Process Manager: created process longhorn-testvol-llisw1-e-0" func="process.(*Manager).ProcessCreate" file="process_manager.go:264"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-e-0] warning: GOCOVERDIR not set, no coverage data emitted
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-e-0] time="2024-02-21T06:14:01Z" level=info msg="Creating volume longhorn-testvol-llisw1 controller with iSCSI target request timeout 15s and engine to replica(s) timeout 8s" func=cmd.startController file="controller.go:181"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] time="2024-02-21T06:14:01Z" level=info msg="Starting with replicas [\"tcp://10.42.3.6:10000\" \"tcp://10.42.2.7:10000\" \"tcp://10.42.1.10:10000\"]" func=cmd.startController file="controller.go:196"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] time="2024-02-21T06:14:01Z" level=info msg="Connecting to remote: 10.42.3.6:10000 (tcp)" func="remote.(*Factory).Create" file="remote.go:382"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-e-0] time="2024-02-21T06:14:01Z" level=info msg="Opening remote: 10.42.3.6:10000" func="remote.(*Remote).open" file="remote.go:68"
[instance-manager-16d650f38cc11d9112b544b831ce8876] [longhorn-testvol-llisw1-r-2c35a938] time="2024-02-21T06:14:01Z" level=info msg="New connection from: 10.42.2.7:34920" func="rpc.(*DataServer).listenAndServeTCP" file="dataserver.go:57"
[instance-manager-16d650f38cc11d9112b544b831ce8876] [longhorn-testvol-llisw1-r-2c35a938] time="2024-02-21T06:14:01Z" level=info msg="Opening replica: dir /host/var/lib/longhorn/replicas/longhorn-testvol-llisw1-b60ff363, size 2147483648, sector size 512" func="replica.(*Server).Open" file="server.go:76"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-e-0] time="2024-02-21T06:14:01Z" level=info msg="Connecting to remote: 10.42.2.7:10000 (tcp)" func="remote.(*Factory).Create" file="remote.go:382"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-e-0] time="2024-02-21T06:14:01Z" level=info msg="Opening remote: 10.42.2.7:10000" func="remote.(*Remote).open" file="remote.go:68"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-r-67c2043c] time="2024-02-21T06:14:01Z" level=info msg="New connection from: 10.42.2.7:36328" func="rpc.(*DataServer).listenAndServeTCP" file="dataserver.go:57"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-r-67c2043c] time="2024-02-21T06:14:01Z" level=info msg="Opening replica: dir /host/var/lib/longhorn/replicas/longhorn-testvol-llisw1-64b5a279, size 2147483648, sector size 512" func="replica.(*Server).Open" file="server.go:76"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-e-0] time="2024-02-21T06:14:01Z" level=info msg="Connecting to remote: 10.42.1.10:10000 (tcp)" func="remote.(*Factory).Create" file="remote.go:382"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-e-0] time="2024-02-21T06:14:01Z" level=info msg="Opening remote: 10.42.1.10:10000" func="remote.(*Remote).open" file="remote.go:68"
[instance-manager-6f6372acb5d67390c3b9274dd68b2ab7] [longhorn-testvol-llisw1-r-b1867404] time="2024-02-21T06:14:01Z" level=info msg="New connection from: 10.42.2.7:44920" func="rpc.(*DataServer).listenAndServeTCP" file="dataserver.go:57"
[instance-manager-6f6372acb5d67390c3b9274dd68b2ab7] [longhorn-testvol-llisw1-r-b1867404] time="2024-02-21T06:14:01Z" level=info msg="Opening replica: dir /host/var/lib/longhorn/replicas/longhorn-testvol-llisw1-9d006473, size 2147483648, sector size 512" func="replica.(*Server).Open" file="server.go:76"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-e-0] time="2024-02-21T06:14:01Z" level=info msg="Adding backend: tcp://10.42.3.6:10000" func="controller.(*replicator).AddBackend" file="replicator.go:58"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-e-0] time="2024-02-21T06:14:01Z" level=info msg="Adding backend: tcp://10.42.2.7:10000" func="controller.(*replicator).AddBackend" file="replicator.go:58"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] time="2024-02-21T06:14:01Z" level=info msg="Adding backend: tcp://10.42.1.10:10000" func="controller.(*replicator).AddBackend" file="replicator.go:58"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] time="2024-02-21T06:14:01Z" level=info msg="Start monitoring tcp://10.42.2.7:10000" func="controller.(*Controller).monitoring" file="control.go:1208"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] time="2024-02-21T06:14:01Z" level=info msg="Start monitoring tcp://10.42.3.6:10000" func="controller.(*Controller).monitoring" file="control.go:1208"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] time="2024-02-21T06:14:01Z" level=info msg="Start monitoring tcp://10.42.1.10:10000" func="controller.(*Controller).monitoring" file="control.go:1208"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-e-0] time="2024-02-21T06:14:01Z" level=info msg="Got backend tcp://10.42.3.6:10000 UnmapMarkSnapChainRemoved false" func="controller.(*replicator).GetUnmapMarkSnapChainRemoved" file="replicator.go:467"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-e-0] time="2024-02-21T06:14:01Z" level=info msg="Got backend tcp://10.42.2.7:10000 UnmapMarkSnapChainRemoved false" func="controller.(*replicator).GetUnmapMarkSnapChainRemoved" file="replicator.go:467"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-e-0] time="2024-02-21T06:14:01Z" level=info msg="Got backend tcp://10.42.1.10:10000 UnmapMarkSnapChainRemoved false" func="controller.(*replicator).GetUnmapMarkSnapChainRemoved" file="replicator.go:467"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] time="2024-02-21T06:14:01Z" level=info msg="Controller checked and corrected flag unmapMarkSnapChainRemoved=false for backend replicas" func="controller.(*Controller).checkUnmapMarkSnapChainRemoved" file="control.go:685"
[longhorn-manager-6rjkv] time="2024-02-21T06:14:01Z" level=debug msg="Ignoring recurring job labels on Volume longhorn-testvol-llisw1 PVC longhorn-testvol-llisw1-pvc due to missing source label" func="controller.(*VolumeController).syncPVCRecurringJobLabels" file="volume_controller.go:3668" controller=longhorn-volume node=ip-10-0-2-36
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-e-0] time="2024-02-21T06:14:01Z" level=info msg="Got backend tcp://10.42.3.6:10000 revision counter 1019" func="controller.(*replicator).GetRevisionCounter" file="replicator.go:437"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-e-0] time="2024-02-21T06:14:01Z" level=info msg="Got backend tcp://10.42.2.7:10000 revision counter 1019" func="controller.(*replicator).GetRevisionCounter" file="replicator.go:437"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-e-0] time="2024-02-21T06:14:01Z" level=info msg="Got backend tcp://10.42.1.10:10000 revision counter 1019" func="controller.(*replicator).GetRevisionCounter" file="replicator.go:437"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-e-0] time="2024-02-21T06:14:01Z" level=info msg="device longhorn-testvol-llisw1: iSCSI device /dev/longhorn/longhorn-testvol-llisw1 shutdown" func="longhorndev.(*LonghornDevice).shutdownFrontend" file="dev.go:209"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:14:02 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 5858 "" "python-requests/2.25.1"
[longhorn-manager-6rjkv] time="2024-02-21T06:14:02Z" level=debug msg="[DEBUG] handleAttachmentTicketCreation/Update for snapshot recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a" func="controller.(*SnapshotController).reconcile" file="snapshot_controller.go:453"
[longhorn-manager-6rjkv] time="2024-02-21T06:14:02Z" level=debug msg="Ignoring recurring job labels on Volume longhorn-testvol-llisw1 PVC longhorn-testvol-llisw1-pvc due to missing source label" func="controller.(*VolumeController).syncPVCRecurringJobLabels" file="volume_controller.go:3668" controller=longhorn-volume node=ip-10-0-2-36
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:02Z" level=info msg="Wait for gRPC service of process longhorn-testvol-llisw1-e-0 to start at localhost:10010" func="process.(*GRPCHealthChecker).WaitForRunning" file="healthchecker.go:36"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-e-0] go-iscsi-helper: tgtd is already running
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-e-0] time="2024-02-21T06:14:02Z" level=info msg="go-iscsi-helper: found available target id 1" func="iscsidev.(*Device).CreateTarget" file="iscsi.go:105"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] tgtd: device_mgmt(246) sz:110 params:path=/var/run/longhorn-longhorn-testvol-llisw1.sock,bstype=longhorn,bsopts=size=2147483648;request_timeout=15
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] tgtd: bs_thread_open(409) 16
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-e-0] time="2024-02-21T06:14:02Z" level=info msg="New data socket connection established" func="socket.(*Socket).handleServerConnection" file="frontend.go:133"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-e-0] time="2024-02-21T06:14:03Z" level=info msg="default: automatically rescan all LUNs of all iscsi sessions" func=iscsi.LoginTarget file="initiator.go:133"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:14:03 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 5907 "" "python-requests/2.25.1"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:03Z" level=info msg="Wait for gRPC service of process longhorn-testvol-llisw1-e-0 to start at localhost:10010" func="process.(*GRPCHealthChecker).WaitForRunning" file="healthchecker.go:36"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:14:04 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 5907 "" "python-requests/2.25.1"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-e-0] time="2024-02-21T06:14:04Z" level=info msg="Creating device /dev/longhorn/longhorn-testvol-llisw1 8:0" func=util.mknod file="util.go:95"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-e-0] time="2024-02-21T06:14:04Z" level=info msg="device longhorn-testvol-llisw1: iSCSI device sda created" func="longhorndev.(*LonghornDevice).startScsiDevice" file="dev.go:138"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] time="2024-02-21T06:14:04Z" level=info msg="Listening on gRPC Controller server: 0.0.0.0:10010" func="controller.(*Controller).StartGRPCServer.func1" file="control.go:126"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:04Z" level=info msg="Wait for gRPC service of process longhorn-testvol-llisw1-e-0 to start at localhost:10010" func="process.(*GRPCHealthChecker).WaitForRunning" file="healthchecker.go:36"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:04Z" level=info msg="Process longhorn-testvol-llisw1-e-0 has started at localhost:10010" func="process.(*GRPCHealthChecker).WaitForRunning" file="healthchecker.go:33"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:14:05 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 5907 "" "python-requests/2.25.1"
[longhorn-manager-6rjkv] time="2024-02-21T06:14:05Z" level=warning msg="Instance longhorn-testvol-llisw1-e-0 starts running, Storage IP 10.42.2.7" func="controller.(*InstanceHandler).syncStatusWithInstanceManager" file="instance_handler.go:164"
[longhorn-manager-6rjkv] time="2024-02-21T06:14:05Z" level=warning msg="Instance longhorn-testvol-llisw1-e-0 starts running, IP 10.42.2.7" func="controller.(*InstanceHandler).syncStatusWithInstanceManager" file="instance_handler.go:169"
[longhorn-manager-6rjkv] time="2024-02-21T06:14:05Z" level=warning msg="Instance longhorn-testvol-llisw1-e-0 starts running, Port 10010" func="controller.(*InstanceHandler).syncStatusWithInstanceManager" file="instance_handler.go:173"
[longhorn-manager-6rjkv] time="2024-02-21T06:14:05Z" level=info msg="Starting monitoring engine" func="controller.(*EngineMonitor).Run" file="engine_controller.go:790" controller=longhorn-engine engine=longhorn-testvol-llisw1-e-0 node=ip-10-0-2-36
[longhorn-manager-6rjkv] time="2024-02-21T06:14:05Z" level=debug msg="[DEBUG] handleAttachmentTicketCreation/Update for snapshot recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a" func="controller.(*SnapshotController).reconcile" file="snapshot_controller.go:453"
[longhorn-manager-6rjkv] time="2024-02-21T06:14:05Z" level=debug msg="Ignoring recurring job labels on Volume longhorn-testvol-llisw1 PVC longhorn-testvol-llisw1-pvc due to missing source label" func="controller.(*VolumeController).syncPVCRecurringJobLabels" file="volume_controller.go:3668" controller=longhorn-volume node=ip-10-0-2-36
[longhorn-manager-6rjkv] time="2024-02-21T06:14:05Z" level=info msg="Event(v1.ObjectReference{Kind:\"Volume\", Namespace:\"longhorn-system\", Name:\"longhorn-testvol-llisw1\", UID:\"e67523bf-18ea-458f-af6d-5a2160970c50\", APIVersion:\"longhorn.io/v1beta2\", ResourceVersion:\"46308\", FieldPath:\"\"}): type: 'Normal' reason: 'Attached' volume longhorn-testvol-llisw1 has been attached to ip-10-0-2-36" func="record.(*eventBroadcasterImpl).StartLogging.func1" file="event.go:298"

=== VolumeAttachementController updates volume attachment status ===

[longhorn-manager-6rjkv] time="2024-02-21T06:14:05Z" level=debug msg="[DEBUG] Updating volume attachment longhorn-testvol-llisw1 status" func="datastore.(*DataStore).UpdateLHVolumeAttachmentStatus" file="longhorn.go:4815" controller=volume-attachment-controller

=== SnapshotController creates snapshot ===

[longhorn-manager-6rjkv] time="2024-02-21T06:14:05Z" level=info msg="Creating snapshot recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a of volume longhorn-testvol-llisw1" func="controller.(*SnapshotController).handleSnapshotCreate" file="snapshot_controller.go:646" controller=longhorn-snapshot node=ip-10-0-2-36
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:05Z" level=info msg="Snapshotting volume: snapshot recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a" func="proxy.(*Proxy).VolumeSnapshot" file="snapshot.go:28" dataEngine=DATA_ENGINE_V1 engineName=longhorn-testvol-llisw1-e-0 serviceURL="10.42.2.7:10010" volumeName=longhorn-testvol-llisw1
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-e-0] time="2024-02-21T06:14:05Z" level=info msg="Starting snapshot" func="controller.(*Controller).Snapshot" file="control.go:193" snapshot=recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a volume=longhorn-testvol-llisw1
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] time="2024-02-21T06:14:05Z" level=info msg="Requesting system sync before snapshot" func="controller.(*Controller).Snapshot" file="control.go:195" snapshot=recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a volume=longhorn-testvol-llisw1
[longhorn-manager-6rjkv] time="2024-02-21T06:14:05Z" level=debug msg="Ignoring recurring job labels on Volume longhorn-testvol-llisw1 PVC longhorn-testvol-llisw1-pvc due to missing source label" func="controller.(*VolumeController).syncPVCRecurringJobLabels" file="volume_controller.go:3668" controller=longhorn-volume node=ip-10-0-2-36
[longhorn-manager-nj822] 10.42.2.90 - - [21/Feb/2024:06:14:05 +0000] "POST /v1/volumes/longhorn-testvol-llisw1?action=snapshotCRGet HTTP/1.1" 200 691 "" "Go-http-client/1.1"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-e-0] time="2024-02-21T06:14:06Z" level=info msg="Starting to snapshot: 10.42.1.10:10000 recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a UserCreated true Created at 2024-02-21T06:14:06Z, Labels map[RecurringJob:recurring-test]" func="remote.(*Remote).Snapshot" file="remote.go:88"
[instance-manager-6f6372acb5d67390c3b9274dd68b2ab7] [longhorn-testvol-llisw1-r-b1867404] time="2024-02-21T06:14:06Z" level=info msg="Replica server starts to snapshot [recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a] volume, user created true, created time 2024-02-21T06:14:06Z, labels map[RecurringJob:recurring-test]" func="replica.(*Server).Snapshot" file="server.go:185"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-e-0] time="2024-02-21T06:14:06Z" level=info msg="Starting to snapshot: 10.42.3.6:10000 recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a UserCreated true Created at 2024-02-21T06:14:06Z, Labels map[RecurringJob:recurring-test]" func="remote.(*Remote).Snapshot" file="remote.go:88"
[instance-manager-16d650f38cc11d9112b544b831ce8876] [longhorn-testvol-llisw1-r-2c35a938] time="2024-02-21T06:14:06Z" level=info msg="Replica server starts to snapshot [recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a] volume, user created true, created time 2024-02-21T06:14:06Z, labels map[RecurringJob:recurring-test]" func="replica.(*Server).Snapshot" file="server.go:185"
[instance-manager-6f6372acb5d67390c3b9274dd68b2ab7] time="2024-02-21T06:14:06Z" level=info msg="Starting to create disk" func="replica.(*Replica).createDisk" file="replica.go:889" disk=recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] time="2024-02-21T06:14:06Z" level=info msg="Starting to snapshot: 10.42.2.7:10000 recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a UserCreated true Created at 2024-02-21T06:14:06Z, Labels map[RecurringJob:recurring-test]" func="remote.(*Remote).Snapshot" file="remote.go:88"
[instance-manager-16d650f38cc11d9112b544b831ce8876] time="2024-02-21T06:14:06Z" level=info msg="Starting to create disk" func="replica.(*Replica).createDisk" file="replica.go:889" disk=recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a
[instance-manager-6f6372acb5d67390c3b9274dd68b2ab7] [longhorn-testvol-llisw1-r-b1867404] time="2024-02-21T06:14:06Z" level=info msg="Cleaning up new disk metadata file path /host/var/lib/longhorn/replicas/longhorn-testvol-llisw1-9d006473/volume-snap-recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a.img.meta before linking" func="replica.(*Replica).linkDisk" file="replica.go:774"
[instance-manager-6f6372acb5d67390c3b9274dd68b2ab7] time="2024-02-21T06:14:06Z" level=info msg="Cleaning up new disk checksum file /host/var/lib/longhorn/replicas/longhorn-testvol-llisw1-9d006473/volume-snap-recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a.img.checksum before linking" func="replica.(*Replica).linkDisk" file="replica.go:780"
[instance-manager-6f6372acb5d67390c3b9274dd68b2ab7] time="2024-02-21T06:14:06Z" level=info msg="Cleaning up new disk file /host/var/lib/longhorn/replicas/longhorn-testvol-llisw1-9d006473/volume-snap-recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a.img before linking" func="replica.(*Replica).linkDisk" file="replica.go:786"
[instance-manager-6f6372acb5d67390c3b9274dd68b2ab7] time="2024-02-21T06:14:06Z" level=info msg="Finished creating disk" func="replica.(*Replica).createDisk" file="replica.go:987" disk=recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a
[instance-manager-16d650f38cc11d9112b544b831ce8876] [longhorn-testvol-llisw1-r-2c35a938] time="2024-02-21T06:14:06Z" level=info msg="Cleaning up new disk metadata file path /host/var/lib/longhorn/replicas/longhorn-testvol-llisw1-b60ff363/volume-snap-recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a.img.meta before linking" func="replica.(*Replica).linkDisk" file="replica.go:774"
[instance-manager-6f6372acb5d67390c3b9274dd68b2ab7] time="2024-02-21T06:14:06Z" level=info msg="Removing disk volume-head-000.img" func="replica.(*Replica).rmDisk" file="replica.go:829"

=== Finished snapshot ===

[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-e-0] time="2024-02-21T06:14:06Z" level=info msg="Finished to snapshot: 10.42.2.7:10000 recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a UserCreated true Created at 2024-02-21T06:14:06Z, Labels map[RecurringJob:recurring-test]" func="remote.(*Remote).Snapshot" file="remote.go:109"
[instance-manager-16d650f38cc11d9112b544b831ce8876] time="2024-02-21T06:14:06Z" level=info msg="Cleaning up new disk checksum file /host/var/lib/longhorn/replicas/longhorn-testvol-llisw1-b60ff363/volume-snap-recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a.img.checksum before linking" func="replica.(*Replica).linkDisk" file="replica.go:780"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-e-0] time="2024-02-21T06:14:06Z" level=info msg="Finished to snapshot: 10.42.1.10:10000 recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a UserCreated true Created at 2024-02-21T06:14:06Z, Labels map[RecurringJob:recurring-test]" func="remote.(*Remote).Snapshot" file="remote.go:109"
[instance-manager-16d650f38cc11d9112b544b831ce8876] time="2024-02-21T06:14:06Z" level=info msg="Cleaning up new disk file /host/var/lib/longhorn/replicas/longhorn-testvol-llisw1-b60ff363/volume-snap-recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a.img before linking" func="replica.(*Replica).linkDisk" file="replica.go:786"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] time="2024-02-21T06:14:06Z" level=info msg="Finished to snapshot: 10.42.3.6:10000 recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a UserCreated true Created at 2024-02-21T06:14:06Z, Labels map[RecurringJob:recurring-test]" func="remote.(*Remote).Snapshot" file="remote.go:109"
[instance-manager-16d650f38cc11d9112b544b831ce8876] time="2024-02-21T06:14:06Z" level=info msg="Finished creating disk" func="replica.(*Replica).createDisk" file="replica.go:987" disk=recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-e-0] time="2024-02-21T06:14:06Z" level=info msg="Finished snapshot" func="controller.(*Controller).Snapshot" file="control.go:216" snapshot=recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a volume=longhorn-testvol-llisw1
[instance-manager-16d650f38cc11d9112b544b831ce8876] time="2024-02-21T06:14:06Z" level=info msg="Removing disk volume-head-000.img" func="replica.(*Replica).rmDisk" file="replica.go:829"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-r-67c2043c] time="2024-02-21T06:14:06Z" level=info msg="Replica server starts to snapshot [recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a] volume, user created true, created time 2024-02-21T06:14:06Z, labels map[RecurringJob:recurring-test]" func="replica.(*Server).Snapshot" file="server.go:185"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] time="2024-02-21T06:14:06Z" level=info msg="Starting to create disk" func="replica.(*Replica).createDisk" file="replica.go:889" disk=recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:14:06 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 8653 "" "python-requests/2.25.1"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] time="2024-02-21T06:14:06Z" level=info msg="Cleaning up new disk metadata file path /host/var/lib/longhorn/replicas/longhorn-testvol-llisw1-64b5a279/volume-snap-recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a.img.meta before linking" func="replica.(*Replica).linkDisk" file="replica.go:774"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] time="2024-02-21T06:14:06Z" level=info msg="Cleaning up new disk checksum file /host/var/lib/longhorn/replicas/longhorn-testvol-llisw1-64b5a279/volume-snap-recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a.img.checksum before linking" func="replica.(*Replica).linkDisk" file="replica.go:780"
[longhorn-manager-4hdqm] time="2024-02-21T06:14:06Z" level=info msg="Forwarding request to 10.42.2.82:9500" func="api.(*Fwd).HandleProxyRequestByNodeID" file="forwarder.go:136"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] time="2024-02-21T06:14:06Z" level=info msg="Cleaning up new disk file /host/var/lib/longhorn/replicas/longhorn-testvol-llisw1-64b5a279/volume-snap-recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a.img before linking" func="replica.(*Replica).linkDisk" file="replica.go:786"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] time="2024-02-21T06:14:06Z" level=info msg="Finished creating disk" func="replica.(*Replica).createDisk" file="replica.go:987" disk=recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] time="2024-02-21T06:14:06Z" level=info msg="Removing disk volume-head-000.img" func="replica.(*Replica).rmDisk" file="replica.go:829"
[longhorn-manager-6rjkv] 10.42.1.16 - - [21/Feb/2024:06:14:06 +0000] "POST /v1/volumes/longhorn-testvol-llisw1?action=snapshotList HTTP/1.1" 200 882 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:14:06 +0000] "POST /v1/volumes/longhorn-testvol-llisw1?action=snapshotList HTTP/1.1" 200 882 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:14:06 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 8653 "" "python-requests/2.25.1"
[longhorn-manager-6rjkv] time="2024-02-21T06:14:06Z" level=debug msg="Ignoring recurring job labels on Volume longhorn-testvol-llisw1 PVC longhorn-testvol-llisw1-pvc due to missing source label" func="controller.(*VolumeController).syncPVCRecurringJobLabels" file="volume_controller.go:3668" controller=longhorn-volume node=ip-10-0-2-36
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:14:07 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 8653 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:14:08 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 8653 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:14:09 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 8653 "" "python-requests/2.25.1"
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:14:10 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 8653 "" "python-requests/2.25.1"
[longhorn-manager-nj822] time="2024-02-21T06:14:10Z" level=debug msg="Triggering sync backup target" func="controller.(*BackupStoreTimer).Start.func1" file="setting_controller.go:1061" component=backup-store-timer controller=longhorn-setting interval=30s node=ip-10-0-2-231
[longhorn-manager-6rjkv] time="2024-02-21T06:14:10Z" level=debug msg="[DEBUG] handleAttachmentTicketCreation/Update for snapshot recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a" func="controller.(*SnapshotController).reconcile" file="snapshot_controller.go:453"
[longhorn-manager-6rjkv] time="2024-02-21T06:14:10Z" level=debug msg="Ignoring recurring job labels on Volume longhorn-testvol-llisw1 PVC longhorn-testvol-llisw1-pvc due to missing source label" func="controller.(*VolumeController).syncPVCRecurringJobLabels" file="volume_controller.go:3668" controller=longhorn-volume node=ip-10-0-2-36
[longhorn-manager-4hdqm] time="2024-02-21T06:14:10Z" level=info msg="Request (user: system:serviceaccount:longhorn-system:longhorn-service-account, longhorn.io/v1beta2, Kind=Replica, namespace: longhorn-system, name: longhorn-testvol-llisw1-r-67c2043c, operation: UPDATE) patchOps: [{\"op\": \"replace\", \"path\": \"/metadata/labels\", \"value\": {\"longhorn.io/backing-image\":\"\",\"longhorndiskuuid\":\"4ea76150-406f-424a-b1e0-c6f499a7d608\",\"longhornnode\":\"ip-10-0-2-36\",\"longhornvolume\":\"longhorn-testvol-llisw1\"}},{\"op\": \"replace\", \"path\": \"/metadata/finalizers\", \"value\": [\"longhorn.io\"]}]" func="admission.(*Handler).admit" file="admission.go:115" service=admissionWebhook
[longhorn-manager-4hdqm] time="2024-02-21T06:14:10Z" level=debug msg="admit result: UPDATE longhorn.io/v1beta2, Kind=Replica longhorn-system/longhorn-testvol-llisw1-r-67c2043c user=system:serviceaccount:longhorn-system:longhorn-service-account allowed=true err=<nil>" func="webhook.(*Router).admit" file="router.go:89"
[longhorn-manager-4hdqm] time="2024-02-21T06:14:10Z" level=debug msg="admit result: UPDATE longhorn.io/v1beta2, Kind=Replica longhorn-system/longhorn-testvol-llisw1-r-67c2043c user=system:serviceaccount:longhorn-system:longhorn-service-account allowed=true err=<nil>" func="webhook.(*Router).admit" file="router.go:89"
[longhorn-manager-nj822] time="2024-02-21T06:14:10Z" level=info msg="Forwarding request to 10.42.2.82:9500" func="api.(*Fwd).HandleProxyRequestByNodeID" file="forwarder.go:136"
[longhorn-manager-nj822] 10.42.2.90 - - [21/Feb/2024:06:14:10 +0000] "POST /v1/volumes/longhorn-testvol-llisw1?action=snapshotCRGet HTTP/1.1" 200 601 "" "Go-http-client/1.1"

=== requested for backup ===

[longhorn-manager-4hdqm] time="2024-02-21T06:14:10Z" level=info msg="Request (user: system:serviceaccount:longhorn-system:longhorn-service-account, longhorn.io/v1beta2, Kind=Backup, namespace: longhorn-system, name: backup-79bfec158ca24393, operation: CREATE) patchOps: [{\"op\": \"replace\", \"path\": \"/metadata/finalizers\", \"value\": [\"longhorn.io\"]},{\"op\": \"replace\", \"path\": \"/spec/labels\", \"value\": {\"KubernetesStatus\":\"{\\\"pvName\\\":\\\"longhorn-testvol-llisw1-pv\\\",\\\"pvStatus\\\":\\\"Bound\\\",\\\"namespace\\\":\\\"default\\\",\\\"pvcName\\\":\\\"longhorn-testvol-llisw1-pvc\\\",\\\"lastPVCRefAt\\\":\\\"\\\",\\\"workloadsStatus\\\":[{\\\"podName\\\":\\\"longhorn-testvol-llisw1-dep-5549f647fd-blxmx\\\",\\\"podStatus\\\":\\\"Succeeded\\\",\\\"workloadName\\\":\\\"longhorn-testvol-llisw1-dep-5549f647fd\\\",\\\"workloadType\\\":\\\"ReplicaSet\\\"}],\\\"lastPodRefAt\\\":\\\"2024-02-21T06:10:54Z\\\"}\",\"RecurringJob\":\"recurring-test\",\"longhorn.io/volume-access-mode\":\"rwo\"}}]" func="admission.(*Handler).admit" file="admission.go:115" service=admissionWebhook
[longhorn-manager-4hdqm] time="2024-02-21T06:14:10Z" level=debug msg="admit result: CREATE longhorn.io/v1beta2, Kind=Backup longhorn-system/backup-79bfec158ca24393 user=system:serviceaccount:longhorn-system:longhorn-service-account allowed=true err=<nil>" func="webhook.(*Router).admit" file="router.go:89"
[longhorn-manager-4hdqm] time="2024-02-21T06:14:10Z" level=info msg="Request (user: system:serviceaccount:longhorn-system:longhorn-service-account, longhorn.io/v1beta2, Kind=Replica, namespace: longhorn-system, name: longhorn-testvol-llisw1-r-2c35a938, operation: UPDATE) patchOps: [{\"op\": \"replace\", \"path\": \"/metadata/labels\", \"value\": {\"longhorn.io/backing-image\":\"\",\"longhorndiskuuid\":\"d24b38fa-0608-425b-b08f-0c6d921952c4\",\"longhornnode\":\"ip-10-0-2-62\",\"longhornvolume\":\"longhorn-testvol-llisw1\"}},{\"op\": \"replace\", \"path\": \"/metadata/finalizers\", \"value\": [\"longhorn.io\"]}]" func="admission.(*Handler).admit" file="admission.go:115" service=admissionWebhook

=== SnapshotController deletes volume-attachment ===

[longhorn-manager-6rjkv] time="2024-02-21T06:14:10Z" level=debug msg="[DEBUG] handleAttachmentTicketDeletion for snapshot recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a" func="controller.(*SnapshotController).reconcile.func3" file="snapshot_controller.go:441" alreadyCreatedBefore=true requestCreateNewSnapshot=true
[longhorn-manager-4hdqm] time="2024-02-21T06:14:10Z" level=debug msg="admit result: UPDATE longhorn.io/v1beta2, Kind=Replica longhorn-system/longhorn-testvol-llisw1-r-2c35a938 user=system:serviceaccount:longhorn-system:longhorn-service-account allowed=true err=<nil>" func="webhook.(*Router).admit" file="router.go:89"
[longhorn-manager-6rjkv] time="2024-02-21T06:14:10Z" level=debug msg="[DEBUG] Updating volume attachment longhorn-testvol-llisw1" func="datastore.(*DataStore).UpdateLHVolumeAttachment" file="longhorn.go:4802" controller=ip-10-0-2-36

=== Snapshot created ===

[longhorn-manager-6rjkv] time="2024-02-21T06:14:10Z" level=info msg="Event(v1.ObjectReference{Kind:\"Snapshot\", Namespace:\"longhorn-system\", Name:\"recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a\", UID:\"7a69c4bc-1baa-45f7-a8a1-cdac3f95e585\", APIVersion:\"longhorn.io/v1beta2\", ResourceVersion:\"46351\", FieldPath:\"\"}): type: 'Normal' reason: 'SnapshotCreate' successfully provisioned the snapshot" func="record.(*eventBroadcasterImpl).StartLogging.func1" file="event.go:298"
[longhorn-manager-4hdqm] time="2024-02-21T06:14:10Z" level=debug msg="admit result: UPDATE longhorn.io/v1beta2, Kind=Replica longhorn-system/longhorn-testvol-llisw1-r-2c35a938 user=system:serviceaccount:longhorn-system:longhorn-service-account allowed=true err=<nil>" func="webhook.(*Router).admit" file="router.go:89"
[longhorn-manager-6rjkv] time="2024-02-21T06:14:10Z" level=info msg="Event(v1.ObjectReference{Kind:\"Snapshot\", Namespace:\"longhorn-system\", Name:\"recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a\", UID:\"7a69c4bc-1baa-45f7-a8a1-cdac3f95e585\", APIVersion:\"longhorn.io/v1beta2\", ResourceVersion:\"46351\", FieldPath:\"\"}): type: 'Normal' reason: 'SnapshotUpdate' snapshot becomes ready to use" func="record.(*eventBroadcasterImpl).StartLogging.func1" file="event.go:298"

=== Updating volume attachment ===

[longhorn-manager-4hdqm] time="2024-02-21T06:14:10Z" level=info msg="Request (user: system:serviceaccount:longhorn-system:longhorn-service-account, longhorn.io/v1beta2, Kind=VolumeAttachment, namespace: longhorn-system, name: longhorn-testvol-llisw1, operation: UPDATE) patchOps: [{\"op\": \"replace\", \"path\": \"/metadata/finalizers\", \"value\": [\"longhorn.io\"]}]" func="admission.(*Handler).admit" file="admission.go:115" service=admissionWebhook
[longhorn-manager-4hdqm] time="2024-02-21T06:14:10Z" level=debug msg="admit result: UPDATE longhorn.io/v1beta2, Kind=VolumeAttachment longhorn-system/longhorn-testvol-llisw1 user=system:serviceaccount:longhorn-system:longhorn-service-account allowed=true err=<nil>" func="webhook.(*Router).admit" file="router.go:89"
[longhorn-manager-4hdqm] time="2024-02-21T06:14:10Z" level=debug msg="admit result: UPDATE longhorn.io/v1beta2, Kind=VolumeAttachment longhorn-system/longhorn-testvol-llisw1 user=system:serviceaccount:longhorn-system:longhorn-service-account allowed=true err=<nil>" func="webhook.(*Router).admit" file="router.go:89"

=== Detaching volume ===

[longhorn-manager-6rjkv] time="2024-02-21T06:14:10Z" level=info msg="Should detach volume ip-10-0-2-36 because there is no matching attachment ticket" func="controller.(*VolumeAttachmentController).shouldDoDetach" file="volume_attachment_controller.go:573" controller=longhorn-volume-attachment longhornVolumeAttachment=longhorn-testvol-llisw1 node=ip-10-0-2-36
[longhorn-manager-6rjkv] time="2024-02-21T06:14:10Z" level=info msg="Volume longhorn-testvol-llisw1 is selected to detach from node ip-10-0-2-36" func="controller.(*VolumeAttachmentController).handleVolumeDetachment" file="volume_attachment_controller.go:532" controller=longhorn-volume-attachment longhornVolumeAttachment=longhorn-testvol-llisw1 node=ip-10-0-2-36
[longhorn-manager-4hdqm] time="2024-02-21T06:14:10Z" level=info msg="Request (user: system:serviceaccount:longhorn-system:longhorn-service-account, longhorn.io/v1beta2, Kind=Volume, namespace: longhorn-system, name: longhorn-testvol-llisw1, operation: UPDATE) patchOps: [{\"op\": \"replace\", \"path\": \"/metadata/finalizers\", \"value\": [\"longhorn.io\"]},{\"op\": \"replace\", \"path\": \"/metadata/labels\", \"value\": {\"longhornvolume\":\"longhorn-testvol-llisw1\",\"recurring-job-group.longhorn.io/default\":\"enabled\",\"recurring-job.longhorn.io/recurring-test\":\"enabled\",\"setting.longhorn.io/remove-snapshots-during-filesystem-trim\":\"ignored\",\"setting.longhorn.io/replica-auto-balance\":\"ignored\",\"setting.longhorn.io/snapshot-data-integrity\":\"ignored\"}}]" func="admission.(*Handler).admit" file="admission.go:115" service=admissionWebhook
[longhorn-manager-4hdqm] time="2024-02-21T06:14:10Z" level=debug msg="admit result: UPDATE longhorn.io/v1beta2, Kind=Volume longhorn-system/longhorn-testvol-llisw1 user=system:serviceaccount:longhorn-system:longhorn-service-account allowed=true err=<nil>" func="webhook.(*Router).admit" file="router.go:89"
[longhorn-manager-4hdqm] time="2024-02-21T06:14:10Z" level=debug msg="admit result: UPDATE longhorn.io/v1beta2, Kind=Volume longhorn-system/longhorn-testvol-llisw1 user=system:serviceaccount:longhorn-system:longhorn-service-account allowed=true err=<nil>" func="webhook.(*Router).admit" file="router.go:89"

=== BackupController updates volume attachement ===

[longhorn-manager-6rjkv] 10.42.2.90 - - [21/Feb/2024:06:14:10 +0000] "POST /v1/volumes/longhorn-testvol-llisw1?action=snapshotBackup HTTP/1.1" 200 9356 "" "Go-http-client/1.1"
[longhorn-manager-nj822] 10.42.2.90 - - [21/Feb/2024:06:14:10 +0000] "POST /v1/volumes/longhorn-testvol-llisw1?action=snapshotBackup HTTP/1.1" 200 9356 "" "Go-http-client/1.1"
[longhorn-manager-nj822] 10.42.2.90 - - [21/Feb/2024:06:14:11 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 9578 "" "Go-http-client/1.1"
[longhorn-manager-nj822] time="2024-02-21T06:14:11Z" level=debug msg="[DEBUG] handleAttachmentTicketCreation/Update for backup backup-79bfec158ca24393" func="controller.(*BackupController).reconcile" file="backup_controller.go:371"
[longhorn-manager-nj822] time="2024-02-21T06:14:11Z" level=debug msg="[DEBUG][handleAttachmentTicketCreation] updating attachment longhorn-testvol-llisw1 FROM {AttachmentTickets:map[] Volume:longhorn-testvol-llisw1}" func="controller.(*BackupController).handleAttachmentTicketCreation" file="backup_controller.go:514" controller=longhorn-backup node=ip-10-0-2-231
[longhorn-manager-nj822] time="2024-02-21T06:14:11Z" level=debug msg="[DEBUG][handleAttachmentTicketCreation] updating attachment longhorn-testvol-llisw1 TO {AttachmentTickets:map[backup-controller-backup-79bfec158ca24393:0xc0021def80] Volume:longhorn-testvol-llisw1}" func="controller.(*BackupController).handleAttachmentTicketCreation" file="backup_controller.go:519" controller=longhorn-backup node=ip-10-0-2-231
[longhorn-manager-4hdqm] time="2024-02-21T06:14:11Z" level=info msg="Request (user: system:serviceaccount:longhorn-system:longhorn-service-account, longhorn.io/v1beta2, Kind=VolumeAttachment, namespace: longhorn-system, name: longhorn-testvol-llisw1, operation: UPDATE) patchOps: [{\"op\": \"replace\", \"path\": \"/metadata/finalizers\", \"value\": [\"longhorn.io\"]}]" func="admission.(*Handler).admit" file="admission.go:115" service=admissionWebhook
[longhorn-manager-nj822] time="2024-02-21T06:14:11Z" level=debug msg="[DEBUG][handleAttachmentTicketCreation] updating attachment: longhorn-testvol-llisw1" func="controller.(*BackupController).handleAttachmentTicketCreation.func2" file="backup_controller.go:508" controller=longhorn-backup node=ip-10-0-2-231
[longhorn-manager-4hdqm] time="2024-02-21T06:14:11Z" level=debug msg="admit result: UPDATE longhorn.io/v1beta2, Kind=VolumeAttachment longhorn-system/longhorn-testvol-llisw1 user=system:serviceaccount:longhorn-system:longhorn-service-account allowed=true err=<nil>" func="webhook.(*Router).admit" file="router.go:89"
[longhorn-manager-nj822] time="2024-02-21T06:14:11Z" level=debug msg="[DEBUG] Updating volume attachment longhorn-testvol-llisw1" func="datastore.(*DataStore).UpdateLHVolumeAttachment" file="longhorn.go:4802" controller=ip-10-0-2-231
[longhorn-manager-4hdqm] time="2024-02-21T06:14:11Z" level=debug msg="admit result: UPDATE longhorn.io/v1beta2, Kind=VolumeAttachment longhorn-system/longhorn-testvol-llisw1 user=system:serviceaccount:longhorn-system:longhorn-service-account allowed=true err=<nil>" func="webhook.(*Router).admit" file="router.go:89"
[longhorn-manager-4hdqm] time="2024-02-21T06:14:11Z" level=info msg="Request (user: system:serviceaccount:longhorn-system:longhorn-service-account, longhorn.io/v1beta2, Kind=Replica, namespace: longhorn-system, name: longhorn-testvol-llisw1-r-b1867404, operation: UPDATE) patchOps: [{\"op\": \"replace\", \"path\": \"/metadata/labels\", \"value\": {\"longhorn.io/backing-image\":\"\",\"longhorndiskuuid\":\"f48bf0ec-2725-4cc8-802e-c502c53c8d61\",\"longhornnode\":\"ip-10-0-2-231\",\"longhornvolume\":\"longhorn-testvol-llisw1\"}},{\"op\": \"replace\", \"path\": \"/metadata/finalizers\", \"value\": [\"longhorn.io\"]}]" func="admission.(*Handler).admit" file="admission.go:115" service=admissionWebhook
[longhorn-manager-4hdqm] time="2024-02-21T06:14:11Z" level=debug msg="admit result: UPDATE longhorn.io/v1beta2, Kind=Replica longhorn-system/longhorn-testvol-llisw1-r-b1867404 user=system:serviceaccount:longhorn-system:longhorn-service-account allowed=true err=<nil>" func="webhook.(*Router).admit" file="router.go:89"
[longhorn-manager-4hdqm] time="2024-02-21T06:14:11Z" level=debug msg="admit result: UPDATE longhorn.io/v1beta2, Kind=Replica longhorn-system/longhorn-testvol-llisw1-r-b1867404 user=system:serviceaccount:longhorn-system:longhorn-service-account allowed=true err=<nil>" func="webhook.(*Router).admit" file="router.go:89"

=== VolumeAttachmentController failed to update volume attachment status ===

[longhorn-manager-6rjkv] time="2024-02-21T06:14:11Z" level=debug msg="[DEBUG] Updating volume attachment longhorn-testvol-llisw1 status" func="datastore.(*DataStore).UpdateLHVolumeAttachmentStatus" file="longhorn.go:4815" controller=volume-attachment-controller
[longhorn-manager-6rjkv] time="2024-02-21T06:14:11Z" level=warning msg="Failed to sync Longhorn VolumeAttachment" func=controller.handleReconcileErrorLogging file="utils.go:66" LonghornVolumeAttachment=longhorn-system/longhorn-testvol-llisw1 controller=longhorn-volume-attachment error="longhorn-volume-attachment: failed to sync VolumeAttachment longhorn-system/longhorn-testvol-llisw1: Operation cannot be fulfilled on volumeattachments.longhorn.io \"longhorn-testvol-llisw1\": the object has been modified; please apply your changes to the latest version and try again" node=ip-10-0-2-36

=== VolumeAttachmentController updates volume attachment status ===

[longhorn-manager-6rjkv] time="2024-02-21T06:14:11Z" level=debug msg="[DEBUG] Updating volume attachment longhorn-testvol-llisw1 status" func="datastore.(*DataStore).UpdateLHVolumeAttachmentStatus" file="longhorn.go:4815" controller=volume-attachment-controller
[longhorn-manager-4hdqm] 10.42.1.16 - - [21/Feb/2024:06:14:11 +0000] "GET /v1/volumes/longhorn-testvol-llisw1 HTTP/1.1" 200 9717 "" "python-requests/2.25.1"

=== VolumeController requeue Volume ===

[longhorn-manager-6rjkv] time="2024-02-21T06:14:11Z" level=debug msg="Requeue volume due to error Operation cannot be fulfilled on volumes.longhorn.io \"longhorn-testvol-llisw1\": the object has been modified; please apply your changes to the latest version and try again" func="controller.(*VolumeController).syncVolume.func2" file="volume_controller.go:467" accessMode=rwo controller=longhorn-volume frontend=blockdev migratable=false node=ip-10-0-2-36 owner=ip-10-0-2-36 state=attached volume=longhorn-testvol-llisw1
[longhorn-manager-6rjkv] time="2024-02-21T06:14:11Z" level=debug msg="Ignoring recurring job labels on Volume longhorn-testvol-llisw1 PVC longhorn-testvol-llisw1-pvc due to missing source label" func="controller.(*VolumeController).syncPVCRecurringJobLabels" file="volume_controller.go:3668" controller=longhorn-volume node=ip-10-0-2-36

=== Backing up snapshot ===

[instance-manager-6f6372acb5d67390c3b9274dd68b2ab7] [longhorn-instance-manager] time="2024-02-21T06:14:11Z" level=info msg="Backing up snapshot recurrin-4f
0bfb25-df42-4c51-a923-02e15070b74a to backup backup-79bfec158ca24393" func="proxy.(*Proxy).SnapshotBackup" file="backup.go:44" dataEngine=DATA_ENGINE_V1 en
gineName=longhorn-testvol-llisw1-e-0 serviceURL="10.42.2.7:10010" volumeName=longhorn-testvol-llisw1
[longhorn-manager-4hdqm] time="2024-02-21T06:14:11Z" level=info msg="Request (user: system:serviceaccount:longhorn-system:longhorn-service-account, longhorn.io/v1beta2, Kind=Replica, namespace: longhorn-system, name: longhorn-testvol-llisw1-r-2c35a938, operation: UPDATE) patchOps: [{\"op\": \"replace\", \"path\": \"/metadata/labels\", \"value\": {\"longhorn.io/backing-image\":\"\",\"longhorndiskuuid\":\"d24b38fa-0608-425b-b08f-0c6d921952c4\",\"longhornnode\":\"ip-10-0-2-62\",\"longhornvolume\":\"longhorn-testvol-llisw1\"}},{\"op\": \"replace\", \"path\": \"/metadata/finalizers\", \"value\": [\"longhorn.io\"]}]" func="admission.(*Handler).admit" file="admission.go:115" service=admissionWebhook
[longhorn-manager-4hdqm] time="2024-02-21T06:14:11Z" level=debug msg="admit result: UPDATE longhorn.io/v1beta2, Kind=Replica longhorn-system/longhorn-testvol-llisw1-r-2c35a938 user=system:serviceaccount:longhorn-system:longhorn-service-account allowed=true err=<nil>" func="webhook.(*Router).admit" file="router.go:89"
[instance-manager-6f6372acb5d67390c3b9274dd68b2ab7] [longhorn-instance-manager] time="2024-02-21T06:14:11Z" level=info msg="Backing up recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a on tcp://10.42.3.6:10000, to s3://backupbucket@us-east-1/backupstore" func="sync.(*Task).createBackup" file="backup.go:105"
[longhorn-manager-4hdqm] time="2024-02-21T06:14:11Z" level=debug msg="admit result: UPDATE longhorn.io/v1beta2, Kind=Replica longhorn-system/longhorn-testvol-llisw1-r-2c35a938 user=system:serviceaccount:longhorn-system:longhorn-service-account allowed=true err=<nil>" func="webhook.(*Router).admit" file="router.go:89"
[instance-manager-16d650f38cc11d9112b544b831ce8876] [longhorn-testvol-llisw1-r-2c35a938] time="2024-02-21T06:14:11Z" level=info msg="Initializing backup backup-79bfec158ca24393 for volume longhorn-testvol-llisw1 snapshot recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a" func=backup.DoBackupInit file="main.go:75" pkg=backup
[instance-manager-16d650f38cc11d9112b544b831ce8876] [longhorn-testvol-llisw1-r-2c35a938] time="2024-02-21T06:14:11Z" level=info msg="Start creating backup backup-79bfec158ca24393" func=backup.DoBackupCreate file="main.go:136" pkg=backup
[instance-manager-16d650f38cc11d9112b544b831ce8876] [longhorn-testvol-llisw1-r-2c35a938] time="2024-02-21T06:14:11Z" level=info msg="Loaded driver for s3://backupbucket@us-east-1/backupstore" func=s3.initFunc file="s3.go:85" pkg=s3
[instance-manager-16d650f38cc11d9112b544b831ce8876] [longhorn-testvol-llisw1-r-2c35a938] time="2024-02-21T06:14:11Z" level=info msg="Saving config in backupstore" func=backupstore.SaveConfigInBackupStore file="config.go:81" filepath=backupstore/volumes/5d/de/longhorn-testvol-llisw1/locks/lock-d3092d9f698b4d90.lck kind=s3 object=config pkg=backupstore reason=start
[instance-manager-16d650f38cc11d9112b544b831ce8876] [longhorn-testvol-llisw1-r-2c35a938] time="2024-02-21T06:14:11Z" level=info msg="Saved config in backupstore" func=backupstore.SaveConfigInBackupStore file="config.go:92" filepath=backupstore/volumes/5d/de/longhorn-testvol-llisw1/locks/lock-d3092d9f698b4d90.lck kind=s3 object=config pkg=backupstore reason=complete
[instance-manager-16d650f38cc11d9112b544b831ce8876] [longhorn-testvol-llisw1-r-2c35a938] time="2024-02-21T06:14:11Z" level=info msg="Stored lock backupstore/volumes/5d/de/longhorn-testvol-llisw1/locks/lock-d3092d9f698b4d90.lck type 1 on backupstore" func=backupstore.saveLock file="lock.go:190" pkg=backupstore
[longhorn-manager-4hdqm] time="2024-02-21T06:14:11Z" level=info msg="Request (user: system:serviceaccount:longhorn-system:longhorn-service-account, longhorn.io/v1beta2, Kind=Replica, namespace: longhorn-system, name: longhorn-testvol-llisw1-r-b1867404, operation: UPDATE) patchOps: [{\"op\": \"replace\", \"path\": \"/metadata/labels\", \"value\": {\"longhorn.io/backing-image\":\"\",\"longhorndiskuuid\":\"f48bf0ec-2725-4cc8-802e-c502c53c8d61\",\"longhornnode\":\"ip-10-0-2-231\",\"longhornvolume\":\"longhorn-testvol-llisw1\"}},{\"op\": \"replace\", \"path\": \"/metadata/finalizers\", \"value\": [\"longhorn.io\"]}]" func="admission.(*Handler).admit" file="admission.go:115" service=admissionWebhook
[longhorn-manager-4hdqm] time="2024-02-21T06:14:11Z" level=debug msg="admit result: UPDATE longhorn.io/v1beta2, Kind=Replica longhorn-system/longhorn-testvol-llisw1-r-b1867404 user=system:serviceaccount:longhorn-system:longhorn-service-account allowed=true err=<nil>" func="webhook.(*Router).admit" file="router.go:89"
[longhorn-manager-4hdqm] time="2024-02-21T06:14:11Z" level=debug msg="admit result: UPDATE longhorn.io/v1beta2, Kind=Replica longhorn-system/longhorn-testvol-llisw1-r-b1867404 user=system:serviceaccount:longhorn-system:longhorn-service-account allowed=true err=<nil>" func="webhook.(*Router).admit" file="router.go:89"
[longhorn-manager-4hdqm] time="2024-02-21T06:14:11Z" level=info msg="Request (user: system:serviceaccount:longhorn-system:longhorn-service-account, longhorn.io/v1beta2, Kind=Replica, namespace: longhorn-system, name: longhorn-testvol-llisw1-r-67c2043c, operation: UPDATE) patchOps: [{\"op\": \"replace\", \"path\": \"/metadata/labels\", \"value\": {\"longhorn.io/backing-image\":\"\",\"longhorndiskuuid\":\"4ea76150-406f-424a-b1e0-c6f499a7d608\",\"longhornnode\":\"ip-10-0-2-36\",\"longhornvolume\":\"longhorn-testvol-llisw1\"}},{\"op\": \"replace\", \"path\": \"/metadata/finalizers\", \"value\": [\"longhorn.io\"]}]" func="admission.(*Handler).admit" file="admission.go:115" service=admissionWebhook
[longhorn-manager-4hdqm] time="2024-02-21T06:14:11Z" level=debug msg="admit result: UPDATE longhorn.io/v1beta2, Kind=Replica longhorn-system/longhorn-testvol-llisw1-r-67c2043c user=system:serviceaccount:longhorn-system:longhorn-service-account allowed=true err=<nil>" func="webhook.(*Router).admit" file="router.go:89"
[longhorn-manager-4hdqm] time="2024-02-21T06:14:11Z" level=debug msg="admit result: UPDATE longhorn.io/v1beta2, Kind=Replica longhorn-system/longhorn-testvol-llisw1-r-67c2043c user=system:serviceaccount:longhorn-system:longhorn-service-account allowed=true err=<nil>" func="webhook.(*Router).admit" file="router.go:89"
[longhorn-manager-4hdqm] time="2024-02-21T06:14:11Z" level=info msg="Request (user: system:serviceaccount:longhorn-system:longhorn-service-account, longhorn.io/v1beta2, Kind=Engine, namespace: longhorn-system, name: longhorn-testvol-llisw1-e-0, operation: UPDATE) patchOps: [{\"op\": \"replace\", \"path\": \"/metadata/labels\", \"value\": {\"longhornnode\":\"\",\"longhornvolume\":\"longhorn-testvol-llisw1\"}},{\"op\": \"replace\", \"path\": \"/metadata/finalizers\", \"value\": [\"longhorn.io\"]}]" func="admission.(*Handler).admit" file="admission.go:115" service=admissionWebhook
[longhorn-manager-4hdqm] time="2024-02-21T06:14:11Z" level=debug msg="admit result: UPDATE longhorn.io/v1beta2, Kind=Engine longhorn-system/longhorn-testvol-llisw1-e-0 user=system:serviceaccount:longhorn-system:longhorn-service-account allowed=true err=<nil>" func="webhook.(*Router).admit" file="router.go:89"
[longhorn-manager-4hdqm] time="2024-02-21T06:14:11Z" level=debug msg="admit result: UPDATE longhorn.io/v1beta2, Kind=Engine longhorn-system/longhorn-testvol-llisw1-e-0 user=system:serviceaccount:longhorn-system:longhorn-service-account allowed=true err=<nil>" func="webhook.(*Router).admit" file="router.go:89"

=== Deleting instance ?? ====

[longhorn-manager-6rjkv] time="2024-02-21T06:14:11Z" level=info msg="Deleting instance longhorn-testvol-llisw1-e-0" func="controller.(*InstanceHandler).deleteInstance" file="instance_handler.go:497"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:11Z" level=info msg="Deleting instance" func="instance.(*Server).InstanceDelete" file="instance.go:174" cleanupRequired=true dataEngine=DATA_ENGINE_V1 diskUuid= name=longhorn-testvol-llisw1-e-0 type=engine
[longhorn-manager-6rjkv] time="2024-02-21T06:14:11Z" level=info msg="Deleting engine instance" func="controller.(*EngineController).DeleteInstance" file="engine_controller.go:550" controller=longhorn-engine engine=longhorn-testvol-llisw1-e-0 node=ip-10-0-2-36
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:11Z" level=info msg="Process Manager: prepare to delete process longhorn-testvol-llisw1-e-0" func="process.(*Manager).ProcessDelete" file="process_manager.go:273"
[longhorn-manager-6rjkv] time="2024-02-21T06:14:11Z" level=info msg="Event(v1.ObjectReference{Kind:\"Engine\", Namespace:\"longhorn-system\", Name:\"longhorn-testvol-llisw1-e-0\", UID:\"ba0eb73e-97c5-411a-a239-28373e87edf5\", APIVersion:\"longhorn.io/v1beta2\", ResourceVersion:\"46368\", FieldPath:\"\"}): type: 'Normal' reason: 'Stop' Stops longhorn-testvol-llisw1-e-0" func="record.(*eventBroadcasterImpl).StartLogging.func1" file="event.go:298"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:11Z" level=info msg="Process Manager: deleted process longhorn-testvol-llisw1-e-0" func="process.(*Manager).ProcessDelete" file="process_manager.go:287"
[longhorn-manager-6rjkv] time="2024-02-21T06:14:11Z" level=info msg="Deleting instance longhorn-testvol-llisw1-e-0" func="controller.(*InstanceHandler).deleteInstance" file="instance_handler.go:497"
[longhorn-manager-6rjkv] time="2024-02-21T06:14:11Z" level=info msg="Deleting engine instance" func="controller.(*EngineController).DeleteInstance" file="engine_controller.go:550" controller=longhorn-engine engine=longhorn-testvol-llisw1-e-0 node=ip-10-0-2-36
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:11Z" level=info msg="Process Manager: trying to stop process longhorn-testvol-llisw1-e-0" func="process.(*Process).StopWithSignal.func1" file="process.go:170"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:11Z" level=info msg="Wait for process longhorn-testvol-llisw1-e-0 to shutdown" func="process.(*Process).StopWithSignal.func1" file="process.go:176"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-e-0] time="2024-02-21T06:14:11Z" level=warning msg="Received signal interrupt to shutdown" func=cmd.registerShutdown.func1 file="shutdown.go:31"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] time="2024-02-21T06:14:11Z" level=warning msg="Starting to execute registered shutdown func github.com/longhorn/longhorn-engine/app/cmd.startController.func1" func=cmd.registerShutdown.func1 file="shutdown.go:34"
[longhorn-manager-6rjkv] time="2024-02-21T06:14:11Z" level=info msg="Event(v1.ObjectReference{Kind:\"Engine\", Namespace:\"longhorn-system\", Name:\"longhorn-testvol-llisw1-e-0\", UID:\"ba0eb73e-97c5-411a-a239-28373e87edf5\", APIVersion:\"longhorn.io/v1beta2\", ResourceVersion:\"46370\", FieldPath:\"\"}): type: 'Normal' reason: 'Stop' Stops longhorn-testvol-llisw1-e-0" func="record.(*eventBroadcasterImpl).StartLogging.func1" file="event.go:298"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:11Z" level=debug msg="Process Manager: wait for process longhorn-testvol-llisw1-e-0 to shutdown before unregistering process" func="process.(*Manager).unregisterProcess.func1" file="process_manager.go:324"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-e-0] time="2024-02-21T06:14:11Z" level=info msg="Shutting down iSCSI device for target iqn.2019-10.io.longhorn:longhorn-testvol-llisw1" func=iscsidev.LogoutTarget file="iscsi.go:266"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:11Z" level=info msg="Deleting instance" func="instance.(*Server).InstanceDelete" file="instance.go:174" cleanupRequired=true dataEngine=DATA_ENGINE_V1 diskUuid= name=longhorn-testvol-llisw1-e-0 type=engine
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:11Z" level=info msg="Process Manager: prepare to delete process longhorn-testvol-llisw1-e-0" func="process.(*Manager).ProcessDelete" file="process_manager.go:273"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:11Z" level=info msg="Process Manager: deleted process longhorn-testvol-llisw1-e-0" func="process.(*Manager).ProcessDelete" file="process_manager.go:287"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:11Z" level=debug msg="Process Manager: wait for process longhorn-testvol-llisw1-e-0 to shutdown before unregistering process" func="process.(*Manager).unregisterProcess.func1" file="process_manager.go:324"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:11Z" level=info msg="Wait for process longhorn-testvol-llisw1-e-0 to shutdown" func="process.(*Process).StopWithSignal.func1" file="process.go:176"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:11Z" level=debug msg="Process Manager: wait for process longhorn-testvol-llisw1-e-0 to shutdown before unregistering process" func="process.(*Manager).unregisterProcess.func1" file="process_manager.go:324"

=== volume-attachment-controller updates volume attachment status ?? ===

[longhorn-manager-6rjkv] time="2024-02-21T06:14:11Z" level=debug msg="[DEBUG] Updating volume attachment longhorn-testvol-llisw1 status" func="datastore.(*DataStore).UpdateLHVolumeAttachmentStatus" file="longhorn.go:4815" controller=volume-attachment-controller
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:11Z" level=debug msg="Process Manager: wait for process longhorn-testvol-llisw1-e-0 to shutdown before unregistering process" func="process.(*Manager).unregisterProcess.func1" file="process_manager.go:324"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-e-0] time="2024-02-21T06:14:11Z" level=info msg="Shutting down iSCSI target iqn.2019-10.io.longhorn:longhorn-testvol-llisw1" func="iscsidev.(*Device).DeleteTarget" file="iscsi.go:336"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] lh_client_close_conn: Closing connection
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] lh_client_close_conn: Connection close complete
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-e-0] time="2024-02-21T06:14:11Z" level=warning msg="Socket server connection closed" func="socket.(*Socket).handleServerConnection" file="frontend.go:137"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-e-0] time="2024-02-21T06:14:11Z" level=info msg="device longhorn-testvol-llisw1: iSCSI device /dev/longhorn/longhorn-testvol-llisw1 shutdown" func="longhorndev.(*LonghornDevice).shutdownFrontend" file="dev.go:209"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-e-0] time="2024-02-21T06:14:11Z" level=info msg="Closing: 10.42.3.6:10000" func="remote.(*Remote).Close" file="remote.go:48"
[instance-manager-16d650f38cc11d9112b544b831ce8876] [longhorn-testvol-llisw1-r-2c35a938] time="2024-02-21T06:14:11Z" level=info msg="Closing replica" func="replica.(*Server).Close" file="server.go:309"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] time="2024-02-21T06:14:11Z" level=info msg="Monitoring stopped tcp://10.42.3.6:10000" func="controller.(*Controller).monitoring" file="control.go:1214"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-e-0] time="2024-02-21T06:14:11Z" level=info msg="Closing: 10.42.2.7:10000" func="remote.(*Remote).Close" file="remote.go:48"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-e-0] time="2024-02-21T06:14:11Z" level=info msg="Monitoring stopped tcp://10.42.2.7:10000" func="controller.(*Controller).monitoring" file="control.go:1214"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-e-0] time="2024-02-21T06:14:11Z" level=info msg="Closing: 10.42.1.10:10000" func="remote.(*Remote).Close" file="remote.go:48"
[instance-manager-6f6372acb5d67390c3b9274dd68b2ab7] [longhorn-testvol-llisw1-r-b1867404] time="2024-02-21T06:14:11Z" level=info msg="Closing replica" func="replica.(*Server).Close" file="server.go:309"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] time="2024-02-21T06:14:11Z" level=info msg="Monitoring stopped tcp://10.42.1.10:10000" func="controller.(*Controller).monitoring" file="control.go:1214"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-r-67c2043c] time="2024-02-21T06:14:11Z" level=info msg="Closing replica" func="replica.(*Server).Close" file="server.go:309"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:11Z" level=info msg="Process Manager: process longhorn-testvol-llisw1-e-0 stopped" func="process.(*Process).Start.func1" file="process.go:78"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:11Z" level=info msg="Process Manager: successfully unregistered process longhorn-testvol-llisw1-e-0" func="process.(*Manager).unregisterProcess.func1" file="process_manager.go:344"
[longhorn-manager-6rjkv] time="2024-02-21T06:14:11Z" level=debug msg="Ignoring recurring job labels on Volume longhorn-testvol-llisw1 PVC longhorn-testvol-llisw1-pvc due to missing source label" func="controller.(*VolumeController).syncPVCRecurringJobLabels" file="volume_controller.go:3668" controller=longhorn-volume node=ip-10-0-2-36
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:11Z" level=info msg="Process Manager: successfully unregistered process longhorn-testvol-llisw1-e-0" func="process.(*Manager).unregisterProcess.func1" file="process_manager.go:344"

=== SnapshotController deletes volume attachment ===

[longhorn-manager-6rjkv] time="2024-02-21T06:14:11Z" level=debug msg="[DEBUG] handleAttachmentTicketDeletion for snapshot recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a" func="controller.(*SnapshotController).reconcile.func3" file="snapshot_controller.go:441" alreadyCreatedBefore=true requestCreateNewSnapshot=true
[longhorn-manager-6rjkv] time="2024-02-21T06:14:11Z" level=debug msg="Ignoring recurring job labels on Volume longhorn-testvol-llisw1 PVC longhorn-testvol-llisw1-pvc due to missing source label" func="controller.(*VolumeController).syncPVCRecurringJobLabels" file="volume_controller.go:3668" controller=longhorn-volume node=ip-10-0-2-36
[longhorn-manager-4hdqm] time="2024-02-21T06:14:11Z" level=info msg="Request (user: system:serviceaccount:longhorn-system:longhorn-service-account, longhorn.io/v1beta2, Kind=Replica, namespace: longhorn-system, name: longhorn-testvol-llisw1-r-67c2043c, operation: UPDATE) patchOps: [{\"op\": \"replace\", \"path\": \"/metadata/labels\", \"value\": {\"longhorn.io/backing-image\":\"\",\"longhorndiskuuid\":\"4ea76150-406f-424a-b1e0-c6f499a7d608\",\"longhornnode\":\"ip-10-0-2-36\",\"longhornvolume\":\"longhorn-testvol-llisw1\"}},{\"op\": \"replace\", \"path\": \"/metadata/finalizers\", \"value\": [\"longhorn.io\"]}]" func="admission.(*Handler).admit" file="admission.go:115" service=admissionWebhook
[longhorn-manager-4hdqm] time="2024-02-21T06:14:11Z" level=debug msg="admit result: UPDATE longhorn.io/v1beta2, Kind=Replica longhorn-system/longhorn-testvol-llisw1-r-67c2043c user=system:serviceaccount:longhorn-system:longhorn-service-account allowed=true err=<nil>" func="webhook.(*Router).admit" file="router.go:89"
[longhorn-manager-4hdqm] time="2024-02-21T06:14:11Z" level=debug msg="admit result: UPDATE longhorn.io/v1beta2, Kind=Replica longhorn-system/longhorn-testvol-llisw1-r-67c2043c user=system:serviceaccount:longhorn-system:longhorn-service-account allowed=true err=<nil>" func="webhook.(*Router).admit" file="router.go:89"
[longhorn-manager-6rjkv] time="2024-02-21T06:14:11Z" level=info msg="Deleting instance longhorn-testvol-llisw1-r-67c2043c" func="controller.(*InstanceHandler).deleteInstance" file="instance_handler.go:497"
[longhorn-manager-6rjkv] time="2024-02-21T06:14:11Z" level=info msg="Deleting replica instance" func="controller.(*ReplicaController).DeleteInstance" file="replica_controller.go:555" controller=longhorn-replica node=ip-10-0-2-36 nodeID=ip-10-0-2-36 ownerID=ip-10-0-2-36 replica=longhorn-testvol-llisw1-r-67c2043c
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:11Z" level=info msg="Deleting instance" func="instance.(*Server).InstanceDelete" file="instance.go:174" cleanupRequired=true dataEngine=DATA_ENGINE_V1 diskUuid=4ea76150-406f-424a-b1e0-c6f499a7d608 name=longhorn-testvol-llisw1-r-67c2043c type=replica
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:11Z" level=info msg="Process Manager: prepare to delete process longhorn-testvol-llisw1-r-67c2043c" func="process.(*Manager).ProcessDelete" file="process_manager.go:273"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:11Z" level=info msg="Process Manager: deleted process longhorn-testvol-llisw1-r-67c2043c" func="process.(*Manager).ProcessDelete" file="process_manager.go:287"
[longhorn-manager-6rjkv] time="2024-02-21T06:14:11Z" level=info msg="Event(v1.ObjectReference{Kind:\"Replica\", Namespace:\"longhorn-system\", Name:\"longhorn-testvol-llisw1-r-67c2043c\", UID:\"3495b15f-d79c-49b1-9133-ae6ecaf21e6a\", APIVersion:\"longhorn.io/v1beta2\", ResourceVersion:\"46376\", FieldPath:\"\"}): type: 'Normal' reason: 'Stop' Stops longhorn-testvol-llisw1-r-67c2043c" func="record.(*eventBroadcasterImpl).StartLogging.func1" file="event.go:298"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:11Z" level=info msg="Process Manager: trying to stop process longhorn-testvol-llisw1-r-67c2043c" func="process.(*Process).StopWithSignal.func1" file="process.go:170"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:11Z" level=info msg="Wait for process longhorn-testvol-llisw1-r-67c2043c to shutdown" func="process.(*Process).StopWithSignal.func1" file="process.go:176"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:11Z" level=debug msg="Process Manager: wait for process longhorn-testvol-llisw1-r-67c2043c to shutdown before unregistering process" func="process.(*Manager).unregisterProcess.func1" file="process_manager.go:324"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-testvol-llisw1-r-67c2043c] time="2024-02-21T06:14:11Z" level=warning msg="Received signal interrupt to shutdown" func=cmd.registerShutdown.func1 file="shutdown.go:31"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] time="2024-02-21T06:14:11Z" level=warning msg="Starting to execute registered shutdown func github.com/longhorn/longhorn-engine/app/cmd.startReplica.func4" func=cmd.registerShutdown.func1 file="shutdown.go:34"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:11Z" level=info msg="Process Manager: process longhorn-testvol-llisw1-r-67c2043c stopped" func="process.(*Process).Start.func1" file="process.go:78"
[longhorn-manager-6rjkv] time="2024-02-21T06:14:11Z" level=info msg="Stopping monitoring engine" func="controller.(*EngineMonitor).Run.func1" file="engine_controller.go:795" controller=longhorn-engine engine=longhorn-testvol-llisw1-e-0 node=ip-10-0-2-36
[longhorn-manager-6rjkv] time="2024-02-21T06:14:11Z" level=debug msg="Requeue engine due to conflict" func="controller.(*EngineController).syncEngine.func2" file="engine_controller.go:308" controller=longhorn-engine engine=longhorn-testvol-llisw1-e-0 error="Operation cannot be fulfilled on engines.longhorn.io \"longhorn-testvol-llisw1-e-0\": the object has been modified; please apply your changes to the latest version and try again" node=ip-10-0-2-36
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:11Z" level=info msg="Process Manager: successfully unregistered process longhorn-testvol-llisw1-r-67c2043c" func="process.(*Manager).unregisterProcess.func1" file="process_manager.go:344"
[longhorn-manager-4hdqm] time="2024-02-21T06:14:11Z" level=info msg="Request (user: system:serviceaccount:longhorn-system:longhorn-service-account, longhorn.io/v1beta2, Kind=Replica, namespace: longhorn-system, name: longhorn-testvol-llisw1-r-2c35a938, operation: UPDATE) patchOps: [{\"op\": \"replace\", \"path\": \"/metadata/labels\", \"value\": {\"longhorn.io/backing-image\":\"\",\"longhorndiskuuid\":\"d24b38fa-0608-425b-b08f-0c6d921952c4\",\"longhornnode\":\"ip-10-0-2-62\",\"longhornvolume\":\"longhorn-testvol-llisw1\"}},{\"op\": \"replace\", \"path\": \"/metadata/finalizers\", \"value\": [\"longhorn.io\"]}]" func="admission.(*Handler).admit" file="admission.go:115" service=admissionWebhook
[longhorn-manager-4hdqm] time="2024-02-21T06:14:11Z" level=debug msg="admit result: UPDATE longhorn.io/v1beta2, Kind=Replica longhorn-system/longhorn-testvol-llisw1-r-2c35a938 user=system:serviceaccount:longhorn-system:longhorn-service-account allowed=true err=<nil>" func="webhook.(*Router).admit" file="router.go:89"
[longhorn-manager-4hdqm] time="2024-02-21T06:14:11Z" level=debug msg="admit result: UPDATE longhorn.io/v1beta2, Kind=Replica longhorn-system/longhorn-testvol-llisw1-r-2c35a938 user=system:serviceaccount:longhorn-system:longhorn-service-account allowed=true err=<nil>" func="webhook.(*Router).admit" file="router.go:89"
[longhorn-manager-6rjkv] time="2024-02-21T06:14:11Z" level=info msg="Deleting instance longhorn-testvol-llisw1-r-67c2043c" func="controller.(*InstanceHandler).deleteInstance" file="instance_handler.go:497"
[longhorn-manager-4hdqm] time="2024-02-21T06:14:11Z" level=info msg="Deleting instance longhorn-testvol-llisw1-r-2c35a938" func="controller.(*InstanceHandler).deleteInstance" file="instance_handler.go:497"
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:11Z" level=info msg="Deleting instance" func="instance.(*Server).InstanceDelete" file="instance.go:174" cleanupRequired=true dataEngine=DATA_ENGINE_V1 diskUuid=4ea76150-406f-424a-b1e0-c6f499a7d608 name=longhorn-testvol-llisw1-r-67c2043c type=replica
[longhorn-manager-6rjkv] time="2024-02-21T06:14:11Z" level=info msg="Deleting replica instance" func="controller.(*ReplicaController).DeleteInstance" file="replica_controller.go:555" controller=longhorn-replica node=ip-10-0-2-36 nodeID=ip-10-0-2-36 ownerID=ip-10-0-2-36 replica=longhorn-testvol-llisw1-r-67c2043c
[instance-manager-16d650f38cc11d9112b544b831ce8876] [longhorn-instance-manager] time="2024-02-21T06:14:11Z" level=info msg="Deleting instance" func="instance.(*Server).InstanceDelete" file="instance.go:174" cleanupRequired=true dataEngine=DATA_ENGINE_V1 diskUuid=d24b38fa-0608-425b-b08f-0c6d921952c4 name=longhorn-testvol-llisw1-r-2c35a938 type=replica
[instance-manager-cb55d580e975e0468c1ef0cede779ca8] [longhorn-instance-manager] time="2024-02-21T06:14:11Z" level=info msg="Process Manager: prepare to delete process longhorn-testvol-llisw1-r-67c2043c" func="process.(*Manager).ProcessDelete" file="process_manager.go:273"
[longhorn-manager-6rjkv] time="2024-02-21T06:14:11Z" level=info msg="Event(v1.ObjectReference{Kind:\"Replica\", Namespace:\"longhorn-system\", Name:\"longhorn-testvol-llisw1-r-67c2043c\", UID:\"3495b15f-d79c-49b1-9133-ae6ecaf21e6a\", APIVersion:\"longhorn.io/v1beta2\", ResourceVersion:\"46378\", FieldPath:\"\"}): type: 'Normal' reason: 'Stop' Stops longhorn-testvol-llisw1-r-67c2043c" func="record.(*eventBroadcasterImpl).StartLogging.func1" file="event.go:298"
[longhorn-manager-4hdqm] time="2024-02-21T06:14:11Z" level=info msg="Deleting replica instance" func="controller.(*ReplicaController).DeleteInstance" file="replica_controller.go:555" controller=longhorn-replica node=ip-10-0-2-62 nodeID=ip-10-0-2-62 ownerID=ip-10-0-2-62 replica=longhorn-testvol-llisw1-r-2c35a938
[instance-manager-16d650f38cc11d9112b544b831ce8876] [longhorn-instance-manager] time="2024-02-21T06:14:11Z" level=info msg="Process Manager: prepare to delete process longhorn-testvol-llisw1-r-2c35a938" func="process.(*Manager).ProcessDelete" file="process_manager.go:273"
[longhorn-manager-4hdqm] time="2024-02-21T06:14:11Z" level=info msg="Event(v1.ObjectReference{Kind:\"Replica\", Namespace:\"longhorn-system\", Name:\"longhorn-testvol-llisw1-r-2c35a938\", UID:\"b9ce97aa-0d23-49fa-83ac-5217cb6c205f\", APIVersion:\"longhorn.io/v1beta2\", ResourceVersion:\"46379\", FieldPath:\"\"}): type: 'Normal' reason: 'Stop' Stops longhorn-testvol-llisw1-r-2c35a938" func="record.(*eventBroadcasterImpl).StartLogging.func1" file="event.go:298"
[instance-manager-16d650f38cc11d9112b544b831ce8876] [longhorn-instance-manager] time="2024-02-21T06:14:11Z" level=info msg="Process Manager: deleted process longhorn-testvol-llisw1-r-2c35a938" func="process.(*Manager).ProcessDelete" file="process_manager.go:287"
[instance-manager-16d650f38cc11d9112b544b831ce8876] [longhorn-instance-manager] time="2024-02-21T06:14:11Z" level=info msg="Process Manager: trying to stop process longhorn-testvol-llisw1-r-2c35a938" func="process.(*Process).StopWithSignal.func1" file="process.go:170"
[instance-manager-16d650f38cc11d9112b544b831ce8876] [longhorn-instance-manager] time="2024-02-21T06:14:11Z" level=debug msg="Process Manager: wait for process longhorn-testvol-llisw1-r-2c35a938 to shutdown before unregistering process" func="process.(*Manager).unregisterProcess.func1" file="process_manager.go:324"

=== Cannot take snapshot backup because the volume is detached ===

[longhorn-manager-nj822] time="2024-02-21T06:14:11Z" level=warning msg="Cannot take snapshot backup" func=engineapi.NewBackupMonitor file="backup_monitor.go:90" backup=backup-79bfec158ca24393 controller=longhorn-backup error="proxyServer=10.42.1.10:8501 destination=10.42.2.7:10010: failed to backup snapshot recurrin-4f0bfb25-df42-4c51-a923-02e15070b74a to backup-79bfec158ca24393: rpc error: code = Internal desc = failed to create backup: failed to create backup to s3://backupbucket@us-east-1/backupstore for volume longhorn-testvol-llisw1: rpc error: code = Unavailable desc = error reading from server: EOF" node=ip-10-0-2-231

Based on the log, there seems to be a potential scenario of race condition where the snapshot ticket is removed from the volume attachment, resulting in the detachment of the volume (initiated by the snapshot controller), even as the backup process has already been initiated by the recurring job.

If true, then the commits in RC1 might not be directly linked to the test failure. However, they could potentially impact the processing speed, thereby triggering the condition.

I will experiment and gather more logs to confirm the case.

@yangchiu is this issue also happening in other branch releases?

According to https://github.com/longhorn/longhorn/issues/7937#issuecomment-1947706261, it doesn’t. Since it’s happening from Feb. 7, the only affected release is v1.5.4.

Verified passed on master-head (longhorn-manager 917762ab) by running test case test_recurring_jobs_when_volume_detached_unexpectedly 100 times.

Test results: https://ci.longhorn.io/job/private/job/longhorn-tests-regression/6409/ ~ https://ci.longhorn.io/job/private/job/longhorn-tests-regression/6413/

@c3y1huang Please help drive the tickets here to be ready for testing. Thanks.

Hi Jack, the race here is when volume.spec.nodeID is cleared, volume controller start detaching the volume but not reach the step of setting the engine.spec.desirestate to stop yet.

Now we can turn our investigation to why it is happening only in recent Longhorn version. Is it possible that there a bigger delay between volume.spec.nodeID is cleared and engine.spec.desirestate state is set to stop in recent Longhorn version?

Proposed solution:

Add a recurring job volume-attachement-ticket for the job. This should prevent unnecessary intermediate detachment during snapshot and backup operations triggered by the recurring backup job.

Looks like I got different error on Longhorn v1.5.4-rc3: proxyServer=10.42.191.99:8501 destination=10.42.191.99:10010: failed to backup snapshot c-00cx94-c844d1f7-a38b-4151-8058-e77e9b2f422e to backup-dfeb5dad14aa4e6d: rpc error: code = Unknown desc = failed to create backup to for volume testvol: rpc error: code = Unknown desc = missing input parameter

Error logs from instance manager: 2024-02-21T23:04:13.127508386Z time="2024-02-21T23:04:13Z" level=error msg="Failed to initialize backup backup-55ef929c06664af9" func="rpc.(*SyncAgentServer).BackupCreate" file="server.go:714" error="missing input parameter"

@mantissahz Please help check this .