cmssw: HLT farm crash in Run 379530

Reporting a crash in Run 379530

Here’s the recipe how to reproduce the crashes (tested with CMSSW_14_0_5_patch1 on lxplus8-gpu):

cmsrel CMSSW_14_0_5_patch1
cd CMSSW_14_0_5_patch1/src
cmsenv

the n prepare the reproducer as:

#!/bin/bash -ex

# CMSSW_14_0_5_patch1

hltGetConfiguration run:379530 \
  --globaltag 140X_dataRun3_HLT_v3 \
  --data \
  --no-prescale \
  --no-output \
  --max-events -1 \
  --input /store/group/tsg/FOG/debug/2024-04-17_run379530/run379530_ls0556.root  > hlt.py
  
cat <<@EOF >> hlt.py
process.options.wantSummary = True

process.options.numberOfThreads = 1
process.options.numberOfStreams = 0
@EOF

cmsRun hlt.py &> hlt.log

In the log, we will find the following message:

----- Begin Fatal Exception 17-Apr-2024 12:13:39 CEST-----------------------
An exception of category 'StdException' occurred while
   [0] Processing  Event run: 379530 lumi: 556 event: 1073222806 stream: 0
   [1] Running path 'DST_PFScouting_DoubleMuon_v1'
   [2] Calling method for module MuonHLTEcalPFClusterIsolationProducer/'hltMuonEcalMFPFClusterIsoForMuons'
Exception Message:
A std::exception was thrown.
vector::_M_range_check: __n (which is 18446744073709551615) >= this->size() (which is 2)
----- End Fatal Exception -------------------------------------------------

@cms-sw/hlt-l2 FYI @cms-sw/muon-pog-l2 FYI (sorry, tagged the DPG by mistake)

About this issue

  • Original URL
  • State: closed
  • Created 2 months ago
  • Comments: 23 (23 by maintainers)

Most upvoted comments

+hlt

unassign reconstruction

  • I realize now that the code is actually only within HLT purview - sorry for the noise @mandrenguyen

i mean since Marco have it set up already, may be he can do the PR.

OK, here’s a minimal fix: https://github.com/cms-sw/cmssw/pull/44764.

Would you like reco to make a PR or you’ll take care of it?

feel free to make the PR (provided @cms-sw/egamma-pog-l2 agrees). I am wondering though - since these classes were not touched in a long time, if the triggering factor (candidate eta=0) is just a matter of chance or there is some other underlying problem upstream.

Ok, since (current and former) egamma experts are reacting in this thread, I’ll leave it in their hands, unless someone asks me to do something 😃

assign reconstruction

assign hlt