forked from cms-sw/cmssw
-
Notifications
You must be signed in to change notification settings - Fork 5
Open
Labels
Description
While running the CPU profiling workflow (customizePixelTracksSoAonCPUForProfiling()) on 11_0_0_pre7_Patatrack at NERSC, I got an assertion failure
Begin processing the 3901st record. Run 321177, Event 188714878, LumiSection 142 on stream 13 at 20-Sep-2019 20:12:57.849 PDT
RecoPixelVertexing/PixelTriplets/plugins/CAHitNtupletGeneratorKernelsImpl.h:320: void kernel_countMultiplicity(const HitContainer*, const Quality*, CAConstants::TupleMultiplicity*): Assertion `nhits < 8' failed.
wrong mult 347 -1412
...
Thread 76 (Thread 0x2aaebc280700 (LWP 4401)):
...
#4 <signal handler called>
#5 0x00002aaaad63f207 in raise () from /lib64/libc.so.6
#6 0x00002aaaad6408f8 in abort () from /lib64/libc.so.6
#7 0x00002aaaad638026 in __assert_fail_base () from /lib64/libc.so.6
#8 0x00002aaaad6380d2 in __assert_fail () from /lib64/libc.so.6
#9 0x00002aab9a99280d in CAHitNtupletGeneratorKernels<cudaCompat::CPUTraits>::launchKernels(TrackingRecHit2DHeterogeneous<cudaCompat::CPUTraits> const&, TrackSoAT<32768>*, CUstream_st*) () from .../CMSSW_11_0_0_pre7_Patatrack/lib/slc7_amd64_gcc820/pluginRecoPixelVertexingPixelTripletsPlugins.so
#10 0x00002aab9a947413 in CAHitNtupletGeneratorOnGPU::makeTuples(TrackingRecHit2DHeterogeneous<cudaCompat::CPUTraits> const&, float) const () from .../cmssw/CMSSW_11_0_0_pre7_Patatrack/lib/slc7_amd64_gcc820/pluginRecoPixelVertexingPixelTripletsPlugins.so
#11 0x00002aab9a993b99 in CAHitNtupletCUDA::produce(edm::StreamID, edm::Event&, edm::EventSetup const&) const () from .../CMSSW_11_0_0_pre7_Patatrack/lib/slc7_amd64_gcc820/pluginRecoPixelVertexingPixel
TripletsPlugins.so
...
when running on 64 streams/threads. This failure occurred only once though during my tests on 4x{1, 16, 32}, 10x64, 4x{1, 20, 40}, and 10x80 streams/threads, but I thought to report it anyway ("NxM" meaning "N runs of M streams/threads").