feat: AdePT physics list plugin for e+/e-/gamma offload to GPU [WIP] by wdconinc · Pull Request #1606 · AIDASoft/DD4hep

wdconinc · 2026-04-11T17:14:36Z

BEGINRELEASENOTES

feat: AdePT physics list plugin for e+/e-/gamma offload to GPU

ENDRELEASENOTES

This PR adds an AdePT physics list plugin for DD4hep (similar to the celeritas physics list plugin in https://github.com/celeritas-project/celeritas/).

Notes on AdePT integration approach:

AdePT is added as a Geant4AdePTPhysics action, and must be added with a helper function to setupUserPhysics in DDsim. This is added through the steering file.
The use of callUserTrackingAction=true is required, since we need a Geant4AdePTUserParticleHandler to 'repair' the track/particle after it comes back from the GPU to the CPU. This is also added by the steering file.
The rest of the example steering file only serves to make it self-consistent and runnable, but does not have AdePT functionality.

Notes on DD4hep core changes:

SiD.xml gets an EcalRegion for testing.
In Geant4ParticleHandler, the loop over user handlers is moved earlier to allow them to 'repair' m_currTrack:
https://github.com/wdconinc/DD4hep/blob/a70d61afc812dc027990f29f8a351274fae11d59/DDG4/src/Geant4ParticleHandler.cpp#L349-L350
To accept the new AdePT particle handler, we allow in DDSim/Helper/ParticleHandler.py‎ the passthrough of a generic particle handler.

MarkusFrankATcernch

We do not want to make plugin headers public.
If necessary, we can put the header in a public directory and define the factory instance elsewhere....
There is a good reason to separate plugins from code.

github-actions · 2026-04-11T18:11:04Z

Test Results

18 files 18 suites 6h 13m 49s ⏱️
357 tests 354 ✅ 0 💤 3 ❌
3 143 runs 3 116 ✅ 0 💤 27 ❌

For more details on these failures, see this check.

Results for commit a70d61a.

♻️ This comment has been updated with latest results.

wdconinc · 2026-04-11T18:24:29Z

We do not want to make plugin headers public.
If necessary, we can put the header in a public directory and define the factory instance elsewhere....
There is a good reason to separate plugins from code.

Yes, I noticed when filing the PR that the unrelated DDCore change snuck in. I'll remove it when I am back at a computer.

andresailer · 2026-04-13T08:47:13Z

cc @SeverinDiederichs (FYI)

When AdePT's callUserTrackingAction=false (the default for performance), GPU-produced hits and hadronic secondaries carry trackID/parentID=0 from the dummy HostTrackData. This caused two classes of errors: 1. 'No Equivalent particle for track:0' (from Geant4ParticleMap::particleID) GPU hits have trackID=0, and when Geant4Output2ROOT tries to remap them to final particle IDs, it calls particleID(0) which fails. Fix: in Geant4ParticleHandler::endEvent(), after rebaseSimulatedTracks, add m_equivalentTracks[0] pointing to the primary particle (g4id=1) so that all GPU hits with dummy trackID=0 are correctly attributed. 2. Hadronic secondaries returned from GPU with parentID=0 breaking the MC truth parent chain. Fix: Geant4AdePTUserParticleHandler::begin() remaps particle.g4Parent from 0 to the entering primary's G4 track ID. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

wdconinc · 2026-04-13T20:59:49Z

Together with apt-sim/AdePT#546 this now offloads tracks to my (very crappy) GPU.

ddsim --steeringFile DDG4/examples/AdePTSteeringFile.py --compactFile /opt/local/DDDetectors/compact/SiD.xml

…ePTPhysics LastNParticlesOnCPU: when the in-flight count drops below this threshold the remaining particles are leaked back to Geant4/HepEm on CPU, terminating the GPU transport loop early. Setting this to a small value (e.g. 10-100) avoids launching many near-empty kernels during the long shower tail. Default 0 preserves the previous behaviour (always finish on GPU). SpeedOfLight: debug/benchmark mode that kills all e-/e+/gamma immediately without tracking them (equivalent to setting their mean free path to zero). Useful for measuring geometry or non-EM overhead in isolation. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Document and expose the property in the example steering file with a comment explaining its effect on GPU kernel launch efficiency. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

In ddsim-mt (PR#1240), Geant4ParticleHandler is created inside __setupGeneratorActions, which runs as a UserInitialization callback during G4RunManager::Initialize() -- after setupPhysics has returned. The previous approach of looking up the handler via kernel.generatorAction().get('ParticleHandler') inside setup_physics no longer works: the object either doesn't exist yet or its adopt() method is not available on the returned wrapper. The FIXME in ParticleHandler.py noted that setupUserParticleHandler was not extensible: it hardcoded only Geant4TCUserParticleHandler and Geant4TVUserParticleHandler and called exit(1) for anything else. Add an 'else' branch that supports arbitrary DDG4 action plugin class names: create the action and call part.adopt(user) without any special tracker-region configuration. This allows plugins such as Geant4AdePTUserParticleHandler to be registered simply via: runner.part.userParticleHandler = "Geant4AdePTUserParticleHandler" Update AdePTSteeringFile.py to use this clean mechanism in place of the monkey-patch workaround introduced in the previous commit. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Co-authored-by: Juan Miguel Carceller <jmcarcell@users.noreply.github.com>

Co-authored-by: sss <sss@karma>

Halton sequences are low-discrepancy sequences that fill phase space with faster variance reduction (1/N) than standard uniform point picking with PRNGs (1/sqrt(N)). It's not cheating statistics since you lose the Poisson statistical properties between two consecutive events. This technique is often referred to as RQMC, randomized quasi-Monte Carlo. This adds scrambled Halton sequence support to the isotrope generators (where inter-event statistics are not considered since they don't represent real experimental running conditions). The scrambling uses Cranley-Patterson rotation, which is sufficient to remove correlations in three dimensional phase space sampling. The sequences are scrambled with the random seed, so different runs with different seeds will produce different sequences. This also then allows statistical treatment to determine the errors on aggregate quantities (see note in ddsim help). The various distributions are modified to take a sampler function that can either use PRNG or Halton sequences. For FFbar this is not possible since it uses an accept/reject algorithm that only works for PRNG.

Add basic MT functionality tests with 1, 2, and 4 threads, file-based generator tests (HepMC3, EDM4hep), and a comparison script framework for validating ST vs MT equivalence. Tests verify: - MT mode runs without crashes - Different thread counts (1, 2, 4) work correctly - File-based input generators work in MT mode - Backward compatibility with -j 1 (single-threaded) Fix double-save bug in EDM4hep/LCIO/ROOT output for ST mode In single-threaded mode, events were saved twice because setupEDM4hepOutput/setupLCIOOutput/setupROOTOutput hardcoded shared=True. Fixed by making the shared flag conditional on NumberOfThreads > 1. Fix SIGSEGV crash in MT mode: make EventSeeder shared setupEventSeeder() was called once per worker thread, creating multiple EventSeeder instances with shared=False. During cleanup this caused conflicts/double-free leading to SIGSEGV. Fixed by creating EventSeeder with shared=True (one instance shared across all workers) and guarding against duplicate creation. Add tests for G4Gun and GPS with macroFile These tests document that G4Gun and GPS with macroFile work in ST mode but not in MT mode (macros execute during global init before worker threads exist). Generator setup is guarded with numberOfThreads == 1. fix: additional DDTest changes

Fixes heap corruption and SIGSEGV crashes when using ROOT output in multi-threaded mode. Root cause: Multiple Geant4 worker threads were accessing ROOT I/O objects concurrently. ROOT's I/O system is not thread-safe by default, causing heap corruption during multi-threaded writes that manifested during exit in TFile::WriteStreamerInfo / TROOT::CloseFiles. Changes: 1. Call ROOT.EnableThreadSafety() before any ROOT objects are created when numberOfThreads > 1 (MT mode). 2. Add static std::mutex s_rootMutex to Geant4Output2ROOT and protect all ROOT I/O operations with std::lock_guard: - commit(): TTree::Fill() and branch operations - closeOutput(): file Write() and Close() - beginRun(): file creation and opening - fill(): branch Fill() operations The mutex ensures full serialization of ROOT I/O across all worker threads, preventing concurrent access to TFile/TTree/TBranch objects even with ROOT::EnableThreadSafety() in place. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

As a defense-in-depth measure alongside the primary fix in AdePT's HostTrackDataMapper (cpuAncestorG4id propagation), force any track whose G4 track ID is in the GPU-assigned range (>= INT_MAX/2, counting down from INT_MAX) to have G4PARTICLE_ABOVE_ENERGY_THRESHOLD set in particle.reason. This ensures such tracks enter the m_particleMap if-branch in Geant4ParticleHandler::end() rather than the else-branch that walks the parent chain and emits 'FATAL: No real particle parent present' when the chain is broken by an unregistered GPU-assigned parent ID. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

When AdePT returns GPU-processed tracks to CPU, the returned tracks carry GPU-assigned track IDs (counting down from INT_MAX). When G4HepEm then handles these tracks and produces hadronic secondaries (e.g. photo-nuclear products W182/W183/W184/W186, neutrons, protons) inline -- before the GPU track PostUserTrackingAction fires -- those secondaries have a GPU-range parentID that is not yet registered in m_particleMap/m_equivalentTracks, causing "FATAL: No real particle parent present" errors. Fix in Geant4AdePTUserParticleHandler: - In begin(): if track->GetParentID() is GPU-range, resolve g4Parent by looking up the parent in m_trackCache (populated when the GPU parent itself began). This replaces the GPU parent ID with the CPU ancestor ID. - In end() fallback: if track->GetParentID() is GPU-range, apply the same cache-based resolution rather than blindly using GetParentID(), which would undo the begin()-time fix for hadronic secondaries not in m_trackCache. - In end(): GPU-assigned track IDs are forced into m_particleMap (if-branch) via G4PARTICLE_ABOVE_ENERGY_THRESHOLD so they are always registered. - Update cache on end() instead of erasing, so a second end() call (for tracks that re-enter the GPU region) can still restore correct state. Fix in Geant4ParticleHandler (core, minimal): - Add cycle detection (std::set<int> visited) in the m_equivalentTracks walk in end() and rebaseSimulatedTracks() to prevent infinite loops if a self-referential entry is created by any remaining edge case. Also update AdePTSteeringFile.py to use adequate slot sizes (10M) for realistic simulation. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

wdconinc · 2026-04-16T21:05:12Z

(Copilot got ahead of itself, merged ddsim-mt into this, and pushed it up. I'll revert this.)

wdconinc added 2 commits April 9, 2026 16:54

feat: AdePT physics list plugin for e+/e-/gamma

fdeae3c

fix: lower slots; nthread fixes; correct AdePT::G4Integration linkage

543eaa0

wdconinc mentioned this pull request Apr 11, 2026

g4adept: new package spack/spack-packages#4150

Merged

2 tasks

MarkusFrankATcernch reviewed Apr 11, 2026

View reviewed changes

fix(cmake): revert the core plugin header installation

13a683d

wdconinc and others added 5 commits April 14, 2026 10:34

feat: add EcalRegion to SiD (EcalBarrel, EcalEndcap)

ec5a3cd

feat: add complete set of properties to example

3cda7bb

DDG4: add LastNParticlesOnCPU to AdePTSteeringFile.py

86882b1

Document and expose the property in the example steering file with a comment explaining its effect on GPU kernel launch efficiency. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

wdconinc force-pushed the adept branch from 4d7446e to eee1577 Compare April 14, 2026 20:20

wdconinc added 4 commits April 15, 2026 11:46

fix: always use CallUserTrackingAction=true

1bc9ebe

fix: use both Geant4AdePTUserParticleHandler::begin/end

5af01ad

fix: mv user end calls before reason use in Geant4ParticleHandler

fd92545

fix: rm unreachable code now that callUserTrackingAction=true

06dc416

wdconinc force-pushed the adept branch from 62336b6 to 06dc416 Compare April 15, 2026 17:01

wdconinc and others added 9 commits April 15, 2026 12:03

fix: mv PropertyMask ref handler too

a70d61a

Fix undefined behaviour in the BitFieldCoder (AIDASoft#1605)

9ae25d7

Co-authored-by: Juan Miguel Carceller <jmcarcell@users.noreply.github.com>

Update the checkout action since v4 uses a deprecated version of node

8810733

Fix typo in error message. (AIDASoft#1609)

cdbf59d

Co-authored-by: sss <sss@karma>

fix: remove HaltonSeed references

924b774

fix: allow for multiplicity > 1 in help string

fa8e23f

fix: use textwrap.dedent on triple-quoted docstring

a996663

flake8: remove whitespace

658c8c5

wdconinc and others added 29 commits April 15, 2026 12:14

ddsim: use instance methods

6bca3bf

ddsim: move output into setupWorker

c1bb10c

ddsim: remove geant4.terminate since geant4.run does all steps

bfff6e3

ddsim: fixup __setupWorker args when single-threaded

72fc879

ddsim: re-add removed cosmetics

84c3af5

ddsim: fix flake8 indendation

fcc4886

ddsim: fix flake8 unused variable

3a9707b

ddsim: restore original MagFieldTrackingSetup

72fe94a

ddsim: fix flake8 indentation

a41d134

PyDDG4: run -> runAll; add configure, initialize, run, terminate

e1b651d

ddsim: restore separation of configure, initialize from run, terminate

9be291c

ddsim: add each thread's g4gun and g4gps to _g4gun, _g4gun

d675f36

ddsim: readd exitCode

f4566c6

ddsim: use shared generator action for hepmc3

3007e11

ddsim: only set generator shared when G4MTRunManager

d4228be

ddsim: use shared-access-if-MT for all generators with input file

f2ec9eb

Remove elif leading to recursion and change shared check strategy

e39d296

fix: store generationInit in self to pass numberOfEvents

627c318

fix(test): g4gps, g4gun off-center, not along z

a010808

fix: flake8

b794d20

Allowing not passing --runType=batch with also G4Gun and G4GPS

b4cf875

Call endRun in Geant4Output2ROOT when all the threads are done

7759e7e

Load libDDG4IO in the comparison script

b540ab7

Fix comparison to avoid CPPYY-related errors

58e72db

Use a thread-local Geant4Random and make sure "G4EventID" is filled

2c515ed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: AdePT physics list plugin for e+/e-/gamma offload to GPU [WIP]#1606

feat: AdePT physics list plugin for e+/e-/gamma offload to GPU [WIP]#1606
wdconinc wants to merge 58 commits intoAIDASoft:masterfrom
wdconinc:adept

wdconinc commented Apr 11, 2026 •

edited

Loading

Uh oh!

MarkusFrankATcernch left a comment •

edited

Loading

Uh oh!

github-actions bot commented Apr 11, 2026 •

edited

Loading

Uh oh!

wdconinc commented Apr 11, 2026

Uh oh!

andresailer commented Apr 13, 2026

Uh oh!

wdconinc commented Apr 13, 2026

Uh oh!

wdconinc commented Apr 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Conversation

wdconinc commented Apr 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

MarkusFrankATcernch left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Apr 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Test Results

Uh oh!

wdconinc commented Apr 11, 2026

Uh oh!

andresailer commented Apr 13, 2026

Uh oh!

wdconinc commented Apr 13, 2026

Uh oh!

wdconinc commented Apr 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

wdconinc commented Apr 11, 2026 •

edited

Loading

MarkusFrankATcernch left a comment •

edited

Loading

github-actions bot commented Apr 11, 2026 •

edited

Loading