reorganized stats/chunk containers by TjarkMiener · Pull Request #2999 · cta-observatory/ctapipe

TjarkMiener · 2026-04-27T14:24:58Z

We noticed in #2996 that containers dealing with chunks and stats needed some maintenance and docstring polishing.

kosack

Looks good, just need a changelog entry

ctao-sonarqube · 2026-04-28T09:24:15Z

Quality Gate passed

Issues
0 New issues
0 Fixed issues
0 Accepted issues

Measures
0 Security Hotspots
100.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube

TjarkMiener · 2026-04-29T07:37:02Z

+class ChunkStatisticsContainer(ChunkContainer):
+    """Container for descriptive statistics of the chunk distribution"""
+
+    mean = Field(None, "mean value of the chunk distribution")
+    median = Field(None, "median value of the chunk distribution")
+    std = Field(None, "standard deviation of the chunk distribution")


I was thinking of replacing this duplication by

class ChunkStatisticsContainer(ChunkContainer): """Container for descriptive statistics of the chunk distribution""" stats = Field( default_factory=StatisticsContainer, description="Statistical description of the chunk distribution", )

which would be nice but would also lead to breaking changes here and elsewhere.
@mexanick @maxnoe @kosack

What we do elsewhere is have separate containers for the index and the data and write multiple containers to the same table.

I.e. you could have the ChunkContainer, StatsContainer and a HistogramContainer and write like this:

writer.write(table, (chunk_container, stats_container))

like this, you also don't need separate containers for the interpolated result and the chunk storage.

See e.g.:

ctapipe/src/ctapipe/io/datawriter.py

Lines 309 to 312 in 8705606

self._writer.write(

table_name="simulation/event/subarray/shower",

containers=[event.index, event.simulation.shower],

)

I think n_events should be moved to the StatisticsContainer and then compute_stats() from the PlainAggregator and SigmaClippingAggregator should return StatisticsContainer and the compute_histo() from HistorgramAggregator from #2996 should return a HistogramContainer

mmh, that seems a bit weird. the n_events is a property of the chunk and should be the same for all chunk aggregations, regardless of whether you compute a histogram or stats or something else.

n_events should be really the number of events inside the chunk, which is very interesting in case of time-based chunks.

It should also be independent from which values are actually used due to e.g. outlier detection, sigma clipping or under/overflow.

right, the number of events are related to the chunks and the number of entries are related to the number of values used in the aggregation.

reorganized stats/chunk containers

7b43f6e

TjarkMiener added the maintenance label Apr 27, 2026

kosack requested changes Apr 28, 2026

View reviewed changes

add changelog

621b015

TjarkMiener requested review from kosack, maxnoe and mexanick April 28, 2026 09:00

mexanick approved these changes Apr 28, 2026

View reviewed changes

maxnoe reviewed Apr 28, 2026

View reviewed changes

Comment thread src/ctapipe/containers.py

TjarkMiener commented Apr 29, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

reorganized stats/chunk containers#2999

reorganized stats/chunk containers#2999
TjarkMiener wants to merge 2 commits into
mainfrom
container_maintenance

TjarkMiener commented Apr 27, 2026

Uh oh!

kosack left a comment

Uh oh!

ctao-sonarqube Bot commented Apr 28, 2026

Uh oh!

Uh oh!

TjarkMiener Apr 29, 2026 •

edited

Loading

Uh oh!

maxnoe Apr 29, 2026

Uh oh!

maxnoe Apr 29, 2026 •

edited

Loading

Uh oh!

TjarkMiener Apr 29, 2026

Uh oh!

maxnoe Apr 29, 2026

Uh oh!

TjarkMiener Apr 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

	self._writer.write(
	table_name="simulation/event/subarray/shower",
	containers=[event.index, event.simulation.shower],
	)

Conversation

TjarkMiener commented Apr 27, 2026

Uh oh!

kosack left a comment

Choose a reason for hiding this comment

Uh oh!

ctao-sonarqube Bot commented Apr 28, 2026

Quality Gate passed

Uh oh!

Uh oh!

TjarkMiener Apr 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

maxnoe Apr 29, 2026

Choose a reason for hiding this comment

Uh oh!

maxnoe Apr 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

TjarkMiener Apr 29, 2026

Choose a reason for hiding this comment

Uh oh!

maxnoe Apr 29, 2026

Choose a reason for hiding this comment

Uh oh!

TjarkMiener Apr 29, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

TjarkMiener Apr 29, 2026 •

edited

Loading

maxnoe Apr 29, 2026 •

edited

Loading