Collapse dimensions common functionality for plot wrappers #405

willGraham01 · 2025-02-05T09:21:57Z

Description

What is this PR

Bug fix
Addition of a new feature
Other

Why is this PR needed?

See the discussion on Zulip.

What does this PR do?

Introduces the collapse_extra_dimensions function, which the plotting wrappers can use to "remove" superfluous dimensions before creating plots.
Also introduces the coord_of_dimension function, a short wrapper that saves us from repeating the same logic in each plotting wrapper when asking the user to identify one individual, one keypoint, etc. The user may specify this either by its index (slice-style) or by its coordinate (DataArray.sel style), and this snippet was appearing in several places across the plotting wrappers. Now we can run individual = coord_of_dimension(da, "individuals", individual) to ensure that da.sel(individuals=individual) will work.
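
For readers without the diff at hand, here is a minimal sketch of how the two helpers described above could behave. It is not the code in movement/utils/dimensions.py: the signatures, the default preserve_dims, and the sample data are all assumptions made for illustration.

```python
import numpy as np
import xarray as xr


def coord_of_dimension(da, dimension, coord_index):
    """Return a coordinate label along ``dimension`` (illustrative sketch).

    An integer is treated as a position along the dimension; any other
    value is assumed to already be a coordinate label and is returned as is.
    """
    if isinstance(coord_index, int):
        return da[dimension].values[coord_index].item()
    return coord_index


def collapse_extra_dimensions(da, preserve_dims=("time", "space"), **selection):
    """Reduce every dimension not in ``preserve_dims`` to one coordinate.

    Dimensions without an explicit selection fall back to their 0th
    coordinate (assumed default).
    """
    extra_dims = [d for d in da.dims if d not in preserve_dims]
    labels = {
        d: coord_of_dimension(da, d, selection.get(d, 0)) for d in extra_dims
    }
    # Selecting a scalar label drops the corresponding dimension
    return da.sel(labels)


# Hypothetical data with the dimensions discussed in this PR
da = xr.DataArray(
    np.arange(48, dtype=float).reshape(4, 2, 3, 2),
    dims=("time", "space", "keypoints", "individuals"),
    coords={
        "space": ["x", "y"],
        "keypoints": ["nose", "left_ear", "right_ear"],
        "individuals": ["mouse_0", "mouse_1"],
    },
)

default = collapse_extra_dimensions(da)  # first keypoint, first individual
selected = collapse_extra_dimensions(
    da, keypoints="left_ear", individuals="mouse_1"
)
```

Both results keep only the time and space dimensions, which is the shape a 2D plotting wrapper would need.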

References

Zulip discussion.
PRs Plotting Wrappers: Occupancy Histogram #403, Plotting wrappers: Body-vector orientation #402, and Plotting wrappers: Head Trajectory #394 will need to merge in this method to use the functionality it introduces.

How has this PR been tested?

Local tests added and are passing.

Is this a breaking change?

No

Does this PR require an update to the documentation?

No

Checklist:

The code has been tested locally
Tests have been added to cover all new functionality
The documentation has been updated to reflect any changes
The code has been formatted with pre-commit

niksirbi

Hey @willGraham01, I found some issues with this implementation, which we'll have to clarify. The main problem, in my opinion, is that selection is too flexible: allowing coordinates to be specified both by name and by index leads to some ambiguities.

I am of the opinion that we should restrict it to only accept coordinate names, in which case there is no more need for coord_of_dimension (see comments for more details).
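
To illustrate the kind of ambiguity meant here, consider a made-up case (not taken from the test suite) in which the coordinate labels are themselves integers. An argument such as 1 could then mean "the individual labelled 1" or "the individual at position 1", and the two need not agree:

```python
import numpy as np
import xarray as xr

# Hypothetical data: individuals are labelled with integers that do not
# match their positions along the dimension.
da = xr.DataArray(
    np.array([10.0, 20.0, 30.0]),
    dims=("individuals",),
    coords={"individuals": [2, 0, 1]},
)

by_label = da.sel(individuals=1).item()      # coordinate label 1
by_position = da.isel(individuals=1).item()  # position 1 along the dimension

print(by_label, by_position)  # 30.0 20.0
```

Accepting only coordinate labels removes the need to guess which of the two the caller intended.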

Another thing I realised while reviewing this PR is that the collapse_extra_dimensions utility may not be as useful as we thought for plot.trajectory, at least not in the current form of that function (see PR 394). The reason is that in plot.trajectory we plan to also accept lists of keypoints as a valid selection, while collapse_extra_dimensions will always collapse to at most 1 keypoint. Personally, I think collapse_extra_dimensions should keep its behaviour (i.e. collapse all but the preserved dims to 1), but this means that we either have to work around that in plot.trajectory or make plot.trajectory less flexible in handling keypoints. cc @stellaprins.

Let me know what you think about it.

movement/utils/dimensions.py

tests/test_unit/test_collapse_dimensions.py

willGraham01 · 2025-02-06T09:20:31Z

The isel and sel issue is something I overlooked; I think your decision to enforce a sel-like behaviour is the sensible one (we still default to the 0th-indexed coordinate if the user gives no selection).

> The reason is that in plot.trajectory we plan to also accept lists of keypoints as a valid selection, while collapse_extra_dimensions will always collapse to at most 1 keypoint. Personally, I think collapse_extra_dimensions should keep its behaviour (i.e. collapse all but the preserved dims to 1), but this means that we either have to work around that in plot.trajectory or make plot.trajectory less flexible in handling keypoints. cc @stellaprins.
>
> Let me know what you think about it.

I think it still has merit here, at least for removing dimensions that aren't time, space, or keypoints. For example (inside plot_trajectory):

# Selection is now by coordinate label only, so use None as the default:
# that way we know to convert the default to a coordinate label rather than
# a coordinate index. Popping also removes keypoints from the dict, so we can
# pass selection to collapse_extra_dimensions as normal.
keypoints = selection.pop("keypoints", None)
if keypoints is None:
    # Use default 0th keypoint
    keypoints = coord_of_dimension(da, "keypoints", 0)

if isinstance(keypoints, str):
    # keypoints is a single coordinate label (otherwise it is an iterable
    # of labels and we drop into the else block).
    # The centroid of one point is the point itself.
    centroid = collapse_extra_dimensions(da, keypoints=keypoints, **selection)
else:
    # Centroid of many keypoints
    centroid = collapse_extra_dimensions(da, preserve_dims=["time", "space", "keypoints"], **selection)
    centroid = centroid.sel(keypoints=keypoints).mean(dim="keypoints")  # not sure if this is the precise syntax but you get the idea

I don't know if this is going to cause a mass re-write for @stellaprins, though.

niksirbi · 2025-02-06T11:07:34Z

> The isel and sel issue is something I overlooked; I think your decision to enforce a sel-like behaviour is the sensible one (we still default to the 0th-indexed coordinate if the user gives no selection).

Agreed, let's go for sel-like behaviour, but still default to the 0-th index (first listed keypoint) if no selection.

> I think it still has merit here, at least for removing dimensions that aren't time, space, or keypoints. For example (inside plot_trajectory):

I agree that it still has merit, and we could go with your solution for plot.trajectory. @stellaprins I think you can forgo my suggestion of defaulting to the centroid of all keypoints if keypoints=None. It complicates the mental model too much, for us and for users. Instead we go for Will's mental model, i.e.:

* If no keypoint is explicitly selected, plot the first one
* If 1 keypoint is requested (by its label) we plot that one
* If multiple keypoints are selected by label, plot their centroid

For individuals we only allow 1, which is the 0-th by default or the selected one if it's specified by label.

This way the mental model is the same for keypoints and individuals, except that you can explicitly request the centroid of several keypoints.
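
The three keypoint rules could be sketched roughly as follows. The helper name select_keypoints and the sample data are hypothetical, and the sketch assumes keypoint labels are strings:

```python
import numpy as np
import xarray as xr


def select_keypoints(da, keypoints=None):
    """Apply the three keypoint rules above (hypothetical helper)."""
    if keypoints is None:
        # Rule 1: nothing selected, fall back to the first keypoint
        return da.isel(keypoints=0)
    if isinstance(keypoints, str):
        # Rule 2: a single label selects that keypoint
        return da.sel(keypoints=keypoints)
    # Rule 3: several labels, return their centroid
    return da.sel(keypoints=list(keypoints)).mean(dim="keypoints")


# Made-up data: each keypoint has a constant value so results are easy to read
da = xr.DataArray(
    np.broadcast_to(np.array([0.0, 2.0, 4.0]), (2, 2, 3)).copy(),
    dims=("time", "space", "keypoints"),
    coords={
        "space": ["x", "y"],
        "keypoints": ["nose", "left_ear", "right_ear"],
    },
)

first = select_keypoints(da)                               # nose
single = select_keypoints(da, "left_ear")                  # left_ear
centroid = select_keypoints(da, ["left_ear", "right_ear"])  # mean of the two ears
```

In all three cases the keypoints dimension is gone from the result, so downstream plotting code would not need to know which rule applied.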

Are we all fine with this compromise?

stellaprins · 2025-02-06T11:27:37Z

> * If no keypoint is explicitly selected, plot the first one
> * If 1 keypoint is requested (by its label) we plot that one
> * If multiple keypoints are selected by label, plot their centroid
>
> For individuals we only allow 1, which is the 0-th by default or the selected one if it's specified by label.
>
> This way the mental model is the same for keypoints and individuals, except that you can explicitly request the centroid of several keypoints.
>
> Are we all fine with this compromise?

Sounds good to me! I quite liked the default of selecting the midpoint of all keypoints, because it would give the user a more sensible plot if they do not specify any keypoints, and it makes plotting the centroid very easy. As a user, I'd rather have the centroid plotted by default than the left paw, if that happens to be the first keypoint. But I can see advantages to the above too.

Will have a look at this PR right now and address your comments @niksirbi and then after it is merged see how I can apply it to plot.trajectory.

niksirbi · 2025-02-06T11:44:30Z

> Sounds good to me! I quite liked the default of selecting the midpoint of all keypoints, because it would give a more sensible plot for the user (? as user I'd rather have the centroid plotted by default than the left paw if that happens to be the first keypoint) if they do not specify any keypoints and makes it very easy for them to plot the centroid. But I can see advantages to the above too.

Actually, this made me reconsider: from the user's perspective, what you are describing makes sense. Perhaps try to implement plot.trajectory that way (after this is merged), and we can see how it looks once we have the code at hand?

sonarqubecloud · 2025-02-06T16:30:18Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

willGraham01 added 3 commits February 5, 2025 08:23

Write collapse function

69a85e0

Bring out coord_from_dimension as it's used a lot

e1cb1ff

Write test coverage

c7cfd0e

willGraham01 requested a review from niksirbi February 5, 2025 09:24

niksirbi requested changes Feb 5, 2025

niksirbi reviewed Feb 5, 2025

tests/test_unit/test_collapse_dimensions.py (outdated, resolved)

stellaprins added 2 commits February 6, 2025 15:52

change selection input to str only, add value error tests

8a7a349

fix example

bbcc89b

stellaprins requested a review from niksirbi February 6, 2025 16:30
