feat(data): add getBatchForStudyDeployments endpoint #517

NGrech · 2025-08-29T09:21:51Z

Implement DataStreamService.getBatchForStudyDeployments for cross- deployment retrieval
Introduce CollectedDataPoint and CollectedDataSet containers
Support filters: DataType and time range ([from, to])
Normalize timestamps via SyncPoint (epoch µs)
Update JSON schemas and RPC request handling
Add tests (serialization, handler dispatch, in-memory filters/order, schema example validation)
Document endpoint in carp-data.md

Enables efficient retrieval across multiple study deployments with optional filtering for analytics and reporting use cases.

Copilot

Pull Request Overview

This PR introduces a new endpoint getBatchForStudyDeployments to the DataStreamService for retrieving data across multiple study deployments with filtering capabilities.

Adds getBatchForStudyDeployments method with filters for data types, device roles, and time ranges
Introduces CollectedDataPoint and CollectedDataSet data containers for cross-deployment data
Updates RPC infrastructure, JSON schemas, and documentation to support the new endpoint

Reviewed Changes

Copilot reviewed 13 out of 13 changed files in this pull request and generated 3 comments.

Show a summary per file

File	Description
carp.data.core/src/commonMain/kotlin/dk/cachet/carp/data/application/DataStreamService.kt	Adds the new getBatchForStudyDeployments method signature to the service interface
carp.data.core/src/commonMain/kotlin/dk/cachet/carp/data/application/CollectedDataPoint.kt	Defines the data structure for individual collected data points
carp.data.core/src/commonMain/kotlin/dk/cachet/carp/data/application/CollectedDataSet.kt	Defines the collection container with utility methods for filtering
carp.data.core/src/commonMain/kotlin/dk/cachet/carp/data/infrastructure/InMemoryDataStreamService.kt	Implements the new endpoint with filtering and validation logic
carp.data.core/src/commonMain/kotlin/dk/cachet/carp/data/infrastructure/DataStreamServiceRequest.kt	Adds the RPC request class for the new endpoint
carp.data.core/src/commonMain/kotlin/dk/cachet/carp/data/infrastructure/DataStreamServiceDecorator.kt	Updates the decorator to handle the new endpoint
rpc/schemas/data/CollectedDataPoint.json	JSON schema definition for CollectedDataPoint
rpc/schemas/data/CollectedDataSet.json	JSON schema definition for CollectedDataSet
rpc/schemas/data/DataStreamService/DataStreamServiceRequest.json	Updates the RPC schema to include the new request type
rpc/src/main/kotlin/dk/cachet/carp/rpc/GenerateExampleRequests.kt	Adds example request for the new endpoint
carp.data.core/src/commonTest/kotlin/dk/cachet/carp/data/infrastructure/InMemoryDataStreamServiceBatchRetrievalTest.kt	Comprehensive test suite for the new functionality
carp.data.core/src/commonTest/kotlin/dk/cachet/carp/data/infrastructure/DataStreamServiceInfrastructureTest.kt	Adds the new request to infrastructure tests
docs/carp-data.md	Documents the new endpoint in the service documentation

_{Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.}

rpc/schemas/data/CollectedDataPoint.json

rpc/schemas/data/CollectedDataSet.json

...a.core/src/commonMain/kotlin/dk/cachet/carp/data/infrastructure/InMemoryDataStreamService.kt

Whathecode · 2025-09-01T09:49:50Z

@NGrech Can you clean up the commit history a bit? At least the fixes can be squashed. There is no point in me reviewing a commit which later gets undone. ;) Also, the generated test files should probably be added simultaneously with the commit which would otherwise fail if you don't do so. It should be a goal for each commit to compile.

Possibly, that means all of this should live in a single commit. I'll review it as such if I find some time.

NGrech · 2025-09-01T12:14:32Z

@Whathecode I squashed the new fixes into the original one.
Note that the reason the generated test files were not in the original commit was that there is no mention of that requirement in the CONTRIBUTING.md.
I think we should add fix this:

You can also run detekt separately through gradle detekt

to gradle detektPasses, since that is the command run in the code analysis check when committing and (at least on windows) gradle detekt will build successfully when there are issues that gradle detektPasses will fail on.

Whathecode

First review round: questioning what a CollectedDataPoint is. More to follow as I continue looking, but that seems pretty fundamental. :)

carp.data.core/src/commonMain/kotlin/dk/cachet/carp/data/application/CollectedDataPoint.kt

carp.data.core/src/commonMain/kotlin/dk/cachet/carp/data/application/CollectedDataSet.kt

NGrech · 2025-09-03T10:30:10Z

@Whathecode I have looked at the comments you have made and I replied to your feedback, I had some questions and comments so would be interested in seeing your views.

Whathecode · 2025-09-04T15:37:09Z

I see you added this to the 2.0.0 milestone instead of 1.3. Any reason you expect this to be a breaking change, i.e., warranting a new major release?

…sts, and docs - Implement DataStreamService.getBatchForStudyDeployments for cross-deployment retrieval with filters (deployment IDs, deviceRoleNames, dataTypes, time range) - Normalize timestamps via SyncPoint - Add ImmutableDataStreamBatch/Sequence for efficient data access - Update InMemoryDataStreamService for batch retrieval - Update JSON schemas, RPC handling, and documentation - Refactor and clean up related code - Add comprehensive tests and test resources

NGrech · 2025-09-17T11:12:46Z

@Whathecode I have updated the the code based on the last discussion.

Whathecode

Still an incomplete review, but I started looking at why you added ImmutableDataStreamBatch and ...Sequence. The PR description is missing some clarification in regards to why you are adding this. Have a look at some of the questions I asked and see whether you can clarify things.

I also have the impression that adding these changes can easily be done as a separate commit (and even PR). You don't need those for your updates to DataStreamService, and as far as I can tell, the existing data structures would work just fine.

While looking at changes, I noticed some incorrect code style whitespaces. I added a commit which you can squash.

Whathecode · 2025-09-21T15:19:51Z

carp.data.core/src/commonMain/kotlin/dk/cachet/carp/data/application/DataStreamSequence.kt

+ * and cleaner concepts.
+ */
+@JsExport
+class ImmutableDataStreamSequence<TData : Data> private constructor(


Is this really needed? 🤔 You don't use it here. How do you expect to use it?

There already is the read-only DataStreamSequence interface, and as part of serialization DataStreamSequenceSnapshot is used (which is immutable). So, unless you want to safeguard against explicit casts that should get you there already from an encapsulation perspective,.

Whathecode · 2025-09-21T16:00:05Z