Refactor `DataStreamLifecycle` creation code #138780

nielsbauman · 2025-11-28T17:50:29Z

We currently have several methods for creating DataStreamLifecycle and corresponding DataStreamLifecycle.Template objects. Callers of these methods have to specify every field of those objects, so adding a new field changes a lot of lines.

To avoid those extra changed lines, we convert most usages to builders, which fall back to default values for unset fields instead of requiring explicit null arguments.
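
For readers unfamiliar with the trade-off, here is a minimal sketch of an all-args factory next to a builder. `LifecycleSketch`, its fields, and the `enabled = true` default are hypothetical stand-ins chosen for illustration; they are not the real `DataStreamLifecycle` API.

```java
// Hypothetical stand-in for a lifecycle object; this is not the real DataStreamLifecycle.
final class LifecycleSketch {
    final boolean enabled;
    final String dataRetention;      // null means "not configured"
    final String downsamplingMethod; // null means "not configured"

    private LifecycleSketch(boolean enabled, String dataRetention, String downsamplingMethod) {
        this.enabled = enabled;
        this.dataRetention = dataRetention;
        this.downsamplingMethod = downsamplingMethod;
    }

    // All-args factory: every call site must pass every field, including nulls,
    // so adding a field means touching every caller.
    static LifecycleSketch create(boolean enabled, String dataRetention, String downsamplingMethod) {
        return new LifecycleSketch(enabled, dataRetention, downsamplingMethod);
    }

    static Builder builder() {
        return new Builder();
    }

    // Builder: callers set only the fields they care about; the rest keep their defaults.
    static final class Builder {
        private boolean enabled = true; // assumed default, for illustration only
        private String dataRetention;
        private String downsamplingMethod;

        Builder enabled(boolean enabled) {
            this.enabled = enabled;
            return this;
        }

        Builder dataRetention(String dataRetention) {
            this.dataRetention = dataRetention;
            return this;
        }

        Builder downsamplingMethod(String downsamplingMethod) {
            this.downsamplingMethod = downsamplingMethod;
            return this;
        }

        LifecycleSketch build() {
            return new LifecycleSketch(enabled, dataRetention, downsamplingMethod);
        }
    }
}
```

With this sketch, `LifecycleSketch.create(true, "30d", null)` and `LifecycleSketch.builder().dataRetention("30d").build()` produce equivalent objects, but only the first call site has to change when a fourth field is added.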

elasticsearchmachine · 2025-11-28T17:50:54Z

Pinging @elastic/es-data-management (Team:Data Management)

nielsbauman · 2025-11-28T17:54:39Z

...reams/src/test/java/org/elasticsearch/datastreams/lifecycle/DataStreamLifecycleFixtures.java

-        return DataStreamLifecycle.createDataLifecycleTemplate(
-            frequently(),
-            randomResettable(ESTestCase::randomTimeValue),
-            downsampling,
-            randomResettable(() -> randomSamplingMethod(downsampling.get()))
-        );
+        return DataStreamLifecycle.dataLifecycleBuilder()
+            .enabled(frequently())
+            .dataRetention(randomResettable(ESTestCase::randomTimeValue))
+            .downsamplingRounds(downsampling)
+            .downsamplingMethod(randomResettable(() -> randomSamplingMethod(downsampling.get())))
+            .buildTemplate();


There are one or two similar usages where we do lose a bit of value by using a builder. In those cases, it would be nice to get a compilation error if you add a new field but forget to add it to the methods that generate random instances. Having to specify a random value for the new field there keeps the coverage complete.

However, because DataStreamLifecycle.LifecycleType is package-private, we can't currently use the constructor of DataStreamLifecycle. The only options I could think of are using a builder (like I currently do in this PR) or making the LifecycleType enum public. Thoughts are welcome :)
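
To illustrate the coverage argument, here is a rough sketch of a random-instance generator that calls a constructor directly. `RetentionSketch` and its fields are hypothetical and only stand in for an object whose constructor is accessible from the test code.

```java
import java.util.Random;

// Hypothetical class; stands in for an object whose constructor the test can call.
final class RetentionSketch {
    final boolean enabled;
    final Long retentionMillis; // null means "not configured"

    RetentionSketch(boolean enabled, Long retentionMillis) {
        this.enabled = enabled;
        this.retentionMillis = retentionMillis;
    }

    // Calls the constructor directly: adding a constructor parameter for a new field
    // makes this line fail to compile, so the author has to supply a random value here.
    // A builder-based generator would keep compiling and silently use the default.
    static RetentionSketch random(Random random) {
        Long retention = random.nextBoolean() ? null : Long.valueOf(random.nextInt(1000));
        return new RetentionSketch(random.nextBoolean(), retention);
    }
}
```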

nielsbauman · 2025-11-28T17:56:55Z

server/src/main/java/org/elasticsearch/cluster/metadata/DataStreamLifecycle.java

    enum LifecycleType implements Writeable {
-        DATA("data", (byte) 0),
-        FAILURES("failures", (byte) 1);
-
-        private final String label;
-        private final byte id;
-        private static final Map<Byte, LifecycleType> REGISTRY = Arrays.stream(LifecycleType.values())
-            .collect(Collectors.toMap(l -> l.id, Function.identity()));
-
-        LifecycleType(String label, byte id) {
-            this.label = label;
-            this.id = id;
-        }
+        DATA,
+        FAILURES;

        @Override
        public void writeTo(StreamOutput out) throws IOException {
-            out.write(id);
+            out.writeEnum(this);
        }

        public static LifecycleType read(StreamInput in) throws IOException {
-            return REGISTRY.get(in.readByte());
+            return in.readEnum(LifecycleType.class);


These changes are technically a bit out-of-scope for this PR, but I figured it's fine to include them here. There is no need to have this custom serialization; we have built-in code for that.
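
As a rough illustration of why the id registry is redundant, here is a self-contained sketch of ordinal-based enum serialization using plain `java.io` streams. It assumes that the built-in `writeEnum`/`readEnum` helpers encode the enum's ordinal; that is my understanding rather than something stated in this PR, and `LifecycleTypeSketch` is a hypothetical copy of the enum.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;

// Hypothetical copy of the enum; plain java.io streams stand in for StreamOutput/StreamInput.
enum LifecycleTypeSketch {
    DATA,     // ordinal 0, same as the old explicit id
    FAILURES; // ordinal 1, same as the old explicit id

    // Writes the ordinal as a single byte; no label or id registry needed.
    void writeTo(DataOutputStream out) throws IOException {
        out.writeByte(ordinal());
    }

    // Reads one byte and maps it back to the constant, rejecting unknown ordinals.
    static LifecycleTypeSketch read(DataInputStream in) throws IOException {
        int ordinal = in.readUnsignedByte();
        LifecycleTypeSketch[] values = values();
        if (ordinal >= values.length) {
            throw new IOException("Unknown LifecycleTypeSketch ordinal [" + ordinal + "]");
        }
        return values[ordinal];
    }

    static byte[] toBytes(LifecycleTypeSketch type) {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (DataOutputStream out = new DataOutputStream(bytes)) {
            type.writeTo(out);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return bytes.toByteArray();
    }

    static LifecycleTypeSketch fromBytes(byte[] data) {
        try (DataInputStream in = new DataInputStream(new ByteArrayInputStream(data))) {
            return read(in);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

The usual caveat with ordinal-based encoding applies: reordering or inserting constants changes the serialized values, so new constants should only be appended.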

nielsbauman · 2025-11-28T17:57:30Z

...er/src/test/java/org/elasticsearch/cluster/metadata/DataStreamFailureStoreTemplateTests.java

        );
        DataStreamFailureStore.Template result = DataStreamFailureStore.builder(template).composeTemplate(template).buildTemplate();
-        assertThat(result, equalTo(normalise(template)));
+        assertThat(result, equalTo(template));


By changing the builder fields to ResettableValue, we don't need to normalise these templates anymore.

nielsbauman · 2025-11-28T18:02:20Z

server/src/test/java/org/elasticsearch/cluster/metadata/MetadataIndexTemplateServiceTests.java

-        assertLifecycleResolution(service, project, List.of(ct30d, ctNullRetention), null, DataStreamLifecycle.Template.DATA_DEFAULT);
+        assertLifecycleResolution(
+            service,
+            project,
+            List.of(ct30d, ctNullRetention),
+            null,
+            DataStreamLifecycle.dataLifecycleBuilder().dataRetention(ResettableValue.reset()).buildTemplate()
+        );


The assertions in this test were previously technically incorrect. With my change to ResettableValue in the builder, we can now properly assess these values. Previously, we weren't able to distinguish between ResettableValue.reset() and ResettableValue.undefined() because of how the template composition uses the builders. That wasn't necessarily a problem, as the final value is null in both cases, but the end result now represents the correct intention of the value.
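
To make the reset/undefined distinction concrete, here is a small sketch of a three-state value and how composition can preserve an explicit reset. `TriState` and its `merge` method are hypothetical and only loosely modelled on the idea behind ResettableValue; they are not its actual API.

```java
// Hypothetical three-state holder: undefined (nothing specified),
// reset (explicitly cleared), or defined with a value.
final class TriState<T> {
    private static final TriState<?> UNDEFINED = new TriState<>(null, false);
    private static final TriState<?> RESET = new TriState<>(null, true);

    private final T value;
    private final boolean defined;

    private TriState(T value, boolean defined) {
        this.value = value;
        this.defined = defined;
    }

    @SuppressWarnings("unchecked")
    static <T> TriState<T> undefined() {
        return (TriState<T>) UNDEFINED;
    }

    @SuppressWarnings("unchecked")
    static <T> TriState<T> reset() {
        return (TriState<T>) RESET;
    }

    static <T> TriState<T> of(T value) {
        if (value == null) {
            throw new IllegalArgumentException("use reset() or undefined() instead of a null value");
        }
        return new TriState<>(value, true);
    }

    // Both reset and undefined resolve to null, which is why the final value alone
    // cannot tell them apart.
    T get() {
        return value;
    }

    boolean isDefined() {
        return defined;
    }

    // Composition: a later template wins only if it says something,
    // either a value or an explicit reset.
    TriState<T> merge(TriState<T> later) {
        return later.isDefined() ? later : this;
    }
}
```

In this sketch, composing `of("30d")` with `reset()` yields `reset()`, while composing it with `undefined()` keeps `"30d"`. If the composed result were stored as a plain nullable field, the first case would be indistinguishable from never having set a retention at all.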

Refactor DLM lifecycle builder

75be786

nielsbauman requested a review from masseyke November 28, 2025 17:50

nielsbauman added >refactoring :Data Management/Data streams Data streams and their lifecycles labels Nov 28, 2025

elasticsearchmachine added v9.3.0 Team:Data Management Meta label for data/management team labels Nov 28, 2025
