
Conversation

pawanjay176 (Member)

Issue Addressed

#8218

Proposed Changes

// TODO

Comment on lines +123 to +124
/// This is derived from CLI flags and not persisted.
#[serde(skip)]
Member:

Most of these settings are derived from CLI flags? I don't think we need the `serde(skip)` here; if anything, I think we might need it so we can write CLI tests in `lighthouse/tests/beacon_node.rs`.

Member:

(the persistence of this config using serde is only for tests)
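
For context, a minimal sketch of how `#[serde(skip)]` behaves on a round-trip (the field names here are illustrative, not the actual Lighthouse config): a skipped field is omitted when the config is serialized and falls back to its `Default` when it is read back, which determines whether the test-only persistence above can observe it.

```rust
use serde::{Deserialize, Serialize};

#[derive(Serialize, Deserialize, Default, Debug, PartialEq)]
struct Config {
    custody_group_count: u64,
    /// Derived from CLI flags; with `skip`, omitted on write and reset to
    /// `Default::default()` on read.
    #[serde(skip)]
    semi_supernode: bool,
}

fn main() {
    let cfg = Config { custody_group_count: 64, semi_supernode: true };
    let json = serde_json::to_string(&cfg).unwrap();
    // The skipped field never appears in the serialized output...
    assert!(!json.contains("semi_supernode"));
    // ...so a round-trip does not preserve it.
    let restored: Config = serde_json::from_str(&json).unwrap();
    assert!(!restored.semi_supernode);
}
```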

CustodyContext {
node_custody_type: NodeCustodyType,
) -> Result<Self, String> {
if ssz_context.persisted_is_supernode && node_custody_type != NodeCustodyType::Supernode {
Member:

So we might want a DB schema change to store the NodeCustodyType? Then we could enforce that the node can't change "down", i.e. supernode to semi-supernode/full, or semi-supernode to full?

Does anything break if we don't block the transition from semi-supernode to full? Because this check already covers both supernode cases.
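
As a hedged sketch of that enforcement (the enum name comes from the PR; the ordering and error handling are illustrative), deriving `Ord` in tier order would make the "can't change down" rule a one-line comparison when restoring from disk:

```rust
/// Variants declared lowest to highest tier, so the derived ordering
/// reflects how much the node custodies.
#[derive(Debug, Clone, Copy, PartialEq, Eq, PartialOrd, Ord)]
enum NodeCustodyType {
    Fullnode,
    SemiSupernode,
    Supernode,
}

/// Illustrative check: reject any transition to a lower custody tier.
fn check_custody_transition(
    persisted: NodeCustodyType,
    current: NodeCustodyType,
) -> Result<(), String> {
    if current < persisted {
        return Err(format!(
            "cannot downgrade node custody type from {persisted:?} to {current:?}"
        ));
    }
    Ok(())
}
```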

pawanjay176 (Member Author):

Sorry, I initially added this check thinking we wanted to prevent changes to the NodeCustodyType. But it is technically possible to support; we just need to set the correct value of earliest_available_slot.

One use case: all the nodes on sepolia that had to run a supernode to reconstruct all the blobs could downgrade to a semi-supernode.
We initially wanted to avoid switching modes, but it's quite easy to handle in the RPC layer by just setting the right value of earliest_available_slot, so I don't see any strong reason not to allow this.
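
A hedged sketch of the mechanism being described (names are illustrative, not the actual Lighthouse code): when custody expands, the newly custodied columns were never stored for older slots, so `earliest_available_slot` has to advance to the restart point; when custody shrinks, everything already on disk still covers the smaller custody set, so the persisted value can stand.

```rust
type Slot = u64;

/// Illustrative: the slot from which the node can honestly serve its full
/// custody set after a custody-group-count (cgc) change.
fn earliest_available_slot_after_change(
    cgc_persisted: u64,
    cgc_current: u64,
    persisted_earliest: Slot,
    restart_slot: Slot,
) -> Slot {
    if cgc_current > cgc_persisted {
        // Custody grew: the extra columns only exist from the restart onwards.
        restart_slot
    } else {
        // Custody shrank or is unchanged (e.g. supernode -> semi-supernode):
        // the data we already hold satisfies the new custody set.
        persisted_earliest
    }
}
```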

half of the data columns (enough for reconstruction), enabling efficient \
data availability with lower bandwidth and storage requirements compared to \
a supernode, while still supporting full blob reconstruction.")
.display_order(0)
Member:

Looks good, just need to update the CLI help in the Lighthouse book.
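
For a rough illustration of the numbers behind the help text above (assuming the Fulu parameters of 128 columns and 128 custody groups; the helper function is hypothetical): half the custody groups yields half the columns, which meets the 50% threshold needed for reconstruction.

```rust
const NUMBER_OF_COLUMNS: u64 = 128;
const NUMBER_OF_CUSTODY_GROUPS: u64 = 128;

/// Hypothetical helper: custody group count for a semi-supernode.
fn semi_supernode_cgc() -> u64 {
    NUMBER_OF_CUSTODY_GROUPS / 2
}

fn main() {
    let columns_per_group = NUMBER_OF_COLUMNS / NUMBER_OF_CUSTODY_GROUPS;
    let columns = semi_supernode_cgc() * columns_per_group; // 64 of 128 columns
    // Reconstruction requires at least 50% of the extended columns.
    assert!(columns * 2 >= NUMBER_OF_COLUMNS);
}
```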

- new_custody_group_count: updated_cgc,
- sampling_count: self.num_of_custody_groups_to_sample(effective_epoch, spec),
+ new_custody_group_count: updated_cgc as u64,
+ sampling_count: self.sampling_count_at_epoch(None, spec) as u64,
jimmygchen (Member) commented on Oct 20, 2025:

I noticed `num_of_custody_groups_to_sample` and `num_of_data_columns_to_sample` are now both mapped to `sampling_count_at_epoch`. I'm sure this is intentional, but I want to make sure the reason for the previous separation is clear: it was mainly to support subnet decoupling (which isn't my favourite feature), and unifying these into a single function is likely going to break that.
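
To make the concern concrete, a hedged sketch (signatures are illustrative, not Lighthouse's actual API): one function counts custody *groups*, the other counts *columns*. The conversion is trivially 1:1 under today's parameters, but subnet decoupling relies on being able to vary the column-level count independently, which a single unified function cannot express.

```rust
const NUMBER_OF_COLUMNS: u64 = 128;
const NUMBER_OF_CUSTODY_GROUPS: u64 = 128;

/// Illustrative only: the number of custody *groups* a node samples.
fn num_of_custody_groups_to_sample(cgc: u64, samples_per_slot: u64) -> u64 {
    cgc.max(samples_per_slot)
}

/// Illustrative only: the number of *columns* actually sampled. Collapsing
/// this into the group-level function loses the degree of freedom that
/// subnet decoupling needs.
fn num_of_data_columns_to_sample(groups_to_sample: u64) -> u64 {
    groups_to_sample * (NUMBER_OF_COLUMNS / NUMBER_OF_CUSTODY_GROUPS)
}
```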

See:

validator_custody_at_head: ssz_v24.validator_custody_at_head,
persisted_is_supernode: ssz_v24.persisted_is_supernode,
// TODO(pawan): fix
persisted_node_custody_count: if ssz_v24.persisted_is_supernode {
Member:

Note that we'll need a new schema migration file for this change
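
A hedged sketch of what that migration might look like (the struct layouts beyond the fields visible in the diff are hypothetical): v24 only recorded a supernode boolean, so the upgrade has to choose a custody-group count to stand in for it.

```rust
// Hypothetical on-disk layouts; only the fields shown in the diff above
// are taken from the PR.
struct CustodyContextSszV24 {
    validator_custody_at_head: u64,
    persisted_is_supernode: bool,
}

struct CustodyContextSszV25 {
    validator_custody_at_head: u64,
    persisted_node_custody_count: u64,
}

struct ChainSpec {
    number_of_custody_groups: u64,
    custody_requirement: u64,
}

/// Sketch of a v24 -> v25 upgrade: map the old boolean onto a count.
fn upgrade_custody_context(
    v24: CustodyContextSszV24,
    spec: &ChainSpec,
) -> CustodyContextSszV25 {
    CustodyContextSszV25 {
        validator_custody_at_head: v24.validator_custody_at_head,
        persisted_node_custody_count: if v24.persisted_is_supernode {
            // A supernode custodies every group.
            spec.number_of_custody_groups
        } else {
            // Fall back to the protocol minimum; the real migration may
            // need to account for validator custody as well.
            spec.custody_requirement
        },
    }
}
```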

let update_earliest_available_slot = if cgc_current > cgc_persisted {
    true
} else {
    false
Member:

Does this assume we have previously completed column backfill?

jimmygchen (Member) commented on Oct 20, 2025:

I've done a quick pass, and I feel like this is quite a risky change before the release: there's a bit of refactoring, and it touches quite a lot of code related to custody/sampling that we may want to review and test thoroughly. I can continue the review tomorrow.

Is it possible to have `--semi-supernode` with minimal refactoring and changes? I think the restart handling could be done in a separate PR, since we don't handle that between supernode and fullnode anyway, and it's something we may want to think about how to handle properly, e.g. prompt the user to resync before proceeding, prune extra columns, etc.

pawanjay176 (Member Author):

> I've done a quick pass, and I feel like this is quite a risky change before the release: there's a bit of refactoring

I agree. It's a lot more change than I anticipated initially. I don't think we should rush it for the release either.

jimmygchen added the v8.1.0 Post-Fulu release label on Oct 21, 2025
jimmygchen mentioned this pull request on Oct 21, 2025
mergify bot pushed a commit that referenced this pull request on Oct 22, 2025:
Addresses #8218

A simplified version of #8241 for the initial release.

I've tried to minimise the logic change in this PR; introducing the `NodeCustodyType` enum still results in quite a bit of diff, but the actual logic change in `CustodyContext` is quite small.

The main changes are in the `CustodyContext` struct:
* ~~combining `validator_custody_count` and `current_is_supernode` fields into a single `custody_group_count_at_head` field. We persist the cgc of the initial CLI values into the `custody_group_count_at_head` field and only allow for increases (same behaviour as before).~~
* I noticed the above approach caused a backward compatibility issue; I've [made a fix](15569bc) and changed the approach slightly (which was actually what I had originally in mind):
* when initialising, only override the `validator_custody_count` value if either flag `--supernode` or `--semi-supernode` is used; otherwise leave it at the existing default of `0`. Most other logic remains unchanged (see the sketch after this list).
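
A hedged sketch of that initialisation rule (the enum matches the PR's naming; the helper and the semi-supernode count are illustrative):

```rust
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum NodeCustodyType {
    Fullnode,
    SemiSupernode,
    Supernode,
}

/// Returns an override for `validator_custody_count`, or `None` to keep the
/// persisted value (default `0`) untouched.
fn validator_custody_override(
    node_type: NodeCustodyType,
    number_of_custody_groups: u64,
) -> Option<u64> {
    match node_type {
        NodeCustodyType::Supernode => Some(number_of_custody_groups),
        NodeCustodyType::SemiSupernode => Some(number_of_custody_groups / 2),
        // No flag given: leave validator custody to grow as before.
        NodeCustodyType::Fullnode => None,
    }
}
```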

All existing validator custody unit tests are still passing, and I've added additional tests covering semi-supernode and restoring `CustodyContext` from disk.

Note: I've added a `WARN` if the user attempts to switch to `--semi-supernode` or `--supernode`. This currently has no effect, but once @eserilev's column backfill is merged, we should be able to support this quite easily.

Things to test
- [x] cgc in metadata / enr
- [x] cgc in metrics
- [x] subscribed subnets
- [x] getBlobs endpoint


Co-Authored-By: Jimmy Chen <[email protected]>