sweep: start tracking input spending status in the fee bumper #9447

yyforyongyu · 2025-01-26T02:18:12Z

There are two places tracking the spending status of a given input - one in the sweeper, the other in the fee bumper. We now move the tracking to be handled in the fee bumper so we always have a single source of truth. By the end of this fix, we should see that,

- the fee func will be kept on the original line when retrying sweeps.
- both the sweeper and the fee bumper can recover their state from a restart.
- for the neutrino backend, the initial sweeping tx is now always RBF-compliant.

The fix is made of two PRs to keep the size small - the first PR will enable tracking the spending status of inputs in the fee bumper, and the second will fix the rest.

Depends on,

sweeper: rename Failed to Fatal and minor refactor #9446

coderabbitai · 2025-01-26T02:18:41Z

Important

Review skipped

Auto reviews are limited to specific labels.

🏷️ Labels to auto review (1)

llm-review

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Roasbeef

First pass, one thing that wasn't immediately obvious to me is: where do we fix the issue so that the state of the fee function is properly carried over into the new batch (instead of reset) when one of the inputs in a cohort is spent?

itest/lnd_sweep_test.go

Roasbeef · 2025-02-04T00:18:56Z

sweep/fee_bumper.go

 				continue
 			}

-			log.Warnf("Detected third party spent of output=%v "+
-				"in tx=%v", op, spend.SpendingTx.TxHash())
+			spendingTx := spend.SpendingTx


I wonder if we should actually block here, even if just for a moment, to allow the scheduler to run the goroutine that does the dispatch.

Spent a bit of time to re-familiarize myself with the notifier after the latest set of refactors, and I don't see an area where we'll insta dispatch the response before exiting the initial method call.

You mean we perform a tiny sleep (something like `case <-time.After(100 * time.Millisecond)`) instead?

We did refactor this area a bit in 1200b75, which makes sure the block is always sent before the tx. However, we cannot guarantee the order is maintained: the pipeline is fairly deep, so we cannot be sure they are read in this order.

A previous attempt was to implement a method `HasOutpointSpent` on `Blockbeat`. The idea is that, whenever we are notified of a block, we can easily access the block data to see whether the watched outputs are spent, which eliminates the race and guarantees we won't miss a spending event. However, there was some difficulty involved in implementing it for neutrino, as discussed here. Now that you mention it, I think it's still worth keeping as a TODO, since we can just register the spend when it's neutrino to avoid fetching full blocks, and read the block data when it's a full node.
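The `HasOutpointSpent` idea described above can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the `OutPoint`, `Tx`, `Block`, and `Blockbeat` types are simplified stand-ins rather than the lnd or btcd types, and the method signature is hypothetical since the attempt was never merged.

```go
package main

// OutPoint identifies a transaction output. Simplified stand-in for the
// real outpoint type; not the btcd definition.
type OutPoint struct {
	TxID  string
	Index uint32
}

// Tx is a simplified transaction holding only the inputs it spends.
type Tx struct {
	TxID   string
	Inputs []OutPoint
}

// Block is a simplified block made of transactions.
type Block struct {
	Height int32
	Txns   []Tx
}

// Blockbeat is a hypothetical wrapper around the block data delivered to a
// subsystem on every new block.
type Blockbeat struct {
	block Block
}

// HasOutpointSpent scans the block and returns the transaction spending the
// given outpoint, or nil if the outpoint is not spent in this block.
func (b *Blockbeat) HasOutpointSpent(op OutPoint) *Tx {
	for i := range b.block.Txns {
		tx := &b.block.Txns[i]
		for _, in := range tx.Inputs {
			if in == op {
				return tx
			}
		}
	}

	return nil
}
```

Because the lookup reads the block that triggered the notification, it does not depend on the relative ordering of block and spend notifications. The cost is that the full block must be available, which is the difficulty mentioned for neutrino.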

yyforyongyu · 2025-02-05T13:02:59Z

Note to reviewers - the itest is disabled here and the CI should pass in #9448. I think once approved we can merge #9448 to this branch, and merge this PR to master.

morehouse

The code itself looks pretty good to me. Only nits there.

The first commit dcd5119 looks like it doesn't belong in this PR, since the added test isn't fixed until the next PR.

Also TxUnknownSpend is currently unhandled in UtxoSweeper, which means things would break if we merged this PR alone. We should really add a minimal switch case to UtxoSweeper.handleBumpEvent that keeps the existing behavior for TxUnknownSpend (i.e. same as TxFailed case).
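A minimal sketch of the suggested switch case might look as follows. The types are simplified stand-ins, not the actual `sweep` package definitions; the event list is abbreviated, and the `retried` counter and `handleBumpEventTxFailed` helper are hypothetical placeholders for whatever the existing `TxFailed` path does.

```go
package main

import "fmt"

// BumpEvent is a simplified stand-in for the fee bumper's event type.
type BumpEvent int

const (
	TxPublished BumpEvent = iota
	TxFailed
	TxConfirmed
	TxUnknownSpend
)

// BumpResult is a simplified stand-in carrying only the event.
type BumpResult struct {
	Event BumpEvent
}

// sweeper is a toy sweeper that counts how often the failure path ran.
type sweeper struct {
	retried int
}

// handleBumpEventTxFailed is a placeholder for the existing failure handling.
func (s *sweeper) handleBumpEventTxFailed(_ *BumpResult) {
	s.retried++
}

// handleBumpEvent routes TxUnknownSpend through the same path as TxFailed,
// so the existing behavior is kept until dedicated handling is added.
func (s *sweeper) handleBumpEvent(r *BumpResult) error {
	switch r.Event {
	case TxFailed, TxUnknownSpend:
		s.handleBumpEventTxFailed(r)
		return nil

	case TxPublished, TxConfirmed:
		return nil

	default:
		return fmt.Errorf("unhandled bump event: %d", r.Event)
	}
}
```

Grouping the two events in one `case` keeps the change small and makes it obvious that no new behavior is introduced for `TxUnknownSpend`.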

itest/lnd_sweep_test.go

sweep/fee_bumper.go

yyforyongyu · 2025-02-07T14:17:08Z

> The first commit dcd5119 looks like it doesn't belong in this PR, since the added test isn't fixed until the next PR.

Moved to the next PR.

> Also TxUnknownSpend is currently unhandled in UtxoSweeper, which means things would break if we merged this PR alone. We should really add a minimal switch case to UtxoSweeper.handleBumpEvent that keeps the existing behavior for TxUnknownSpend (i.e. same as TxFailed case).

We can't merge this PR back to master alone though. The plan is to merge #9448 into this one, and then merge this one to master, otherwise the itests would fail. Maybe I should've just created one PR instead. I was thinking about reducing each PR's size, but the split could've been done better I guess.

morehouse

Code LGTM. Will wait to approve until tests pass.

I'll start looking at the next PR today.

Roasbeef

Reviewed 5 of 5 files at r1, 4 of 4 files at r2, all commit messages.
Reviewable status: all files reviewed, 9 unresolved discussions (waiting on @morehouse and @yyforyongyu)

lightninglabs-deploy · 2025-02-19T02:52:55Z

@morehouse: review reminder
@yyforyongyu, remember to re-request review from reviewers when ready

To track the input and its spending tx, which will be used later to detect unknown spends.

This commit refactors the `processRecords` to always handle the inputs spent when processing the records. We now make sure to handle unknown spends for all backends (previously only neutrino), and rely solely on the spending notification to give us the onchain status of inputs.

We now rename "third party" to "unknown" as the inputs can be spent via an older sweeping tx, a third party (anchor), or a remote party (pin). In fee bumper we don't have the info to distinguish the above cases, and leave them to be further handled by the sweeper as it has more context.

This commit adds a new field `InputsSpent` to the `BumpResult` so they can be used to track inputs spent by txns not recognized by the fee bumper.
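As a rough sketch of what tracking unknown spends could look like, the function below filters a map of spent inputs down to those whose spender is not the record's own sweeping tx. The `OutPoint` type, the string tx IDs, and the `unknownSpends` helper are all assumptions for illustration; the real `InputsSpent` field and its element types are not shown in this thread.

```go
package main

// OutPoint is a simplified stand-in for the real outpoint type.
type OutPoint struct {
	TxID  string
	Index uint32
}

// unknownSpends takes the ID of the sweeping tx the fee bumper is
// monitoring and a map from each spent input to the ID of the tx that
// spent it. It returns only the inputs spent by some other tx, which the
// fee bumper cannot attribute and therefore reports as "unknown".
func unknownSpends(ourTxID string,
	spent map[OutPoint]string) map[OutPoint]string {

	unknown := make(map[OutPoint]string)
	for op, spender := range spent {
		// Spent by the tx we are monitoring, nothing to report.
		if spender == ourTxID {
			continue
		}

		unknown[op] = spender
	}

	return unknown
}
```

The fee bumper only answers "was this spent by the tx I know about?". Deciding whether the other spender is an older sweeping tx, a third party, or the remote party is left to the sweeper, which has more context.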

We now start handling `TxUnknownSpend` in our sweeper to make sure the failed inputs are retried when possible.

This is a minor refactor so the `createAndPublishTx` flow becomes more clear, also prepares for the following commit where we start to handle missing inputs.

A minor refactor to break the method `handleUnknownSpent` into two steps, which prepares the following commit where we start handling missing inputs.

This commit refactors `handleInitialTxError` and `createAndCheckTx` to take a `monitorRecord` param, which prepares for the following commit where we start handling missing inputs.

This commit handles the case when the input is missing during the RBF process, which could happen when the bumped tx has inputs being spent by a third party. Normally we should be able to catch the spend early via the spending notification and never attempt to fee bump the record. However, due to the possible race between block notification and spend notification, this cannot be guaranteed. Thus, we need to handle the case during the RBF when seeing an `ErrMissingInputs`, which can only happen when the inputs are spent by others.
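The fallback described here can be sketched as an error classification step after a replacement attempt. This is an illustration under assumptions: `ErrMissingInputs` below is a local sentinel standing in for the real error, and `publishReplacement`, `classifyPublishError`, and the `outcome` values are hypothetical names that do not come from the PR.

```go
package main

import (
	"errors"
	"fmt"
)

// ErrMissingInputs is a local stand-in for the error a chain backend
// returns when a tx spends inputs that are no longer available.
var ErrMissingInputs = errors.New("missing inputs")

// outcome describes what the fee bumper should report after a publish.
type outcome int

const (
	outcomePublished outcome = iota
	outcomeUnknownSpend
	outcomeFailed
)

// publishReplacement is a fake broadcaster. It fails with a wrapped
// ErrMissingInputs if any input is already in the spent set.
func publishReplacement(inputs []string, spent map[string]bool) error {
	for _, in := range inputs {
		if spent[in] {
			return fmt.Errorf("input %s: %w", in, ErrMissingInputs)
		}
	}

	return nil
}

// classifyPublishError maps a publish error to an outcome. A missing-inputs
// error means the spend notification lost the race, so it is treated as an
// unknown spend rather than a generic failure.
func classifyPublishError(err error) outcome {
	switch {
	case err == nil:
		return outcomePublished

	case errors.Is(err, ErrMissingInputs):
		return outcomeUnknownSpend

	default:
		return outcomeFailed
	}
}
```

Using `errors.Is` rather than direct comparison means the check still works when the backend error has been wrapped with extra context along the way.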

This commit adds the failed tx to the result when marking the input as fatal, which is used in the commit resolver when handling revoked outputs.

Previously, when a given input was found spent in the mempool, we'd mark it as Published and never offer it to the fee bumper. This is dangerous as the input will never be fee bumped. We now fix it by always initializing the input with state Init, and only use the mempool to check for fee and fee rate. This changes the current restart behavior. Previously, if a sweeping tx was broadcast and the node then shut down, on restart the input would be offered to the sweeper again but not to the fee bumper, which means the sweeping tx would stay in the mempool with the last-tried fee rate. After this change, the input will be swept again after a restart, and the fee bumper will monitor its status. The restart will also behave like a fee bump if there's already an existing sweeping tx in the mempool.
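The new initialization rule can be sketched as follows. This is a simplified illustration, not the sweeper's actual code: `SweepState`, `mempoolSpend`, `sweepInput`, and `newSweepInput` are hypothetical names, and the fee rate unit is arbitrary.

```go
package main

// SweepState is a simplified stand-in for the sweeper's input state.
type SweepState int

const (
	Init SweepState = iota
	Published
)

// mempoolSpend describes an existing sweeping tx found in the mempool.
type mempoolSpend struct {
	FeeRate int64
}

// sweepInput is a toy version of the sweeper's per-input bookkeeping.
type sweepInput struct {
	State           SweepState
	StartingFeeRate int64
}

// newSweepInput always starts the input in state Init so that it is
// offered to the fee bumper. A mempool spend, if any, only informs the
// starting fee rate and never changes the initial state.
func newSweepInput(spend *mempoolSpend) sweepInput {
	in := sweepInput{State: Init}
	if spend != nil {
		in.StartingFeeRate = spend.FeeRate
	}

	return in
}
```

Since the state no longer depends on what is found in the mempool, an input whose sweeping tx was broadcast before a shutdown goes through the same path as a fresh input after restart, and the fee bumper picks it up again.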

So we can focus on testing normal flow vs persistence flow.

Before this commit, the only error returned from `IsOurTx` is when the root bucket was not created. In that case, we should consider the tx to be not found in our db, since technically our db is empty. A future PR may consider treating our wallet as the single source of truth and query the wallet instead to check for past sweeping txns.
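The reasoning above can be sketched with a toy store in which a nil map plays the role of the missing root bucket. The `sweepStore` type, the `errNoBucket` sentinel, and the `lookup` helper are assumptions for illustration; the real store and its signatures are not shown in this thread.

```go
package main

import "errors"

// errNoBucket is a hypothetical sentinel for "root bucket not created".
var errNoBucket = errors.New("sweep bucket not created")

// sweepStore is a toy store. A nil map models a db whose root bucket has
// not been created yet.
type sweepStore struct {
	txns map[string]struct{}
}

// lookup mirrors the old behavior: the only error it can return is the
// missing-bucket error.
func (s *sweepStore) lookup(txid string) (bool, error) {
	if s.txns == nil {
		return false, errNoBucket
	}

	_, ok := s.txns[txid]

	return ok, nil
}

// IsOurTx reports whether the tx was recorded as one of our sweeps. A
// missing bucket means the db is empty, so the tx is simply not found.
func (s *sweepStore) IsOurTx(txid string) bool {
	found, err := s.lookup(txid)
	if errors.Is(err, errNoBucket) {
		return false
	}

	return found
}
```

Callers then only need to branch on a boolean instead of handling an error that can never indicate anything other than an empty database.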

morehouse

LGTM

yyforyongyu · 2025-02-20T16:08:59Z

check commits failed with no space left error again, weird.

morehouse · 2025-02-20T16:14:01Z

It would be good to figure out why it keeps doing that.

But IMO this PR is still good to go. I did a cursory check that commits appear in the same order as on #9448, and the only diff is the final commit added to satisfy the linter: 9f7e2bf.

guggero · 2025-02-20T17:02:21Z

> It would be good to figure out why it keeps doing that.

I think it's just that the available space on the GitHub runners is very low. So it's probably just the build cache getting too large with all the different commits being compiled one-by-one.
Perhaps we shouldn't also use the GitHub cache feature, as that will stack up even more as it then combines the build caches from multiple runs.

I cleaned the GitHub cache and re-ran the step.

guggero · 2025-02-20T17:17:21Z

> It would be good to figure out why it keeps doing that.
>
> I think it's just that the available space on the GitHub runners is very low. So it's probably just the build cache getting too large with all the different commits being compiled one-by-one. Perhaps we shouldn't also use the GitHub cache feature, as that will stack up even more as it then combines the build caches from multiple runs.
>
> I cleaned the GitHub cache and re-ran the step.

Actually, turns out we're caching things twice, since actions/setup-go now automatically caches the build and module cache.
Fixing that in #9535, so this should become even less likely.

Roasbeef

LGTM

Reviewed 6 of 12 files at r3.
Reviewable status: 7 of 13 files reviewed, 9 unresolved discussions (waiting on @morehouse and @yyforyongyu)

yyforyongyu added the utxo sweeping, no-itest, no-changelog, and size/micro labels Jan 26, 2025

yyforyongyu added this to the v0.19.0 milestone Jan 26, 2025

yyforyongyu mentioned this pull request Jan 26, 2025

sweep: properly handle failed sweeping txns #9448

Merged

yyforyongyu force-pushed the yy-prepare-fee-replace branch from e8cb0c7 to a738e7f Compare January 27, 2025 09:51

yyforyongyu force-pushed the yy-sweeper-fix branch from 48d4631 to bd0f218 Compare January 27, 2025 09:51

Roasbeef reviewed Feb 4, 2025

yyforyongyu force-pushed the yy-prepare-fee-replace branch from a738e7f to b98542b Compare February 5, 2025 11:53

yyforyongyu force-pushed the yy-sweeper-fix branch 2 times, most recently from 0ca8914 to 18df4fb Compare February 5, 2025 12:49

yyforyongyu requested review from Roasbeef and morehouse February 5, 2025 13:15

yyforyongyu changed the base branch from yy-prepare-fee-replace to master February 5, 2025 14:49

yyforyongyu force-pushed the yy-sweeper-fix branch from 18df4fb to af48039 Compare February 5, 2025 14:50

morehouse reviewed Feb 6, 2025

saubyk assigned yyforyongyu Feb 7, 2025

yyforyongyu force-pushed the yy-sweeper-fix branch 2 times, most recently from cb67094 to e2a7210 Compare February 7, 2025 14:14

yyforyongyu force-pushed the yy-sweeper-fix branch from e2a7210 to 7cdf369 Compare February 7, 2025 14:23

morehouse reviewed Feb 7, 2025

Roasbeef requested a review from morehouse February 12, 2025 02:46

Roasbeef reviewed Feb 12, 2025

yyforyongyu force-pushed the yy-sweeper-fix branch 2 times, most recently from cfbf023 to 53f84ed Compare February 13, 2025 15:14

yyforyongyu removed the no-changelog label Feb 20, 2025

yyforyongyu added 18 commits February 20, 2025 14:40

sweep: add method getSpentInputs

8c9ba32

To track the input and its spending tx, which will be used later to detect unknown spends.

sweep: add a new event TxUnknownSpend

61cec43

sweep: remove dead code and add better logging

121116c

itest: add fee replacement test

388183e

sweep: start tracking inputs spent by unknown tx

2f1205a

This commit adds a new field `InputsSpent` to the `BumpResult` so they can be used to track inputs spent by txns not recognized by the fee bumper.

sweep: retry sweeping inputs upon TxUnknownSpend

4281894

We now start handling `TxUnknownSpend` in our sweeper to make sure the failed inputs are retried when possible.

sweep: add method handleReplacementTxError

db8319d

This is a minor refactor so the `createAndPublishTx` flow becomes more clear, also prepares for the following commit where we start to handle missing inputs.

sweep: add createUnknownSpentBumpResult

f614e7a

A minor refactor to break the method `handleUnknownSpent` into two steps, which prepares the following commit where we start handling missing inputs.

sweep: refactor handleInitialTxError and createAndCheckTx

4f469de

This commit refactors `handleInitialTxError` and `createAndCheckTx` to take a `monitorRecord` param, which prepares for the following commit where we start handling missing inputs.

sweep: signal tx in markInputFatal

4bd1a34

This commit adds the failed tx to the result when marking the input as fatal, which is used in the commit resolver when handling revoked outputs.

itest: split up force close tests

c61f781

So we can focus on testing normal flow vs persistence flow.

docs: add release notes

8d49246

sweep: fix error logging

7ab0e15

yyforyongyu force-pushed the yy-sweeper-fix branch from 53f84ed to 7ab0e15 Compare February 20, 2025 06:42

contractcourt: fix errorlint

9f7e2bf

yyforyongyu force-pushed the yy-sweeper-fix branch from f8698a1 to 9f7e2bf Compare February 20, 2025 15:14

morehouse approved these changes Feb 20, 2025

Roasbeef requested a review from morehouse February 21, 2025 00:53

Roasbeef approved these changes Feb 21, 2025

Roasbeef merged commit 553899b into lightningnetwork:master Feb 21, 2025
31 of 34 checks passed
