-
Notifications
You must be signed in to change notification settings - Fork 1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Kademlia bootstrap gets stuck forever in some cases #5432
Comments
Here is Can be reproduced after multiple attempts with https://github.com/subspace/subspace/tree/dsn-sync-getting-stuck-wip by running:
|
Tried to race bootstrapping with a simple tokio timer future and wsa not able to reproduce this after countless attempts, while it doesn't take too many attempts to reproduce otherwise 🤔 |
Interesting, it does look related. I hope it will be merged soon, subscribed. Thanks! |
Let's close, I'll let you know if the issue remains after we upgrade to fixed version at Subspace. |
We were getting user reports (and I reproduced it myself a few times) that sync in our protocol gets stuck forever. Implementation-wise it is based on libp2p and I have now tried it many times and discovered that sometimes Kademlia bootstrap gets stuck.
Specifically logs look like this:
Successful bootstrapping takes ~3 minutes and looks like this:
bootstrap_step
above corresponds toKademliaEvent::OutboundQueryProgressed
events and ifKademliaEvent::OutboundQueryProgressed.step.last == true
is calledBootstrap finished
is printed.I believe Yamux error is not related to this, we're seeing them periodically and they don't seem to break any protocols fundamentally.
Now I'm wondering if it is related in any way to #5418 somehow due to underlying lookups done in both cases.
libp2p-kad 0.45.3 (latest at the moment of writing),
disjoint_query_paths: true
The text was updated successfully, but these errors were encountered: