core: DelayedStream cancels provided stream if not using it. #2618

zhangkun83 · 2017-01-18T19:34:31Z

Resolves #1537

Also disallow cancel() before start().
DelayedClientTransport.shutdownNow() races with stream start(), so it
shouldn't call cancel() directly. Instead, it delays the cancellation
until the stream is started.


carl-mastrangelo · 2017-01-20T21:30:05Z

@ejona86 can you look at this? It is blocking 1.1.

ejona86

Thanks. This looks great.

ejona86 · 2017-02-01T22:30:51Z

core/src/main/java/io/grpc/internal/DelayedClientTransport.java

+        started = true;
+        savedPendingCancelReason = pendingCancelReason;
+      }
+      super.start(listener);


Could you add a comment explaining that this doesn't do any I/O if savedPendingCancelReason != null, since there won't be a real transport/stream?

Why does this matter?

If we're immediately cancelling, it seems pretty weak to start if it is going to do I/O. Doing unnecessary I/O in the presence of cancellation is generally a recipe for cascading failure. There are two important things here though: 1) the cancellation involved here is expected to be rare and to impact only a few streams, since it is only due to a race between the stream creation and the delayed client transport shutdown, and 2) even when it happens, no I/O is done.
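The deferral described in this thread can be sketched roughly as follows. This is a minimal illustration, not the code in DelayedClientTransport: `SketchStream`, `RecordingStream`, `PendingStreamSketch`, and `cancelBecauseOfEarlyShutdown` are hypothetical names, and the listener and cancel reason are plain strings instead of gRPC types. Unlike the diff above, the sketch sets `started` after the delegate's start(), because the stand-in stream has no buffering of its own.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for a client stream; not the real gRPC interface.
interface SketchStream {
  void start(String listener);
  void cancel(String reason);
}

// Records calls so the ordering of start() and cancel() can be inspected.
class RecordingStream implements SketchStream {
  final List<String> calls = new ArrayList<>();

  @Override public void start(String listener) { calls.add("start"); }

  @Override public void cancel(String reason) { calls.add("cancel:" + reason); }
}

// A pending stream that tolerates a shutdown racing with start().
class PendingStreamSketch {
  private final SketchStream delegate;
  private final Object lock = new Object();
  private boolean started;
  private String pendingCancelReason;

  PendingStreamSketch(SketchStream delegate) {
    this.delegate = delegate;
  }

  // Called on shutdown; may run before, during, or after start().
  void cancelBecauseOfEarlyShutdown(String reason) {
    synchronized (lock) {
      if (!started) {
        // Too early to cancel: remember the reason and let start() apply it.
        pendingCancelReason = reason;
        return;
      }
    }
    delegate.cancel(reason);
  }

  void start(String listener) {
    delegate.start(listener);
    String savedPendingCancelReason;
    synchronized (lock) {
      started = true;
      savedPendingCancelReason = pendingCancelReason;
    }
    if (savedPendingCancelReason != null) {
      // The cancellation requested before start() is applied only now.
      delegate.cancel(savedPendingCancelReason);
    }
  }
}
```

With this ordering the delegate never sees cancel() before start(), whichever thread wins the race.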

ejona86 · 2017-02-01T23:33:36Z

core/src/main/java/io/grpc/internal/DelayedStream.java

+      // ClientStream.cancel() must be called after start()
+      stream.start(NOOP_STREAM_LISTENER);
+      if (savedError != null) {
+        stream.cancel(savedError);


Let's not cancel with savedError. We're cancelling the stream because of the incorrect API usage, not because of a previous error.

At that point error itself can be deleted.

The misuse case is calling setStream() twice, which is the else branch.
This branch is for calling setStream() after cancel(), which is legit, and I think the real stream should be cancelled with savedError which was used to cancel the delayed stream.

I think the real stream should be cancelled with savedError which was used to cancel the delayed stream.

That probably doesn't matter at all. A simple Status.CANCELLED.withDescription("cancelled before setStream") or "the stream was only started so it could be cancelled" would seem fine to me. The application won't end up seeing the status. When debugging, I would probably prefer to see the hard-coded status instead of savedError as well, since the reason for the cancel is the limited API that required the start() in the first place.

This branch is for calling setStream() after cancel(), which is legit

Hmm... If it is legit, we have to make sure it doesn't happen frequently and could only impact a few streams. And it seems that doesn't hold for MetadataApplierImpl and may not hold for DelayedClientTransport{,2}. That means the fix could actually cause stability problems whereas the current code isn't causing any. Hmm...

For my own record, the issue is that setStream() may be scheduled for an unbounded period of time into the future. For example, CallCredentials may take a long time to fetch the credentials before triggering setStream() (in MetadataApplierImpl), prior to which the RPC may be cancelled due to deadline-exceeded. In those cases, real streams are created before setStream() is called, only to be started (as required by cancel()) and then cancelled. start() typically involves I/O, e.g., sending headers, which is wasted. If the delay before setStream() is long enough, this will have a high chance of happening, unlike typical race conditions with low likelihood.
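To make the setStream()-after-cancel() path in this thread concrete, here is a rough sketch. It is not the actual DelayedStream: `CountingRealStream`, `DelayedStreamSketch`, and `UNUSED_STATUS` are hypothetical, statuses and listeners are plain strings, and the sketch assumes start() is always called before cancel() or setStream(). It follows the reviewer's suggestion of a hard-coded status instead of savedError, and it treats both the cancelled case and the double-setStream() case as "not using the stream".

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for the real stream; start() is where header I/O would happen.
class CountingRealStream {
  final List<String> calls = new ArrayList<>();

  void start(String listener) { calls.add("start(" + listener + ")"); }

  void cancel(String status) { calls.add("cancel(" + status + ")"); }
}

class DelayedStreamSketch {
  static final String NOOP_LISTENER = "noop";
  static final String UNUSED_STATUS = "CANCELLED: cancelled before setStream";

  private String listener;
  private String savedError; // non-null once the delayed stream was cancelled
  private CountingRealStream realStream;

  synchronized void start(String listener) {
    this.listener = listener;
  }

  synchronized void cancel(String reason) {
    if (listener == null) {
      throw new IllegalStateException("cancel() before start()");
    }
    if (realStream != null) {
      realStream.cancel(reason);
    } else if (savedError == null) {
      savedError = reason;
    }
  }

  synchronized void setStream(CountingRealStream stream) {
    if (listener == null) {
      throw new IllegalStateException("sketch assumes start() runs first");
    }
    if (savedError != null || realStream != null) {
      // The provided stream will not be used. cancel() is only allowed after
      // start(), so the stream is started just to be cancelled.
      stream.start(NOOP_LISTENER);
      stream.cancel(UNUSED_STATUS);
      return;
    }
    realStream = stream;
    stream.start(listener);
  }
}
```

Every stream handed to setStream() after a cancel() pays for one start() call that is immediately thrown away, which is the cost the reviewers worry could occur en masse when setStream() is delayed for a long time.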

ejona86

I don't feel comfortable yet with the start-to-just-cancel of the real streams currently present, because it seems it could happen en masse.


zhangkun83 · 2017-02-07T22:00:50Z

As discussed in #1537, putting this on-hold.

core: DelayedStream cancels provided stream if not using it.

59b5e61


zhangkun83 assigned ejona86 Jan 18, 2017

zhangkun83 added 2 commits January 18, 2017 12:38

Call out the threading for cancel().

e4c6878

Merge branch 'master' into delayedstream_cancel

fb0fb96

ejona86 approved these changes Feb 1, 2017


ejona86 requested changes Feb 6, 2017


zhangkun83 mentioned this pull request Feb 7, 2017

DelayedStream.setStream() should cancel the provided stream if not using it #1537

Open

zhangkun83 closed this Feb 7, 2017

lock bot locked as resolved and limited conversation to collaborators Jan 21, 2019
