
Commit 2e38275

When streaming, assume network latency is >=1ms (#259)
## What is the goal of this PR?

When streaming (e.g. `match` queries), we now assume the network latency is >=1ms, and have the server compensate accordingly. This sharply improves performance (up to 3x) when fetching answers from a server hosted at `localhost`.

## What are the changes implemented in this PR?

Streaming from `localhost` with a per-answer performance profiler attached reveals that answers arrive in "batches" of size 50: the first 50 answers arrive in, say, 0.005s, then there is a 0.005s gap, then another 50 answers arrive, then another 0.005s gap, and so on. This pointed to a bug in the implementation of prefetch size, which we have now tracked down. It manifests when connecting to `localhost`, and occurs due to the following logical flaw.

Answers are streamed from the server in batches of size N (where N = `prefetch_size`, default 50), to prevent the server doing unnecessary work in case the client does not end up consuming all answers. Once the client has seen N answers, it sends a "CONTINUE" request asking the server to keep streaming. However, while the Nth answer is in flight to the client, and while the server is waiting to receive the CONTINUE request, the streaming should actually continue; if it does not, we end up with "wasted time" where the server waits and sends nothing. The server must therefore predict, to the best of its ability, when the client will send the next CONTINUE - typically one _network latency_ after the Nth answer is sent.

When connecting to `localhost`, however, the measured network latency is 0, even though it is physically impossible for the client to respond at the exact moment the server sends the Nth answer. `localhost` is an edge case that was previously unhandled. To mitigate the problem, we now coerce the measured network latency to be at least 1ms.
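As a minimal sketch of the batching flow described above (the names `stream_answers`, `send_continue`, `is_done`, and `answer` are hypothetical stand-ins, not the actual typedb-client API), the client consumes answers and requests continuation after each full batch; the server must anticipate that request by roughly one network latency or it sits idle:

```python
PREFETCH_SIZE = 50  # N: the server's default batch size

def stream_answers(stream):
    """Yield answers from a hypothetical batched gRPC stream."""
    received_in_batch = 0
    for res_part in stream:
        if res_part.is_done():
            return
        yield res_part.answer
        received_in_batch += 1
        if received_in_batch == PREFETCH_SIZE:
            # Ask the server for the next batch. While this request is in
            # flight, the server should already be streaming ahead - which
            # is why it must estimate the network latency (now >= 1ms).
            stream.send_continue()
            received_in_batch = 0
```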
1 parent f3f7f1d commit 2e38275

File tree

1 file changed: +1 -1


typedb/connection/session.py

Lines changed: 1 addition & 1 deletion
```diff
@@ -54,7 +54,7 @@ def __init__(self, client: "_TypeDBClientImpl", database: str, session_type: Ses
         start_time = time.time() * 1000.0
         res = self._stub().session_open(session_open_req(database, session_type.proto(), options.proto()))
         end_time = time.time() * 1000.0
-        self._network_latency_millis = int(end_time - start_time - res.server_duration_millis)
+        self._network_latency_millis = max(int(end_time - start_time - res.server_duration_millis), 1)
         self._session_id = res.session_id
         self._is_open = AtomicBoolean(True)
```
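To illustrate why the clamp matters, consider hypothetical `localhost` timings (all numbers below are made up): the round trip is dominated by server processing time, so the measured latency truncates to zero without the clamp.

```python
# Hypothetical timings in milliseconds - not real measurements.
start_time = 1000.0           # just before session_open is sent
end_time = 1002.4             # just after the response arrives (2.4ms round trip)
server_duration_millis = 2.1  # processing time reported by the server

# Before this commit: int(0.3) truncates to 0, so the server assumed the
# client's CONTINUE would arrive instantly and paused after every batch.
latency = int(end_time - start_time - server_duration_millis)  # 0

# After this commit: the latency is clamped, so the server streams ahead
# for at least 1ms past each batch boundary instead of going idle.
latency = max(latency, 1)  # 1
```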

0 commit comments
