rustc_query_system: reduce dependency graph memory usage #79589

tgnottingham · 2020-12-01T08:22:59Z

This change implements, at a high level, two space optimizations to the dependency graph.

The first optimization is sharing graph data with the previous dependency graph. Whenever we intern a node, we know whether that node is new (not in the previous graph) or not, and if not, the color of the node in the previous graph.

Red and green nodes have their DepNode present in the previous graph, so for that piece of node data, we can just store the index of the node in the previous graph rather than duplicate the DepNode. Green nodes additionally have the same result Fingerprint, so we can avoid duplicating that too. Finally, we distinguish between "light" and "dark" green nodes, where the latter are nodes that were marked green because all of their dependencies were marked green. These nodes can additionally share edges with the previous graph, because we know that their set of dependencies is the same (technically, light green and red nodes can have the same dependencies too, but we don't try to figure out whether or not that's the case).
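To make the sharing scheme concrete, here is a minimal sketch of how the four node kinds could be represented. It is not the code in this PR: `PrevGraph`, `NodeData`, the field names, and the simplified `DepNode` and `Fingerprint` stand-ins are all assumptions made for illustration, and the sketch does not model how edge indices relate between the previous and current graphs.

```rust
// Illustrative stand-ins; the real DepNode and Fingerprint types are richer.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub struct DepNode(pub u64);

#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub struct Fingerprint(pub u64, pub u64);

/// Hypothetical view of the previous session's graph, indexed by node index.
pub struct PrevGraph {
    pub nodes: Vec<DepNode>,
    pub fingerprints: Vec<Fingerprint>,
    pub edges: Vec<Vec<u32>>,
}

/// Hypothetical per-node record in the current graph.
pub enum NodeData {
    /// Not in the previous graph: everything is stored here.
    New { node: DepNode, fingerprint: Fingerprint, edges: Vec<u32> },
    /// In the previous graph but the result changed: share only the DepNode.
    Red { prev_index: u32, fingerprint: Fingerprint, edges: Vec<u32> },
    /// Same result as before: share the DepNode and the Fingerprint.
    LightGreen { prev_index: u32, edges: Vec<u32> },
    /// Green because all dependencies were green: share the edges as well.
    DarkGreen { prev_index: u32 },
}

impl NodeData {
    /// Looks up the DepNode, going through the previous graph when shared.
    pub fn dep_node(&self, prev: &PrevGraph) -> DepNode {
        match self {
            NodeData::New { node, .. } => *node,
            NodeData::Red { prev_index, .. }
            | NodeData::LightGreen { prev_index, .. }
            | NodeData::DarkGreen { prev_index } => prev.nodes[*prev_index as usize],
        }
    }

    /// Looks up the result Fingerprint; green nodes reuse the previous one.
    pub fn fingerprint(&self, prev: &PrevGraph) -> Fingerprint {
        match self {
            NodeData::New { fingerprint, .. } | NodeData::Red { fingerprint, .. } => *fingerprint,
            NodeData::LightGreen { prev_index, .. } | NodeData::DarkGreen { prev_index } => {
                prev.fingerprints[*prev_index as usize]
            }
        }
    }

    /// Looks up the edges; only dark green nodes reuse the previous ones.
    pub fn edges<'a>(&'a self, prev: &'a PrevGraph) -> &'a [u32] {
        match self {
            NodeData::New { edges, .. }
            | NodeData::Red { edges, .. }
            | NodeData::LightGreen { edges, .. } => edges,
            NodeData::DarkGreen { prev_index } => &prev.edges[*prev_index as usize],
        }
    }
}
```

In this sketch a dark green node stores only a 4-byte index, while a new node stores its full data.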

Also, some effort is made to pack data tightly, and to avoid storing DepNodes as map keys more than once.

The second optimization is storing edges in a more compact representation, as in the SerializedDepGraph: in a single vector, rather than in one EdgesVec per node. An EdgesVec is a SmallVec with an inline buffer for 8 elements. Each EdgesVec is at least 40 bytes, and has a per-node overhead of up to 40 bytes. In the ideal case of exactly 8 edges, 32 bytes are used for edges and the overhead is 8 bytes. But most of the time, the overhead is higher.
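The arithmetic for the inline case can be written out as a small helper. This is only a worked calculation from the figures above (40 bytes per EdgesVec, 4 bytes per edge, 8 inline slots); the function and constant names are made up for illustration, and the heap-spilled case is deliberately left unmodeled.

```rust
const EDGES_VEC_BYTES: usize = 40; // minimum EdgesVec size stated above
const EDGE_BYTES: usize = 4; // 32 bytes for 8 edges
const INLINE_CAPACITY: usize = 8; // inline buffer length

/// Bytes of an inline EdgesVec that are not occupied by edge data.
/// Returns None once the edges no longer fit inline (not modeled here).
pub fn inline_overhead(edge_count: usize) -> Option<usize> {
    if edge_count <= INLINE_CAPACITY {
        Some(EDGES_VEC_BYTES - edge_count * EDGE_BYTES)
    } else {
        None
    }
}
```

For example, a node with 3 edges uses 12 bytes for edge data and leaves 28 bytes of overhead, and a node with no edges leaves the full 40 bytes unused.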

In contrast, using a single vector to store all edges, and having each node specify its start and end elements as 4 byte indices into the vector has a constant overhead of 8 bytes--the best case scenario for the per-node EdgesVec approach.
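A minimal sketch of the single-vector layout follows. The names `FlatEdges`, `push_node`, and `edges_of` are hypothetical and do not come from this PR; the point is only that each node carries two 4-byte indices into one shared edge list.

```rust
/// All edges for all nodes, stored contiguously (hypothetical type).
#[derive(Default)]
pub struct FlatEdges {
    /// Edge targets for every node, back to back.
    pub edge_list: Vec<u32>,
    /// Per node: (start, end) positions in `edge_list`, 4 bytes each.
    pub ranges: Vec<(u32, u32)>,
}

impl FlatEdges {
    /// Appends one node's edges and returns that node's index.
    pub fn push_node(&mut self, edges: &[u32]) -> usize {
        let start = self.edge_list.len() as u32;
        self.edge_list.extend_from_slice(edges);
        let end = self.edge_list.len() as u32;
        self.ranges.push((start, end));
        self.ranges.len() - 1
    }

    /// Returns the edges of `node` as a slice of the shared list.
    pub fn edges_of(&self, node: usize) -> &[u32] {
        let (start, end) = self.ranges[node];
        &self.edge_list[start as usize..end as usize]
    }
}
```

The per-node cost is `size_of::<(u32, u32)>()`, which is 8 bytes regardless of how many edges the node has.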

The downside of this approach is that EdgesVecs built up during query execution have to be copied into the vector, whereas before, we could just take ownership of them. However, we mostly make up for this because the single vector representation enables a more efficient implementation of DepGraph::serialize.
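The following sketch illustrates why a flat layout can make serialization cheaper. It is an assumption-laden illustration, not DepGraph::serialize itself: `SerializedEdges`, `serialize_per_node`, and `serialize_flat` are hypothetical names, and the output layout (per-node ranges plus one edge list) is assumed for the example.

```rust
/// Output layout assumed for illustration: per-node ranges plus one edge list.
#[derive(Debug, PartialEq, Eq)]
pub struct SerializedEdges {
    pub ranges: Vec<(u32, u32)>,
    pub edge_list: Vec<u32>,
}

/// Per-node storage: serialization walks every node and copies its edges.
pub fn serialize_per_node(per_node: &[Vec<u32>]) -> SerializedEdges {
    let total: usize = per_node.iter().map(Vec::len).sum();
    let mut out = SerializedEdges {
        ranges: Vec::with_capacity(per_node.len()),
        edge_list: Vec::with_capacity(total),
    };
    for edges in per_node {
        let start = out.edge_list.len() as u32;
        out.edge_list.extend_from_slice(edges);
        out.ranges.push((start, out.edge_list.len() as u32));
    }
    out
}

/// Flat storage: the data is already in the output layout, so two bulk
/// copies are enough.
pub fn serialize_flat(ranges: &[(u32, u32)], edge_list: &[u32]) -> SerializedEdges {
    SerializedEdges { ranges: ranges.to_vec(), edge_list: edge_list.to_vec() }
}
```

Both functions produce the same output; the flat version simply avoids the per-node loop.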

rust-highfive · 2020-12-01T08:23:02Z

r? @lcnr

(rust-highfive has picked a reviewer for you, use r? to override)

tgnottingham · 2020-12-01T08:24:37Z

@rustbot label T-compiler A-incr-comp I-compilemem I-compiletime

tgnottingham · 2020-12-01T08:27:32Z

By the way, reviewing this is probably best done commit-by-commit, starting with the comments and structs near DepNodeData.

compiler/rustc_query_system/src/dep_graph/graph.rs

lcnr · 2020-12-01T09:53:00Z

@bors try @rust-timer queue

rust-timer · 2020-12-01T09:53:01Z

Awaiting bors try build completion

bors · 2020-12-01T09:53:11Z

⌛ Trying commit 07d6913867813e544e465a7a86664da43b2f855d with merge 2dcb181386ce78b9432c26d19dfe249ade022c5a...

bors · 2020-12-01T10:32:38Z

☀️ Try build successful - checks-actions
Build commit: 2dcb181386ce78b9432c26d19dfe249ade022c5a

rust-timer · 2020-12-01T10:32:40Z

Queued 2dcb181386ce78b9432c26d19dfe249ade022c5a with parent c4926d0, future comparison URL.

rust-timer · 2020-12-01T12:36:49Z

Finished benchmarking try commit (2dcb181386ce78b9432c26d19dfe249ade022c5a): comparison url.

Benchmarking this pull request likely means that it is perf-sensitive, so we're automatically marking it as not fit for rolling up. Please note that if the perf results are neutral, you should likely undo the rollup=never given below by specifying rollup- to bors.

Importantly, though, if the results of this run are non-neutral do not roll this PR up -- it will mask other regressions or improvements in the roll up.

@bors rollup=never
@rustbot modify labels: +S-waiting-on-review -S-waiting-on-perf

lcnr · 2020-12-01T14:24:47Z

This looks quite good to me, especially the 10% rss improvements on some real world benchmarks. I don't know enough about the internals of the query system to review this myself.

r? @nikomatsakis for review or reassignment

jyn514 · 2020-12-01T15:56:31Z

cc also @cjgillot and @nnethercote

wesleywiser · 2020-12-01T17:52:14Z

cc @rust-lang/wg-incr-comp

tgnottingham · 2020-12-01T20:06:37Z

For the perf results, it's helpful to look at incr-full and incr-patched/unchanged results separately using the check boxes, as this change affects them differently.

Non-incremental builds shouldn't be affected much, so you can probably safely untick the full box and ignore the bootstrap timings (I assume the bootstrap benchmark doesn't use incremental).

I'm very happy with the results. You can see that incr-full takes a small hit to instruction count (but ignore the unused-warnings change -- that regularly varies by ~1% in my experience), but I'm working on a change to improve this. I'll probably leave it for a separate PR, since this one is big enough already.

tgnottingham · 2020-12-01T22:49:58Z

Rebased to rename EdgesIndex to EdgeIndex.

nnethercote · 2020-12-01T23:17:44Z

This is a rare example of a change where the cycles and wall-time improvements are significantly larger than the instruction counts improvements. Nice work!

tgnottingham · 2020-12-02T01:45:34Z

Thanks, wouldn't have known where to begin without Massif! 🙏

nikomatsakis · 2020-12-08T11:38:56Z

Apologies for the delay! I'll try to take a look soon!

rust-timer · 2020-12-23T01:36:35Z

Awaiting bors try build completion.

bors · 2020-12-23T01:36:44Z

⌛ Trying commit 03eb75f with merge 9efae6f05396b03e469da1db40b7eedb501e0697...

bors · 2020-12-23T02:23:17Z

☀️ Try build successful - checks-actions
Build commit: 9efae6f05396b03e469da1db40b7eedb501e0697

rust-timer · 2020-12-23T02:23:19Z

Queued 9efae6f05396b03e469da1db40b7eedb501e0697 with parent 969b42d, future comparison URL.

@rustbot label: +S-waiting-on-perf

rust-timer · 2020-12-23T10:53:15Z

Finished benchmarking try commit (9efae6f05396b03e469da1db40b7eedb501e0697): comparison url.

Benchmarking this pull request likely means that it is perf-sensitive, so we're automatically marking it as not fit for rolling up. Please note that if the perf results are neutral, you should likely undo the rollup=never given below by specifying rollup- to bors.

Importantly, though, if the results of this run are non-neutral do not roll this PR up -- it will mask other regressions or improvements in the roll up.

@bors rollup=never
@rustbot label: +S-waiting-on-review -S-waiting-on-perf

bjorn3 · 2020-12-23T11:00:05Z

Except for a single 2.6% regression for coercions-debug, this doesn't have any big regressions. incr-unchanged runs have huge wins of up to 2.4%.

michaelwoerister · 2020-12-23T14:29:52Z

> also added a change to fix a race condition.

@tgnottingham Can you elaborate on what the exact problem was there?

michaelwoerister · 2020-12-23T14:44:56Z

OK, I see now. Yes, it can't hurt to keep things locked throughout. Both methods are supposed to be called in situations where the dep-graph has already settled down and nothing new is added, but let's not rely on that invariant to be upheld.
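The exact race is not spelled out in this thread, so the following is only a generic sketch of the "keep things locked throughout" idea, using `std::sync::Mutex`. The `Graph` type and both method names are hypothetical.

```rust
use std::sync::Mutex;

/// Hypothetical stand-in for shared graph state.
pub struct Graph {
    pub data: Mutex<Vec<u32>>,
}

impl Graph {
    /// Racy shape: the lock is released between the two reads, so another
    /// thread could push in between and the length could disagree with the
    /// returned contents.
    pub fn snapshot_two_locks(&self) -> (usize, Vec<u32>) {
        let len = self.data.lock().unwrap().len();
        let items = self.data.lock().unwrap().to_vec();
        (len, items)
    }

    /// Safer shape: one guard is held for the whole operation, so both
    /// values come from the same state.
    pub fn snapshot_one_lock(&self) -> (usize, Vec<u32>) {
        let guard = self.data.lock().unwrap();
        (guard.len(), guard.to_vec())
    }
}
```

With a single guard, correctness no longer depends on callers only invoking the method after the graph has settled.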

michaelwoerister · 2020-12-23T14:46:04Z

Let's see if I have r+ rights currently: @bors r+

bors · 2020-12-23T14:46:06Z

@michaelwoerister: 🔑 Insufficient privileges: Not in reviewers

michaelwoerister · 2020-12-23T14:46:38Z

Nope... @nikomatsakis do you want to give the r+?

lqd · 2020-12-23T16:18:23Z

@bors r=michaelwoerister

bors · 2020-12-23T16:18:25Z

📌 Commit 03eb75f has been approved by michaelwoerister

bors · 2020-12-24T01:06:41Z

⌛ Testing commit 03eb75f with merge 49b3151...

bors · 2020-12-24T04:02:16Z

☀️ Test successful - checks-actions
Approved by: michaelwoerister
Pushing 49b3151 to master...

rust-highfive assigned lcnr Dec 1, 2020

rust-highfive added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Dec 1, 2020

rust-highfive assigned nikomatsakis and unassigned lcnr Dec 1, 2020

jyn514 added the A-query-system Area: The rustc query system (https://rustc-dev-guide.rust-lang.org/query.html) label Dec 1, 2020

tgnottingham force-pushed the shared_dep_graph branch from 07d6913 to 2e64e80 Compare December 1, 2020 22:48

rustbot added the S-waiting-on-perf Status: Waiting on a perf run to be completed. label Dec 23, 2020

rustbot added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. and removed S-waiting-on-perf Status: Waiting on a perf run to be completed. labels Dec 23, 2020

bors added the merged-by-bors This PR was explicitly merged by bors. label Dec 24, 2020

bors merged commit 49b3151 into rust-lang:master Dec 24, 2020

rustbot added this to the 1.50.0 milestone Dec 24, 2020

tgnottingham deleted the shared_dep_graph branch January 20, 2021 18:32

tgnottingham mentioned this pull request Jan 20, 2021

[do not merge] Remove PackedFingerprint #81230

Closed

tgnottingham mentioned this pull request Jan 21, 2021

Use PackedFingerprint in DepNode to reduce memory consumption #78646

Merged
