-
Notifications
You must be signed in to change notification settings - Fork 273
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Dominator nodes only store immediate dominator #5137
base: develop
Are you sure you want to change the base?
Dominator nodes only store immediate dominator #5137
Conversation
5e1537f
to
0cc530a
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Getting better -- main missing bit is the postorder numbering; rather than force cfgt
into a particular order we should probably just assign postorder numbers in cfg_dominators_templatet::nodet
and use that number for the intersect algorithm (which depends on being able to judge topological ordering quickly by comparing numbers)
@@ -467,7 +467,7 @@ static java_bytecode_convert_methodt::method_offsett get_common_dominator( | |||
const auto &dominator_nodeidx= | |||
dominator_analysis.cfg.entry_map.at(v->var.start_pc); | |||
const auto &this_var_doms= | |||
dominator_analysis.cfg[dominator_nodeidx].dominators; | |||
dominator_analysis.dominated_by(dominator_nodeidx); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think the name dominated_by
is the opposite of what we want? This should be the set of dominators of dominator_nodeidx
src/analyses/cfg_dominators.h
Outdated
@@ -76,7 +74,45 @@ class cfg_dominators_templatet | |||
/// Note by definition all program points dominate themselves. | |||
bool dominates(T lhs, const nodet &rhs_node) const | |||
{ | |||
return rhs_node.dominators.count(lhs); | |||
if (!rhs_node.dominator) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Remove, duplicates the while
condition
src/analyses/cfg_dominators.h
Outdated
|
||
/// Returns a set of T that are the dominators of start_node. Only use if you | ||
/// need the entire set of dominators from node to root. | ||
std::set<T> dominated_by(T start_node) const |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
How about instead of returning a std::set
we provide an iterator, then dominates
can use it?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't mind it, but it'd need to be lazily-evaluated. Do we have anything like that right now I can copy/directly use?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I just got done writing a custom iterator, so I'll make a quick draft and post it
src/analyses/cfg_dominators.h
Outdated
|
||
/// Returns a set of T that are the dominators of start_node. Only use if you | ||
/// need the entire set of dominators from node to root. | ||
std::set<T> dominated_by(T start_node) const |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
As above, rename dominators
(but see iterator comment -- dominators
should return an iterable with a begin()
and end()
which walk over the tree)
src/analyses/cfg_dominators.h
Outdated
// in-place intersection. not safe to use set_intersect | ||
while(n_it!=node.dominators.end() && o_it!=other.end()) | ||
auto edge_index = *other.dominator; | ||
while(potential_dominator != edge_index) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is the intersect
function from the Cooper paper -- pull it out into a static function to make the correspondence with the paper clearer
src/analyses/sese_regions.cpp
Outdated
@@ -133,44 +133,43 @@ void sese_region_analysist::compute_sese_regions( | |||
++it) | |||
{ | |||
// Only look for regions starting at nontrivial CFG edges: | |||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
unrelated whitespace change
src/analyses/sese_regions.cpp
Outdated
|
||
// Ideally I would use `optionalt<std::size_t>` here, but it triggers a | ||
// GCC-5 bug. | ||
std::size_t closest_exit_index = dominators.cfg.size(); | ||
for(const auto &possible_exit : instruction_postdoms) | ||
std::size_t current_index = *instruction_postdom; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ideal use site for a dominators iterator
528caba
to
52f2caa
Compare
2276b24
to
ada97ce
Compare
0de344e
to
8f51eef
Compare
Codecov Report
@@ Coverage Diff @@
## develop #5137 +/- ##
===========================================
- Coverage 67.12% 67.09% -0.04%
===========================================
Files 1149 1149
Lines 94194 94073 -121
===========================================
- Hits 63229 63116 -113
+ Misses 30965 30957 -8
Continue to review full report at Codecov.
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This PR failed Diffblue compatibility checks (cbmc commit: 8f51eef).
Build URL: https://travis-ci.com/diffblue/test-gen/builds/131132137
Status will be re-evaluated on next push.
Common spurious failures include: the cbmc commit has disappeared in the mean time (e.g. in a force-push); the author is not in the list of contributors (e.g. first-time contributors); compatibility was already broken by an earlier merge.
Now good to review. Kudos to @smowton for chunks of this. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm part-way through reviewing this
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
- You should add a comment saying where you got the algorithm from, i.e. figure 3 in "A Simple, Fast Dominance Algorithm" by Cooper, Harvey and Kennedy. This means you don't have to explain why it does what you want it to.
- You haven't added any tests. Is this sufficiently tested? You're making quite a big change.
@owen-mc-diffblue Applied all comments. There's mention of the paper in the class documentation, that good enough? (I have added your comment about intersect diagram though) Otherwise there are enough existing tests to cover this so that no new ones should be needed. Probably. |
8f51eef
to
5573904
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Benchmarks?
src/analyses/cfg_dominators.h
Outdated
get_reverse_postordered_instructions(std::size_t entry_node_index) const; | ||
|
||
/// Walks up the dominators of left/right nodes until it finds one that is | ||
/// common to both sides. Used to work out a the common immediate dominator |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
"a the common immediate dominator" -> "the least common dominator" (i.e. the dominator that is common to both, and which dominates no other common dominator)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is 'least common denominator' a known phrase? It reads really awkwardly otherwise.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks like the go-to phrase is in fact "nearest common dominator"
src/analyses/cfg_dominators.h
Outdated
if(node.postorder_index != nodet::NODE_NOT_VISITED) | ||
return; | ||
|
||
// Otherwise set that we've processed it, and put children on the stack. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We don't put the children, we put the individual node (along with initialising that node's child iterators), as the method name suggests
std::size_t next_postorder_index = 0; | ||
std::vector<stack_entryt> stack; | ||
|
||
auto place_node_on_stack_if_not_visited = [this, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Would be good if you can find a way to please clang-format that's less odd than this
src/analyses/cfg_dominators.h
Outdated
|
||
std::vector<T> order; | ||
|
||
// Note that this will only select and order nodes that have been processed. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
that have been processed -> that are reachable from the entry point
src/analyses/cfg_dominators.h
Outdated
@@ -115,6 +246,26 @@ class cfg_dominators_templatet | |||
protected: | |||
void initialise(P &program); | |||
void fixedpoint(P &program); | |||
|
|||
/// Goal of this is to assign post-order numbering to each node in the tree. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
"Goal of this is to assign" -> "Assigns"
src/analyses/cfg_dominators.h
Outdated
void assign_postordering(std::size_t start_node_index); | ||
|
||
/// Sort our CFG nodes in reverse post-order. Just gets every node that we've | ||
/// processed (anything not NODE_NOT_VISITED) then order from highest to |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
order -> orders
src/analyses/cfg_dominators.h
Outdated
while(n_it!=node.dominators.end() && o_it!=other.end()) | ||
// compute intersection of predecessors | ||
auto &edges = (post_dom ? node.out : node.in); | ||
if(edges.size() != 0) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This check is still pointless (the for-loop will terminate immediately if there are no edges)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Caused exceptions at one point when attempting to get the in/out iterators, hopefully not the case anymore.
5573904
to
b7dd30d
Compare
Looks good except needs benchmarks. If in doubt ask @hannes-steffenhagen-diffblue for the benchmarking he did when trying to replace the dominators algorithm with Tarjan's. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This PR failed Diffblue compatibility checks (cbmc commit: 5573904).
Build URL: https://travis-ci.com/diffblue/test-gen/builds/131999602
Status will be re-evaluated on next push.
Common spurious failures include: the cbmc commit has disappeared in the mean time (e.g. in a force-push); the author is not in the list of contributors (e.g. first-time contributors); compatibility was already broken by an earlier merge.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This PR failed Diffblue compatibility checks (cbmc commit: b7dd30d).
Build URL: https://travis-ci.com/diffblue/test-gen/builds/132007774
Status will be re-evaluated on next push.
Common spurious failures include: the cbmc commit has disappeared in the mean time (e.g. in a force-push); the author is not in the list of contributors (e.g. first-time contributors); compatibility was already broken by an earlier merge.
Change existing API calls to use the new immediate-dominator version of dominator analysis.
b7dd30d
to
533c35a
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This PR failed Diffblue compatibility checks (cbmc commit: 533c35a).
Build URL: https://travis-ci.com/diffblue/test-gen/builds/132137431
Status will be re-evaluated on next push.
Common spurious failures include: the cbmc commit has disappeared in the mean time (e.g. in a force-push); the author is not in the list of contributors (e.g. first-time contributors); compatibility was already broken by an earlier merge.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Had some intermediate comments from a couple days ago, sorry if these are out of date now
@@ -12,15 +12,16 @@ Author: Georg Weissenbacher, [email protected] | |||
#ifndef CPROVER_ANALYSES_CFG_DOMINATORS_H | |||
#define CPROVER_ANALYSES_CFG_DOMINATORS_H | |||
|
|||
#include <set> | |||
#include <cassert> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Now that these are sorted we might as well remove this header, <cassert>
shouldn't be needed anywhere.
{ | ||
} | ||
|
||
private: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can you reorder this so it doesn't keep switching between private and public things?
{ | ||
advance(); | ||
return *this; | ||
} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
🚫
The logic for pre- and post increment is the wrong way round. operator++()
should advance()
then return *this
, operator++(int)
should save *this
, advance()
and then return the saved iterator.
const auto &next_dominator = current_node.dominator; | ||
INVARIANT( | ||
current_node.postorder_index == | ||
cfg_dominatorst::nodet::NODE_NOT_VISITED || |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
can you wrap this up in something like is_reachable()
, or to be more honest is_potentially_reachable
?
} | ||
|
||
/// Returns true if program point \p lhs dominates \p rhs. | ||
/// Note by definition all program points dominate themselves. | ||
bool dominates(T lhs, T rhs) const | ||
{ | ||
return dominates(lhs, get_node(rhs)); | ||
const auto rhs_dominators = dominators(rhs); | ||
return std::any_of( |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm very surprised there isn't something like std::contains
for this!
This moves our current algorithm closer to Coopers and has the benefit of massively reduced memory usage if there are a large amount of dominator nodes in a single method.
This does mean that any operation that needs to know anything about the state of the dominator nodes beyond the immediate then needs to walk back to find it, but with the improvements of the underlying map by @smowton it'll likely still be faster than the original implementation.
Currently up to make sure CI passes before opening for reviews.