Add rewrites to replace or remove Aeppl CheckParameterValue Ops #5233

ricardoV94 · 2021-12-01T11:51:06Z

This PR adds rewrites to replace or remove Aeppl CheckParameterValue Ops in logprob expressions.

Closes #5205
Closes #5204
Closes #4429

These rewrites are included by default when calling pymc.aesaraf.compile_pymc, which was previously named compile_rv_inplace
pymc.distributions.dist_math.bound was renamed to check_parameters and now returns an expression wrapped in the CheckParameterValue Op
The Model.check_bounds flag now only affects graphs at compilation time when pymc.aesaraf.compile_pymc is called within a Model context and that flag is set to False.
Most logp/logcdf methods were changed so that the bounding of the value variable is done separately from the parameter checks (missing this in a couple of multivariate distributions). This is in line with how aeppl defines logprob graphs, and to a lesser extent how scipy does things as well. It also provides a solution to Interaction between logcdf and new check_bounds flag #4429
Added explicit tests for univariate value and parameter bounds in check_logp, similar to what was already done in check_logcdf, as well as some specialized tests for non-scalar parameters/values in multivariate distributions.

The new dist_math.check_parameters always compresses the conditions to a scalar since that's required by CheckParameterValue. That means that when evaluating a logp vector, either all results get replaced by -np.inf or none do. It no longer switches only those that corresponded to invalid parameters/values.

I don't think it makes sense to generate logprob graphs differently than Aeppl does. If we want to keep the old format we should then reintroduce our own versions of logp/logcdf to replace those that are defined in Aeppl (many common distributions are defined there), or else we will have a mix of expressions that follow distinct logics.

codecov · 2021-12-06T15:29:57Z

Codecov Report

Merging #5233 (ae850a6) into main (7191e61) will increase coverage by 0.13%.
The diff coverage is 95.89%.

@@            Coverage Diff             @@
##             main    #5233      +/-   ##
==========================================
+ Coverage   78.98%   79.12%   +0.13%     
==========================================
  Files          88       88              
  Lines       14231    14301      +70     
==========================================
+ Hits        11240    11315      +75     
+ Misses       2991     2986       -5

Impacted Files	Coverage Δ
pymc/sampling_jax.py	`0.00% <0.00%> (ø)`
pymc/distributions/mixture.py	`19.76% <33.33%> (ø)`
pymc/distributions/multivariate.py	`73.20% <93.75%> (+0.68%)`	⬆️
pymc/aesaraf.py	`90.10% <96.29%> (+0.39%)`	⬆️
pymc/distributions/continuous.py	`96.85% <97.77%> (+0.12%)`	⬆️
pymc/distributions/bound.py	`100.00% <100.00%> (ø)`
pymc/distributions/discrete.py	`98.46% <100.00%> (+0.10%)`	⬆️
pymc/distributions/dist_math.py	`86.78% <100.00%> (-0.92%)`	⬇️
pymc/gp/util.py	`94.68% <100.00%> (ø)`
pymc/initial_point.py	`100.00% <100.00%> (ø)`
... and 5 more

brandonwillard · 2021-12-06T15:41:37Z

pymc/aesaraf.py

+for database in ("canonicalize", "stabilize", "specialize", "useless"):
+    aesara.compile.optdb[database].register(
+        "local_remove_check_parameter",
+        local_remove_check_parameter,
+        use_db_name_as_tag=False,
+    )
+
+    aesara.compile.optdb[database].register(
+        "local_check_parameter_to_ninf_switch",
+        local_check_parameter_to_ninf_switch,
+        use_db_name_as_tag=False,
+    )


Why add these to all those databases?

Also, don't forget that you can set a priority that affects the order in which they're run.

I just copied what I've seen elsewhere.

Can you give some recommendation on what is the minimum database(s) where these should be registered, as well as what the priority should be?

You should be able to put those in just the pass named "useless" and have them removed before the other passes. Check out aesara.compile.mode to see the passes and their ordering.

Otherwise, I would think that removing those checks should be a user-configurable option, and, if so, you might not want to include them by default. Instead, an option could be set that includes those rewrites when graphs are compiled by PyMC (e.g. by constructing a Mode object for aesara.function that uses includes=...).

Otherwise, I would think that removing those checks should be a user-configurable option, and, if so, you might not want to include them by default. Instead, an option could be set that includes those rewrites when graphs are compiled by PyMC (e.g. by constructing a Mode object for aesara.function that uses includes=...).

The rewrites are not running by default due to the use_db_name_as_tag=False option. I am manually including them in compile_pymc helper function only. Otherwise the new tests that expect the specific Exception would fail.

You should be able to put those in just the pass named "useless" and have them removed before the other passes. Check out aesara.compile.mode to see the passes and their ordering.

For some reason putting it in "useless" alone is not doing anything, but putting them in "canonicalize" seems to do the job.

Looks like we need to refactor that OptimizationDatabase code; the whole use_db_name_as_tag thing, and the special way it works with only one DB implementation (i.e. EquilibriumDB), is not good. Just the fact that there's a note about a specific subclass in the base class is a bad sign.

Anyway, "useless" doesn't use the same kind of underlying OptimizationDatabase types as "canonicalize", so I'm guessing that the reason the latter works and the former doesn't is related to that.

ricardoV94 · 2021-12-06T17:59:53Z

The failing dirichlet_multinomial tests should be solved by #5234

…essions * These rewrites are included by default when calling pymc.aesaraf.compile_pymc, which was previously named compile_rv_inplace * pymc.distributions.dist_math.bound was renamed to check_parameters and now returns an expression wrapped in the CheckParameterValue Op * The Model.check_bounds flag now only affects graphs at compilation time when pymc.aesaraf.compile_pymc is called within a Model context and that flag is set to False.

* Allow edges to be infinity in discrete domains * Automatically assign infinity edges when these are set to (None, None) * Simplex and MultiSimplex now return a Domain instance

…k_logp`

ricardoV94 added logprob request discussion v4 labels Dec 1, 2021

ricardoV94 force-pushed the logp_asserts branch from c46dafe to 3b99d14 Compare December 1, 2021 13:20

ricardoV94 requested review from brandonwillard, twiecki, ferrine and aseyboldt December 1, 2021 13:20

ricardoV94 force-pushed the logp_asserts branch 2 times, most recently from b283501 to 5caeb9a Compare December 2, 2021 10:33

ricardoV94 added this to the v4.0.0-beta1 (vNext) milestone Dec 2, 2021

ricardoV94 mentioned this pull request Dec 4, 2021

NotImplementedError with JAX backend and PyMC v4 #5240

Closed

ricardoV94 force-pushed the logp_asserts branch 2 times, most recently from 8c763c6 to 9735754 Compare December 6, 2021 15:28

brandonwillard reviewed Dec 6, 2021

View reviewed changes

ricardoV94 force-pushed the logp_asserts branch 3 times, most recently from 18c51db to 4284fc8 Compare December 6, 2021 17:58

ricardoV94 marked this pull request as ready for review December 6, 2021 17:59

ricardoV94 force-pushed the logp_asserts branch 2 times, most recently from b322a33 to d943a5c Compare December 6, 2021 18:42

ricardoV94 added 7 commits December 8, 2021 11:32

Update aeppl version

a093da2

Register CheckParameterValue in jax_funcify_Assert

35a8a52

Replace value bounding check_parameters with explicit switch statement

1ac88af

Add positive assertion to get_tau_sigma and Gamma.get_alpha_beta

5971bd0

Refactor Domain helper class

97587bf

* Allow edges to be infinity in discrete domains * Automatically assign infinity edges when these are set to (None, None) * Simplex and MultiSimplex now return a Domain instance

Add scalar parameter and value bound checks in `TestMatchesScipy.chec…

a3ab0f1

…k_logp`

ricardoV94 added 2 commits December 8, 2021 11:32

Add bound tests for some non-scalar values and parameters

901b72f

Add xfail to TestMatchesScipy.test_laplace

ae850a6

ricardoV94 force-pushed the logp_asserts branch from d943a5c to ae850a6 Compare December 8, 2021 10:33

twiecki approved these changes Dec 10, 2021

View reviewed changes

twiecki merged commit a4f9657 into pymc-devs:main Dec 10, 2021

ricardoV94 mentioned this pull request Dec 10, 2021

Simulator tests failing sporadically #5252

Closed

ricardoV94 deleted the logp_asserts branch December 12, 2021 14:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Add rewrites to replace or remove Aeppl CheckParameterValue Ops #5233

Add rewrites to replace or remove Aeppl CheckParameterValue Ops #5233

ricardoV94 commented Dec 1, 2021 •

edited

Loading

Uh oh!

codecov bot commented Dec 6, 2021 •

edited

Loading

Uh oh!

brandonwillard Dec 6, 2021

Uh oh!

ricardoV94 Dec 6, 2021

Uh oh!

brandonwillard Dec 6, 2021 •

edited

Loading

Uh oh!

ricardoV94 Dec 6, 2021 •

edited

Loading

Uh oh!

ricardoV94 Dec 6, 2021 •

edited

Loading

Uh oh!

brandonwillard Dec 6, 2021 •

edited

Loading

Uh oh!

ricardoV94 commented Dec 6, 2021

Uh oh!

Uh oh!

Uh oh!

Add rewrites to replace or remove Aeppl CheckParameterValue Ops #5233

Add rewrites to replace or remove Aeppl CheckParameterValue Ops #5233

Conversation

ricardoV94 commented Dec 1, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov bot commented Dec 6, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

brandonwillard Dec 6, 2021

Choose a reason for hiding this comment

Uh oh!

ricardoV94 Dec 6, 2021

Choose a reason for hiding this comment

Uh oh!

brandonwillard Dec 6, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ricardoV94 Dec 6, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ricardoV94 Dec 6, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

brandonwillard Dec 6, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ricardoV94 commented Dec 6, 2021

Uh oh!

Uh oh!

ricardoV94 commented Dec 1, 2021 •

edited

Loading

codecov bot commented Dec 6, 2021 •

edited

Loading

brandonwillard Dec 6, 2021 •

edited

Loading

ricardoV94 Dec 6, 2021 •

edited

Loading

ricardoV94 Dec 6, 2021 •

edited

Loading

brandonwillard Dec 6, 2021 •

edited

Loading