Support for discounted properties in MDPs and DTMCs #621

AlexBork · 2024-09-10T10:18:24Z

This PR adds support for checking cumulative and total reward properties with a given discount factor on MDPs and DTMCs.

Extends the parser to allow discounted properties; the proposed syntax is R=? [Cdiscount=_factor_]
Introduces helper classes for checking discounted properties
Extends the respective PCTL model checking classes with support for discounted cumulative and total reward properties, implemented analogously to the existing undiscounted implementations

sjunges · 2024-09-10T10:45:59Z

This is absolutely great.

I have two questions:

Does this code support convergence detection based on the Bellman residual error?
Do we throw relevant error messages when doing, e.g., weak bisimulation?

Best,
Sebastian

AlexBork · 2024-09-10T11:19:10Z

Yes, convergence is detected using the Bellman residual, see src/storm/solver/helper/DiscountedValueIterationHelper.cpp. I have also tested convergence using only the number of iterations as the criterion, i.e. letting the iteration run until it is guaranteed that we are precise enough. That was, however, highly inefficient, as the number of iterations for guaranteed convergence is usually far larger than the number actually needed to reach the precision.
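The two stopping criteria discussed above can be sketched as follows. This is a minimal illustration on a toy MDP type, not the code in DiscountedValueIterationHelper.cpp: the names `Action`, `Mdp`, `ViResult`, `discountedValueIteration` and `aPrioriIterationBound` are hypothetical, and the sketch assumes maximisation, a start vector of zero, and rewards bounded in absolute value by `maxReward`. The residual threshold relies on the standard contraction argument: with discount factor g, a step of size at most eps * (1 - g) / g implies that the new iterate is within eps of the fixed point.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <utility>
#include <vector>

// Hypothetical toy MDP representation (not Storm's SparseMatrix).
struct Action {
    double reward;                                     // immediate reward of the action
    std::vector<std::pair<std::size_t, double>> succ;  // (successor state, probability)
};
using Mdp = std::vector<std::vector<Action>>;  // mdp[s] = actions enabled in state s

struct ViResult {
    std::vector<double> values;
    std::size_t iterations;
};

// Maximal expected discounted total reward, stopping on the Bellman residual.
ViResult discountedValueIteration(Mdp const& mdp, double discount, double epsilon) {
    assert(discount > 0.0 && discount < 1.0);
    // ||x_{k+1} - x_k|| <= eps * (1 - g) / g implies ||x_{k+1} - x*|| <= eps.
    double const threshold = epsilon * (1.0 - discount) / discount;
    std::vector<double> x(mdp.size(), 0.0), next(mdp.size(), 0.0);
    std::size_t iterations = 0;
    while (true) {
        double residual = 0.0;
        for (std::size_t s = 0; s < mdp.size(); ++s) {
            double best = 0.0;  // states without actions keep value 0
            bool first = true;
            for (Action const& a : mdp[s]) {
                double v = a.reward;
                for (auto const& sp : a.succ) {
                    v += discount * sp.second * x[sp.first];
                }
                if (first || v > best) {
                    best = v;
                    first = false;
                }
            }
            next[s] = best;
            residual = std::max(residual, std::abs(next[s] - x[s]));
        }
        x.swap(next);
        ++iterations;
        if (residual <= threshold) {
            break;
        }
    }
    return {x, iterations};
}

// Iteration count that guarantees precision eps without looking at the iterates:
// starting from 0, the error after n steps is at most g^n * maxReward / (1 - g).
std::size_t aPrioriIterationBound(double maxReward, double discount, double epsilon) {
    assert(maxReward > 0.0 && discount > 0.0 && discount < 1.0 && epsilon > 0.0);
    double const n = std::log(epsilon * (1.0 - discount) / maxReward) / std::log(discount);
    return n <= 0.0 ? 0 : static_cast<std::size_t>(std::ceil(n));
}
```

The a priori bound only depends on the largest reward and the discount factor, so it cannot benefit from states whose values settle early, which is consistent with the observation that it tends to overshoot the iterations actually needed.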

The error messages are not implemented yet; I will take care of it! I would appreciate any pointers to where such problems might occur.

src/storm/logic/DiscountedCumulativeRewardFormula.h

src/storm/modelchecker/helper/DiscountingHelper.cpp

src/storm/solver/helper/DiscountedValueIterationHelper.cpp

AlexBork · 2024-10-22T10:29:53Z

Thanks @sjunges for the review! I incorporated your comments now.

sjunges · 2024-10-29T11:32:19Z

Thanks! @tquatmann do you want to have a look before merging?

tquatmann

Thanks for the great work!

Do we (have to) enforce somewhere that the discount factor is strictly between 0 and 1? For both formula types (total and cumulative formulas)? I might have missed this in my review.
I'd suggest adding more tests, in particular in FormulaParserTest, DtmcPrctlModelCheckerTest, and SchedulerGenerationMdpPrctlModelCheckerTest. The existing test also doesn't consider DiscountedCumulativeRewardFormulas.
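If such a range check is wanted, it could look roughly like the sketch below. The function name `checkedDiscountFactor` is hypothetical, and a plain `double` with `std::invalid_argument` stands in for whatever value type and exception class the code base actually uses; where the check should live (parser, formula constructor, or model checker) is left open here.

```cpp
#include <stdexcept>
#include <string>

// Hypothetical helper: returns the factor unchanged if it lies strictly
// between 0 and 1, and throws otherwise. The negated form also rejects NaN.
double checkedDiscountFactor(double discountFactor) {
    if (!(discountFactor > 0.0 && discountFactor < 1.0)) {
        throw std::invalid_argument("Discount factor must be strictly between 0 and 1, got " +
                                    std::to_string(discountFactor));
    }
    return discountFactor;
}
```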

tquatmann · 2025-01-11T07:35:29Z

src/storm/logic/DiscountedCumulativeRewardFormula.h

+
+namespace storm {
+namespace logic {
+class DiscountedCumulativeRewardFormula : public CumulativeRewardFormula {


This means that .isCumulativeRewardFormula() is also true for DiscountedCumulativeRewardFormulas.
Does this have any unintended effects (e.g. in places like multi-objective model checking, where the discounted formula would be treated as if it were undiscounted)? Maybe it's safer to also override .isCumulativeRewardFormula() to return false here, so that things fail rather than silently drop the discount factor.

Also, this probably needs to override gatherUsedVariables to make sure that variables in the discount factor are caught.

Same for TotalRewardFormulas.
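The suggested override can be illustrated with stripped-down stand-ins for the formula classes. Everything below is a sketch: the classes only mimic the inheritance relation shown in the diff, the predicate `isDiscountedCumulativeRewardFormula` and the function `supportedByUndiscountedAnalysis` are hypothetical, and a `double` replaces the expression that holds the discount factor in the real class.

```cpp
#include <cassert>

// Hypothetical, minimal stand-ins for the formula hierarchy.
class Formula {
   public:
    virtual ~Formula() = default;
    virtual bool isCumulativeRewardFormula() const { return false; }
    virtual bool isDiscountedCumulativeRewardFormula() const { return false; }
};

class CumulativeRewardFormula : public Formula {
   public:
    bool isCumulativeRewardFormula() const override { return true; }
};

class DiscountedCumulativeRewardFormula : public CumulativeRewardFormula {
   public:
    explicit DiscountedCumulativeRewardFormula(double discountFactor) : discountFactor(discountFactor) {}

    // Opt out of the undiscounted predicate so that code written for
    // undiscounted formulas rejects this formula instead of ignoring the factor.
    bool isCumulativeRewardFormula() const override { return false; }
    bool isDiscountedCumulativeRewardFormula() const override { return true; }

    double getDiscountFactor() const { return discountFactor; }

   private:
    double discountFactor;
};

// Hypothetical consumer that only knows about undiscounted formulas.
bool supportedByUndiscountedAnalysis(Formula const& f) {
    return f.isCumulativeRewardFormula();
}
```

With the override in place, a consumer like `supportedByUndiscountedAnalysis` declines the discounted formula, whereas without it the inherited `true` would let the formula through with its discount factor ignored.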

tquatmann · 2025-01-11T07:37:04Z

src/storm/logic/ExpressionSubstitutionVisitor.cpp

+        optionalRewardAccumulation = f.getRewardAccumulation();
+    }
+    return std::static_pointer_cast<Formula>(
+        std::make_shared<DiscountedCumulativeRewardFormula>(f.getDiscountFactor(), bounds, timeBoundReferences, optionalRewardAccumulation));


The substitution function also needs to be applied to the discount factor.

tquatmann · 2025-01-11T07:38:22Z

src/storm/logic/ExpressionSubstitutionVisitor.h

@@ -24,6 +24,7 @@ class ExpressionSubstitutionVisitor : public CloneVisitor {
    virtual boost::any visit(RewardOperatorFormula const& f, boost::any const& data) const override;
    virtual boost::any visit(BoundedUntilFormula const& f, boost::any const& data) const override;
    virtual boost::any visit(CumulativeRewardFormula const& f, boost::any const& data) const override;
+    virtual boost::any visit(DiscountedCumulativeRewardFormula const& f, boost::any const& data) const override;


Also needs a case for the DiscountedTotalRewardFormula (to apply substitution on the discount factor)

tquatmann · 2025-01-11T07:41:49Z

src/storm/logic/FormulaInformationVisitor.cpp

+    FormulaInformation result;
+    result.setContainsCumulativeRewardFormula(true);
+    for (unsigned i = 0; i < f.getDimension(); ++i) {
+        if (f.getTimeBoundReference(i).isRewardBound()) {
+            result.setContainsRewardBoundedFormula(true);
+        }
+    }
+    return result;
+}
+
+boost::any FormulaInformationVisitor::visit(DiscountedTotalRewardFormula const&, boost::any const&) const {
+    return FormulaInformation();


Don't these need a result.setContainsDiscountFormula(true)?

tquatmann · 2025-01-11T08:16:40Z

src/storm/modelchecker/helper/DiscountingHelper.h

+    DiscountingHelper(storm::storage::SparseMatrix<ValueType> const& A, ValueType discountFactor);
+    DiscountingHelper(storm::storage::SparseMatrix<ValueType> const& A, ValueType discountFactor, bool trackScheduler);
+
+    void setUpViOperator() const;


Wouldn't it be simpler to make setUpViOperator private and call it within solveWithDiscountedValueIteration?

tquatmann · 2025-01-11T08:28:52Z

src/storm/modelchecker/prctl/helper/SparseDtmcPrctlHelper.cpp

+    // Initialize result to the zero vector.
+    std::vector<ValueType> result(transitionMatrix.getRowGroupCount(), storm::utility::zero<ValueType>());
+
+    auto multiplier = storm::solver::MultiplierFactory<ValueType>().create(env, transitionMatrix);


This does not use the discountFactor, right?
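For reference, one way the discount factor could enter a step-bounded reward computation on a DTMC is sketched below. This is not the code in SparseDtmcPrctlHelper.cpp: it uses a dense matrix and state rewards only, the name `discountedCumulativeReward` is hypothetical, and it assumes the semantics that the reward collected at step t is weighted by the discount factor raised to the power t. Under that assumption, the iteration is x_0 = 0 and x_{i+1} = r + g * P * x_i.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

using Matrix = std::vector<std::vector<double>>;  // dense row-stochastic matrix

// Expected discounted reward accumulated within `stepBound` steps, per start state.
std::vector<double> discountedCumulativeReward(Matrix const& P, std::vector<double> const& stateRewards,
                                               double discount, std::size_t stepBound) {
    assert(P.size() == stateRewards.size());
    // Initialize result to the zero vector.
    std::vector<double> x(P.size(), 0.0), next(P.size(), 0.0);
    for (std::size_t step = 0; step < stepBound; ++step) {
        for (std::size_t s = 0; s < P.size(); ++s) {
            double expected = 0.0;
            for (std::size_t t = 0; t < P[s].size(); ++t) {
                expected += P[s][t] * x[t];
            }
            // The discount factor scales the value carried over from the successors.
            next[s] = stateRewards[s] + discount * expected;
        }
        x.swap(next);
    }
    return x;
}
```

For a single state with a self-loop, reward 1, discount factor 0.5 and step bound 3, this accumulates 1 + 0.5 + 0.25 = 1.75, while a discount factor of 1 gives the undiscounted value 3.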

AlexBork added 3 commits September 5, 2024 17:10

Add discounting for MDPs and DTMCs

806bd5c

Add test case for discounting

3b6f523

Merge branch 'master' into discounting-merge

0dcc557

AlexBork added this to the 1.10 milestone Sep 10, 2024

Add handling for discounting in bisimulation

5069177

sjunges reviewed Oct 19, 2024

src/storm/logic/DiscountedCumulativeRewardFormula.h (outdated)

sjunges reviewed Oct 19, 2024

src/storm/modelchecker/helper/DiscountingHelper.cpp (outdated)

sjunges reviewed Oct 19, 2024

src/storm/solver/helper/DiscountedValueIterationHelper.cpp (outdated)

AlexBork added 5 commits October 21, 2024 15:21

Merge branch 'master' into discounting-merge

6abe306

Refactoring to use undiscounted formulae as parent classes

aa8ef93

Formatting

8beb6c6

Fix bound and add documentation

a88a27f

Cleanup

7004eb5

sjunges requested a review from tquatmann November 28, 2024 11:21

tquatmann reviewed Jan 11, 2025
