Skip to content

Commit f8a60d9

Browse files
Tom's second edits of advanced lectures, Feb 5
1 parent 8c6b2f2 commit f8a60d9

File tree

1 file changed

+17
-9
lines changed

1 file changed

+17
-9
lines changed

lectures/robustness.md

+17-9
Original file line numberDiff line numberDiff line change
@@ -44,11 +44,11 @@ His specification doubts make the decision-maker want a *robust* decision rule.
4444

4545
*Robust* means insensitive to misspecification of transition dynamics.
4646

47-
The decision-maker has a single *approximating model*.
47+
The decision-maker has a single *approximating model* of the transition dynamics.
4848

4949
He calls it *approximating* to acknowledge that he doesn't completely trust it.
5050

51-
He fears that outcomes will actually be determined by another model that he cannot describe explicitly.
51+
He fears that transition dynamics are actually determined by another model that he cannot describe explicitly.
5252

5353
All that he knows is that the actual data-generating model is in some (uncountable) set of models that surrounds his approximating model.
5454

@@ -91,7 +91,7 @@ Our "robust" decision-maker wants to know how well a given rule will work when h
9191

9292
$\ldots$ he wants to know *sets* of values that will be attained by a given decision rule $F$ under a *set* of transition laws.
9393

94-
Ultimately, he wants to design a decision rule $F$ that shapes these *sets* of values in ways that he prefers.
94+
Ultimately, he wants to design a decision rule $F$ that shapes the *set* of values in ways that he prefers.
9595

9696
With this in mind, consider the following graph, which relates to a particular decision problem to be explained below
9797

@@ -108,7 +108,7 @@ Here
108108
* *Value* refers to a sum of discounted rewards obtained by applying the decision rule $F$ when the state starts at some fixed initial state $x_0$.
109109
* *Entropy* is a non-negative number that measures the size of a set of models surrounding the decision-maker's approximating model.
110110
* Entropy is zero when the set includes only the approximating model, indicating that the decision-maker completely trusts the approximating model.
111-
* Entropy is bigger, and the set of surrounding models is bigger, the less the decision-maker trusts the approximating model.
111+
* Entropy is bigger, and the set of surrounding models is bigger, the less the decision-maker trusts the approximating model of the transition dynamics.
112112

113113
The shaded region indicates that for **all** models having entropy less than or equal to the number on the horizontal axis, the value obtained will be somewhere within the indicated set of values.
114114

@@ -196,15 +196,23 @@ We also allow for *model uncertainty* on the part of the agent solving this opti
196196

197197
In particular, the agent takes $w_t = 0$ for all $t \geq 0$ as a benchmark model but admits the possibility that this model might be wrong.
198198

199-
As a consequence, she also considers a set of alternative models expressed in terms of sequences $\{ w_t \}$ that are "close" to the zero sequence.
199+
As a consequence, she also considers a set of alternative models expressed in terms of sequences $\{ w_t \}$ that are more or less "close" to the zero sequence.
200200

201201
She seeks a policy that will do well enough for a set of alternative models whose members are pinned down by sequences $\{ w_t \}$.
202202

203-
Soon we'll quantify the quality of a model specification in terms of the maximal size of the expression $\sum_{t=0}^{\infty} \beta^{t+1}w_{t+1}' w_{t+1}$.
203+
A sequence $\{ w_t \}$ might represent
204+
205+
* nonlinearities absent from the approximating model
206+
* time variations in parameters of the approximating model
207+
* omitted state variables in the approximating model
208+
* neglected history dependencies $\ldots$
209+
* and other potential sources of misspecification
210+
211+
Soon we'll quantify the quality of a model specification in terms of the maximal size of the discounted sum $\sum_{t=0}^{\infty} \beta^{t+1}w_{t+1}' w_{t+1}$.
204212

205213
## Constructing More Robust Policies
206214

207-
If our agent takes $\{ w_t \}$ as a given deterministic sequence, then, drawing on intuition from earlier lectures on dynamic programming, we can anticipate Bellman equations such as
215+
If our agent takes $\{ w_t \}$ as a given deterministic sequence, then, drawing on ideas in earlier lectures on dynamic programming, we can anticipate Bellman equations such as
208216

209217
$$
210218
J_{t-1} (x)
@@ -221,7 +229,7 @@ Our tool for studying robustness is to construct a rule that works well even if
221229

222230
In our framework, "adverse" means "loss increasing".
223231

224-
As we'll see, this will eventually lead us to construct the Bellman equation
232+
As we'll see, this will eventually lead us to construct a Bellman equation
225233

226234
```{math}
227235
:label: rb_wcb0
@@ -456,7 +464,7 @@ The remaining step for agent 2's problem is to set $\theta$ to enforce the const
456464
Here $x_t$ is given by {eq}`rob_lomf` --- which in this case becomes $x_{t+1} = (A - B F + CK(F, \theta)) x_t$.
457465

458466
(rb_a1)=
459-
### Using Agent 2's Problem to Construct Bounds on the Value Sets
467+
### Using Agent 2's Problem to Construct Bounds on Value Sets
460468

461469
#### The Lower Bound
462470

0 commit comments

Comments
 (0)