You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: lectures/robustness.md
+17-9
Original file line number
Diff line number
Diff line change
@@ -44,11 +44,11 @@ His specification doubts make the decision-maker want a *robust* decision rule.
44
44
45
45
*Robust* means insensitive to misspecification of transition dynamics.
46
46
47
-
The decision-maker has a single *approximating model*.
47
+
The decision-maker has a single *approximating model* of the transition dynamics.
48
48
49
49
He calls it *approximating* to acknowledge that he doesn't completely trust it.
50
50
51
-
He fears that outcomes will actually be determined by another model that he cannot describe explicitly.
51
+
He fears that transition dynamics are actually determined by another model that he cannot describe explicitly.
52
52
53
53
All that he knows is that the actual data-generating model is in some (uncountable) set of models that surrounds his approximating model.
54
54
@@ -91,7 +91,7 @@ Our "robust" decision-maker wants to know how well a given rule will work when h
91
91
92
92
$\ldots$ he wants to know *sets* of values that will be attained by a given decision rule $F$ under a *set* of transition laws.
93
93
94
-
Ultimately, he wants to design a decision rule $F$ that shapes these *sets* of values in ways that he prefers.
94
+
Ultimately, he wants to design a decision rule $F$ that shapes the *set* of values in ways that he prefers.
95
95
96
96
With this in mind, consider the following graph, which relates to a particular decision problem to be explained below
97
97
@@ -108,7 +108,7 @@ Here
108
108
**Value* refers to a sum of discounted rewards obtained by applying the decision rule $F$ when the state starts at some fixed initial state $x_0$.
109
109
**Entropy* is a non-negative number that measures the size of a set of models surrounding the decision-maker's approximating model.
110
110
* Entropy is zero when the set includes only the approximating model, indicating that the decision-maker completely trusts the approximating model.
111
-
* Entropy is bigger, and the set of surrounding models is bigger, the less the decision-maker trusts the approximating model.
111
+
* Entropy is bigger, and the set of surrounding models is bigger, the less the decision-maker trusts the approximating model of the transition dynamics.
112
112
113
113
The shaded region indicates that for **all** models having entropy less than or equal to the number on the horizontal axis, the value obtained will be somewhere within the indicated set of values.
114
114
@@ -196,15 +196,23 @@ We also allow for *model uncertainty* on the part of the agent solving this opti
196
196
197
197
In particular, the agent takes $w_t = 0$ for all $t \geq 0$ as a benchmark model but admits the possibility that this model might be wrong.
198
198
199
-
As a consequence, she also considers a set of alternative models expressed in terms of sequences $\{ w_t \}$ that are "close" to the zero sequence.
199
+
As a consequence, she also considers a set of alternative models expressed in terms of sequences $\{ w_t \}$ that are more or less "close" to the zero sequence.
200
200
201
201
She seeks a policy that will do well enough for a set of alternative models whose members are pinned down by sequences $\{ w_t \}$.
202
202
203
-
Soon we'll quantify the quality of a model specification in terms of the maximal size of the expression $\sum_{t=0}^{\infty} \beta^{t+1}w_{t+1}' w_{t+1}$.
203
+
A sequence $\{ w_t \}$ might represent
204
+
205
+
* nonlinearities absent from the approximating model
206
+
* time variations in parameters of the approximating model
207
+
* omitted state variables in the approximating model
208
+
* neglected history dependencies $\ldots$
209
+
* and other potential sources of misspecification
210
+
211
+
Soon we'll quantify the quality of a model specification in terms of the maximal size of the discounted sum $\sum_{t=0}^{\infty} \beta^{t+1}w_{t+1}' w_{t+1}$.
204
212
205
213
## Constructing More Robust Policies
206
214
207
-
If our agent takes $\{ w_t \}$ as a given deterministic sequence, then, drawing on intuition from earlier lectures on dynamic programming, we can anticipate Bellman equations such as
215
+
If our agent takes $\{ w_t \}$ as a given deterministic sequence, then, drawing on ideas in earlier lectures on dynamic programming, we can anticipate Bellman equations such as
208
216
209
217
$$
210
218
J_{t-1} (x)
@@ -221,7 +229,7 @@ Our tool for studying robustness is to construct a rule that works well even if
221
229
222
230
In our framework, "adverse" means "loss increasing".
223
231
224
-
As we'll see, this will eventually lead us to construct the Bellman equation
232
+
As we'll see, this will eventually lead us to construct a Bellman equation
225
233
226
234
```{math}
227
235
:label: rb_wcb0
@@ -456,7 +464,7 @@ The remaining step for agent 2's problem is to set $\theta$ to enforce the const
456
464
Here $x_t$ is given by {eq}`rob_lomf` --- which in this case becomes $x_{t+1} = (A - B F + CK(F, \theta)) x_t$.
457
465
458
466
(rb_a1)=
459
-
### Using Agent 2's Problem to Construct Bounds on the Value Sets
467
+
### Using Agent 2's Problem to Construct Bounds on Value Sets
0 commit comments