|
60 | 60 | "## A temperature forecasting problem\n",
|
61 | 61 | "\n",
|
62 | 62 | "Until now, the only sequence data we have covered has been text data, for instance the IMDB dataset and the Reuters dataset. But sequence \n",
|
63 |
| - "data is found in many more problems than just language processing. In all of our examples in this section, will be playing with a weather \n", |
| 63 | + "data is found in many more problems than just language processing. In all of our examples in this section, we will be playing with a weather \n", |
64 | 64 | "timeseries dataset recorded at the Weather Station at the Max-Planck-Institute for Biogeochemistry in Jena, Germany: http://www.bgc-jena.mpg.de/wetter/.\n",
|
65 | 65 | "\n",
|
66 | 66 | "In this dataset, fourteen different quantities (such air temperature, atmospheric pressure, humidity, wind direction, etc.) are recorded \n",
|
|
221 | 221 | "\n",
|
222 | 222 | "* `lookback = 720`, i.e. our observations will go back 5 days.\n",
|
223 | 223 | "* `steps = 6`, i.e. our observations will be sampled at one data point per hour.\n",
|
224 |
| - "* `delay = 144`, i.e. our targets will be 24 hour in the future.\n", |
| 224 | + "* `delay = 144`, i.e. our targets will be 24 hours in the future.\n", |
225 | 225 | "\n",
|
226 | 226 | "To get started, we need to do two things:\n",
|
227 | 227 | "\n",
|
228 | 228 | "* Preprocess the data to a format a neural network can ingest. This is easy: the data is already numerical, so we don't need to do any \n",
|
229 |
| - "vectorization. However each timeseries in the data is one a different scale (e.g. temperature is typically between -20 and +30, but \n", |
| 229 | + "vectorization. However each timeseries in the data is on a different scale (e.g. temperature is typically between -20 and +30, but \n", |
230 | 230 | "pressure, measured in mbar, is around 1000). So we will normalize each timeseries independently so that they all take small values on a \n",
|
231 | 231 | "similar scale.\n",
|
232 | 232 | "* Write a Python generator that takes our current array of float data and yields batches of data from the recent past, alongside with a \n",
|
|
0 commit comments