New features #45

peterdudfield · 2023-03-22T08:15:19Z

jacobbieker · 2023-03-22T08:29:18Z

I would probably start with removing the sde from training, and then probably more lag features? I think XGBoost models don't need the data to be normalized, so not sure that's necessary, although I guess if the units are different between CEDA and live MetOffice it probably makes sense to do that first.

peterdudfield · 2023-03-22T09:01:52Z

Bonus one is to add mcc and hcc to nwp variables

JackKelly · 2023-03-22T11:22:43Z

if the units are different between CEDA and live MetOffice it probably makes sense to do that first

Yeah, it's pretty essential that the data the model sees at inference time is exactly the same as the data it sees at training time 🙂 so I agree that sounds like the priority!

And I agree with @jacobbieker that I don't think XGBoost models require the data to be normalised (because it chops real-valued inputs up into bins).

Does the model also get historical NWP data? If not, I think that might help a bit: i.e. if the model gets lagged GSP data for n timesteps in the past, then it might be useful to give the model NWP data for those same timesteps so the model can see the difference between the expected forecast (given the NWP) and what actually happened in the recent past. But maybe the model is already doing that?

peterdudfield · 2023-03-22T14:41:21Z

Thanks @JackKelly and @jacobbieker , i re-ordered above, do you that order is about right?

JackKelly · 2023-03-22T18:45:50Z

Lgtm!

jacobbieker · 2023-03-23T09:33:21Z

Looks great!

peterdudfield · 2023-03-23T13:47:12Z

Thanks, @dantravers you happy with this?

dantravers · 2023-03-23T23:30:15Z

Looks reasonable to me! I'd be curious to see if this does well, so could be higher?
Use historic NWP data, not just forecasts
But seems sensible. Thanks for asking the open question!

peterdudfield added this to Nowcasting WP4 Mar 29, 2023

peterdudfield closed this as completed Sep 12, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

New features #45

New features #45

peterdudfield commented Mar 22, 2023 •

edited

Loading

jacobbieker commented Mar 22, 2023

peterdudfield commented Mar 22, 2023

JackKelly commented Mar 22, 2023 •

edited

Loading

peterdudfield commented Mar 22, 2023

JackKelly commented Mar 22, 2023

jacobbieker commented Mar 23, 2023

peterdudfield commented Mar 23, 2023

dantravers commented Mar 23, 2023

New features #45

New features #45

Comments

peterdudfield commented Mar 22, 2023 • edited Loading

jacobbieker commented Mar 22, 2023

peterdudfield commented Mar 22, 2023

JackKelly commented Mar 22, 2023 • edited Loading

peterdudfield commented Mar 22, 2023

JackKelly commented Mar 22, 2023

jacobbieker commented Mar 23, 2023

peterdudfield commented Mar 23, 2023

dantravers commented Mar 23, 2023

peterdudfield commented Mar 22, 2023 •

edited

Loading

JackKelly commented Mar 22, 2023 •

edited

Loading