Question About Input Architecture for GSP Data #33

kwon-encored · 2025-03-04T05:28:28Z

kwon-encored
Mar 4, 2025

Hi @Sukh-P,
I hope you're doing well!

I finally got the model running, and it's now achieving an NMAE10 close to 7! Still working to bring it down further, hopefully closer to your impressive ~3.

For my region of interest, I've divided it into 59 GSPs and have one full year of data (from Dec 1, 2023, to Dec 31, 2024).

I was curious about how your study structured the input data. In my case, I’ve considered two scenarios for training/testing splits:

1. Using GSPs 1 to 57 for training and validation, while reserving GSPs 58 and 59 for testing.
2. Using data from all GSPs but splitting temporally: training and validation with data from January to September, and October to December for testing.

Which of these approaches is more aligned with your study, or do you recommend a different strategy?

Sukh-P · 2025-03-07T16:20:10Z

Sukh-P
Mar 7, 2025
Maintainer

Hi @kwon-encored, that's great to hear! I would recommend doing the training/validation split by time across all GSPs (your option 2). With option 1 there's a chance of data leakage, since the training GSPs would cover the same period you are testing on.
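
To make the leakage point concrete, here is a minimal sketch in pandas, not code from the project. The column names `gsp_id` and `time`, the daily resolution, and the 2024 date range are all assumptions for illustration. It builds both splits from the question and counts how many timestamps the training and test sets share (validation is left out for brevity).

```python
import pandas as pd

# Hypothetical toy frame: one row per (gsp_id, time). Column names are assumptions.
times = pd.date_range("2024-01-01", "2024-12-31", freq="D")
df = pd.DataFrame(
    [(g, t) for g in range(1, 60) for t in times],
    columns=["gsp_id", "time"],
)

# Option 1: hold out GSPs 58 and 59; both sets cover every timestamp.
train_by_gsp = df[df["gsp_id"] <= 57]
test_by_gsp = df[df["gsp_id"] >= 58]

# Option 2: keep every GSP and hold out October to December.
cutoff = pd.Timestamp("2024-10-01")
train_by_time = df[df["time"] < cutoff]
test_by_time = df[df["time"] >= cutoff]


def shared_timestamps(train: pd.DataFrame, test: pd.DataFrame) -> int:
    """Number of distinct timestamps present in both train and test."""
    return len(set(train["time"]) & set(test["time"]))


overlap_by_gsp = shared_timestamps(train_by_gsp, test_by_gsp)
overlap_by_time = shared_timestamps(train_by_time, test_by_time)
print(overlap_by_gsp, overlap_by_time)
```

In this toy setup the GSP holdout shares every timestamp with the training set, while the temporal split shares none, which is the overlap the leakage concern is about.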

One thing to note is that for our models we have generally used ~3 years of data for training and around 1 year for validation. That way the model sees each season several times during training and is likely to generalise better.
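
As a rough illustration of the "several years of each season" point, the sketch below splits a hypothetical four-year daily index by a cutoff date and counts how many distinct years each calendar month appears in within the training set. The dates and the `time` column are placeholders, not the actual data range used for the models.

```python
import pandas as pd

# Hypothetical four-year daily index; the dates are placeholders.
times = pd.date_range("2020-01-01", "2023-12-31", freq="D")
frame = pd.DataFrame({"time": times})

# Assumed cutoff: the final year is kept for validation.
val_start = pd.Timestamp("2023-01-01")
train = frame[frame["time"] < val_start]
val = frame[frame["time"] >= val_start]

# For each calendar month, count the distinct years seen in training.
train = train.assign(month=train["time"].dt.month, year=train["time"].dt.year)
years_per_month = train.groupby("month")["year"].nunique()
print(years_per_month.min())
```

With roughly one year of data, as in the question, the same count would be 1 for most months, so the model sees each season only once.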

All the best with the ongoing work!
