Skip to content
Discussion options

You must be logged in to vote

Hi @kwon-encored, that's great to hear! So I would recommend the training/validation split be done by time for all GSPs, option 2 you present, since there's a chance otherwise there is some data leakage from other GSPs being trained on a period you are testing in option 1.

One thing to note is that for our models we generally have used ~3 years of data for training and around 1 year for validation, since this way the model gets to see at least a few years of each season in training and is likely to generalise better.

All the best with the ongoing work!

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by kwon-encored
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Help
Labels
None yet
2 participants