Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Process Metoffice data ready, for ml team #33

Open
peterdudfield opened this issue Jan 9, 2025 · 7 comments
Open

Process Metoffice data ready, for ml team #33

peterdudfield opened this issue Jan 9, 2025 · 7 comments
Assignees

Comments

@peterdudfield
Copy link
Contributor

No description provided.

@peterdudfield peterdudfield converted this from a draft issue Jan 9, 2025
@peterdudfield
Copy link
Contributor Author

Follows on from #31

@alirashidAR
Copy link
Contributor

@peterdudfield I would like to take up this issue could you please assign it to me .

@peterdudfield
Copy link
Contributor Author

THanks, please ask @jcamier for more details

@alirashidAR
Copy link
Contributor

@jcamier, could you please provide a brief overview of this task? I’m currently occupied with mid-semester exams, so I’ll be able to work on it in 2 to 3 days. Thanks!

@peterdudfield
Copy link
Contributor Author

@jcamier should be able to advise. Currently he is uploading lots of Metoffice data to huggingface. We need to process this so its in an easy to use format for the ML modelling. It should be a simialr shape to GFS data already online

@jcamier
Copy link
Collaborator

jcamier commented Jan 31, 2025

@alirashidAR I have a cron job that is getting the met office data from AWS netcdf files, converting to .zarr.zip and uploading them to hugging face ever hour due to rate limit constraints. Right now it is in March of 2023.
https://huggingface.co/datasets/openclimatefix/met-office-uk-deterministic-solar/tree/main/data/2023

The data is organized as follows data/<year>/<month>/<day>/.zarr.zip file for every hour of nwp forecast

I believe the next step is to be able to read this data using the ocf-data-sampler. I am currently reading up on ocf-data-sampler to get familiar with this. Maybe we can setup a time to discuss? Feel free to email me some dates/times that work for you.

@jcamier
Copy link
Collaborator

jcamier commented Jan 31, 2025

Here is a roadmap that Peter and I worked on a while back:

Image

@jcamier jcamier moved this from Todo to In Progress in Open Data PVNet Feb 8, 2025
@jcamier jcamier self-assigned this Feb 8, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: In Progress
Development

No branches or pull requests

3 participants