Google Summer of Code 2025: Open Data PVNet Discussion Thread #24
Replies: 44 comments 21 replies
-
|
Hey! |
Beta Was this translation helpful? Give feedback.
-
|
I am interested in contributing to Open Climate fix for gsoc 2025. I'm eager to apply my Python and machine learning fundamentals. How much guidance will be available for the NWP specific aspects? |
Beta Was this translation helpful? Give feedback.
-
|
Hi, @peterdudfield I'm Sairam, and I have a strong passion for machine learning, particularly with Python and LLMs. The Open Data Solar Forecasting Pipeline project has really caught my attention, and I'm excited about the possibility of contributing to it as part of GSoC '25. I've begun looking into the PVNet repository to familiarize myself with the current work, and I've also checked out some "good first issues." Recently, I commented on an issue in the Analysis Dashboard repository and am now waiting to be assigned so I can make my first contribution to Open Climate Fix. In addition to training the model using open NWP data, I would like to clarify what specific contributions you are looking for from me. Should I concentrate on enhancing model performance, integrating new datasets, or is there another area you’d prefer I focus on? Moreover, since PVNet is a key component of this project and has piqued my interest, would it be more beneficial for me to tackle a "good first issue" within PVNet instead of working on another repository? If that's the case, do you have any suggestions for an issue that would align well with the project's goals? I'm eager to begin with an initial contribution that supports the long-term objectives. I look forward to your guidance! |
Beta Was this translation helpful? Give feedback.
-
|
Hey!! I am highly interested in the intersection of climate science, machine learning, and sustainability, and I find the work being done here particularly intriguing. I have experience working with meteorological variables and applying machine learning techniques, and I would love to contribute to the ML modeling efforts. After reviewing the repository, I understand that the initial focus is on predicting the target variable for the entire UK region. Please correct me if I am mistaken. I would appreciate any guidance on how I can best contribute to the project. |
Beta Was this translation helpful? Give feedback.
-
|
Hi @peterdudfield, I'm S Vijaya Bhaskar, a machine learning enthusiast with strong Python and PyTorch skills, and I have a keen interest in climate data and sustainable tech. I'm really excited about contributing to the solar forecasting project. I'm currently also contributing to OpenClimateFix in the elexonapi, so I'm getting familiar with the ecosystem. I noticed that while there aren't any issues labeled as "good first issues" at the moment, there are several open issues. Could you recommend one that would be a great starting point for someone with my background? I'm eager to dive in—whether it's improving model performance, integrating new datasets, or any other area where support is needed. Thanks for your guidance! |
Beta Was this translation helpful? Give feedback.
-
|
Hi! This GSoC 2025 project on transitioning PVNet to a public-data-only model is very intriguing. I've been following the advancements in Numerical Weather Prediction (NWP), particularly the application of diffusion models, hybrid models, and transformers, and I'm excited to see this applied to PVNet. I understand the core objective is to migrate PVNet from a mixed public/private dataset to a purely public dataset. This raises several key questions, and I'd love to gain a clearer understanding of the challenges:
I'm particularly interested in exploring potential architectural modifications Thank you for your time, and I look forward to learning more about this project! |
Beta Was this translation helpful? Give feedback.
-
|
Hey ,
I checked out the repo, and it looks really cool! Looking forward to learning more and getting involved. Best, |
Beta Was this translation helpful? Give feedback.
-
|
I'm Ajit Ashwath, and I’m really excited about this project. I’ve been working with ML for a while, mainly using Python, PyTorch and TensorFlow, and the idea of applying ML to renewable energy is really interesting to work on. I’d love to contribute to improving solar forecasting with open data. Before that, I do have some questions to ask. Since this model will be trained solely on a public NWP dataset, are there any known issues or limitations when using it? Does it involve a lot of preprocessing we need to consider? And, are there any particular, specific datasets available for expansion outside of the UK? Looking forward to hearing more! |
Beta Was this translation helpful? Give feedback.
-
|
Hi @peterdudfield, I'm Jiya Gupta, and I'm interested in contributing to this project and would love to get involved. Could you guide me on where to start? I'm particularly keen on understanding how the data pipeline is set up and how I can help in making the data samples more manageable for ML training. Looking forward to your guidance! |
Beta Was this translation helpful? Give feedback.
-
|
Hey @peterdudfield , I'm Yeswanth, and I'm really excited about contributing to the Open Data PVNet project as part of GSoC 2025. I have a strong background in Python and machine learning, and I'm eager to build the model from scratch using open Numerical Weather Prediction (NWP) data. I've already started exploring the PVNet repo to understand the existing framework and get familiar with how things are structured. Before diving in, I wanted to clarify a few things about the scope and direction of this project: Key Questions:Core Model Goals – Since we’re building this from scratch, what’s the primary focus?
Challenges with Open Data – What are the biggest obstacles when training a model only on open-source NWP data?
Baseline Comparison – Is there an existing baseline we should aim to match or improve upon?
Model Architecture Decisions – Should we reuse parts of the PVNet model or start fresh?
Training and Deployment – What’s the expected workflow?
First Steps – What’s the best way to get started?
|
Beta Was this translation helpful? Give feedback.
-
|
Hi @peterdudfield @Sukh-P, apologies for the ping I’m Chaitra Samant, a second-year Computer Engineering student from VJTI Mumbai, passionate about open-source and sustainable tech solutions. I’ve actively contributed to open-source projects, completed Hacktoberfest, and won an open-source contribution contest. I’m highly interested in contributing to Open Data PVNet for GSoC. I have experience with PyTorch, Deep Learning, and Web Development, and I’m eager to work on training the solar forecasting model using free NWP data. I’d love to understand the next steps for potential contributors—are there any prerequisite tasks or areas I should focus on to prepare effectively? Looking forward to your guidance! |
Beta Was this translation helpful? Give feedback.
-
|
Hello, @peterdudfield @Sukh-P. Perhaps this is the project that matches me. I recently worked as a machine learning engineer intern at AirQowere they used local sensors to provide data, and then they decided to launch a research on the possibility of the atomespheric measurements and their correlation with the air quality (PM2.5 Predicition), I developed a model that scored 3rd on the benchmark while being suitable for production environment I was then selected to implement my ML model on their platform were I was introduced to the world of MLOPs and models deployment. Experiments, model optimization, and validation are my thing. The research with airqo helped me with the foundations of NWP. I would love to discuss my fit to this project and if there is still a need for more hands to work on it. |
Beta Was this translation helpful? Give feedback.
-
|
Hi @peterdudfield , @Sukh-P Your project's goal to leverage 100% open data for forecasting solar generation at the national level, starting with the UK, aligns perfectly with my skills and interests. I have experience with Python and Transflow, which I believe would be beneficial for this project. I have reviewed the project guidelines and have a few questions regarding the specifics of the data and the model's architecture. I would also appreciate some guidance on writing a cover letter tailored to this opportunity, as I am eager to make a significant contribution. Could we schedule a time to discuss this further? I am looking forward to the possibility of working together and contributing to the success of the Open Data PVNet project. Thank you for considering my interest. |
Beta Was this translation helpful? Give feedback.
-
|
Hi, I am Dhruvilsinh Chauhan, a grad student at San Diego State University. I am working as a AI/ML Research Assistant at JSB AI Center. I have experience with Python, PyTorch and ML algorithms. I am excited to train a model on new NWP data, create and integrate the solar forecasting pipeline. Looking forward to hear from you. Thank you, |
Beta Was this translation helpful? Give feedback.
-
|
Hey @peterdudfield , @Sukh-P, Here's a rough outline of my plan:
A few questions to help guide and refine my approach:
Looking forward to your thoughts and feedback. |
Beta Was this translation helpful? Give feedback.
-
|
Hello @peterdudfield and @Sukh-P, I'm Vinayak Yadav, currently working at SAC, ISRO on precipitation nowcasting and lightning prediction. I implemented DGMR for satellite data to benchmark our diffusion-based nowcasting model, and I'm planning to publish and deploy these projects (hopefully making them public) so they can serve the entire Indian subcontinent! For PVNet, I have a few questions and proposals:
Also, Thank you so much for all the incredible work on the Open Climate Fix project. I’ve personally benefited from it, and it's inspiring to see its positive impact on our community and the many researchers who rely on your efforts. Lastly, I'm really looking forward to your feedback and the chance to collaborate further. Best regards, |
Beta Was this translation helpful? Give feedback.
-
|
Hi everyone! I’m Shravani Jaiswal, a Computer Science Engineering student with interests in Machine Learning and Renewable Energy. I’m excited about contributing to Open Data PVNet as part of Google Summer of Code 2025. I have experience with Python and basic ML (PyTorch), and I’m eager to deepen my understanding of solar forecasting and NWP data. I’ve gone through the project documentation and GitHub issues, and I’m particularly interested in:
This would be my first time contributing in open source so I would need some guidance I’d love to contribute to this project and learn from the mentors and community here! Could you suggest any beginner-friendly issues or key areas where I can start contributing? |
Beta Was this translation helpful? Give feedback.
-
|
Excited to contribute! Looking into how multi-agent systems can improve forecasting and grid optimization. Keen to collaborate! |
Beta Was this translation helpful? Give feedback.
-
|
Hello @peterdudfield and @Sukh-P, I'm interested in contributing to this project for Google Summer of Code 2025. I am 4th year student from C. V. Raman University. I have machine learning experience in Python and PyTorch. I’ve also worked on projects like Inception-Xception hybrid model (TensorFlow ,working on its PyTorch version), lstm_gru_cnn_sru_ensmeble and now working on cnn_sru hybrid model (everything in pytorch). I am every interested in PVNET model as how it is different from other model e.g. resnet, googlenet, etc. I’d love to learn more about the expectations for the ML pipeline and how best to prepare for the application process. Looking forward to your guidance! |
Beta Was this translation helpful? Give feedback.
-
|
Hi @peterdudfield and @Sukh-P, I'm Abel Saj, a CS and Statistics double major at UNC Chapel Hill and I'm interested in working on this project! I have experience in ML and Python, and I’ve worked on projects involving generative AI, computer vision, neural networks, and other machine learning models. With my statistics and computer science background, I can analyze and model complex datasets effectively. I'm excited about contributing to an open-source solar forecasting pipeline and working with NWP data. |
Beta Was this translation helpful? Give feedback.
-
|
Respected sir, |
Beta Was this translation helpful? Give feedback.
-
|
Dear Peter, Sukh, and the Open Climate Fix team, I am writing to express my enthusiastic interest in contributing to the Open Data PVNet project as part of Google Summer of Code 2025. As a Ph.D. student in Artificial Intelligence, I am deeply engaged in research that leverages spatio-temporal modeling and deep learning, which strongly aligns with the challenges and goals of solar forecasting using Numerical Weather Prediction (NWP) data. My current doctoral research focuses on motion detection and energy tracking in video scenes using spatio-temporal filters inspired by human vision. This has given me hands-on experience building and optimizing deep learning architectures that extract temporal dependencies in complex, dynamic data expertise that I am eager to bring to the PVNet forecasting pipeline. During my Master’s degree in Decision Support and Intelligent Systems, I worked extensively on deep learning projects involving Recurrent Neural Networks (RNNs), LSTM networks, and Multilayer Perceptrons (MLPs). I’ve applied these models to behavior-based identification systems and instance segmentation tasks using Mask R-CNN. My strong background in PyTorch, Python, and handling large-scale datasets positions me well to contribute to building a solar forecasting model that relies entirely on open data. What excites me most about this project is the mission to democratize clean energy forecasting using transparent, reproducible methods. I’m particularly interested in working with open-source communities and using machine learning not just for innovation, but also for environmental and societal benefit. I’d love the opportunity to work closely with your team to develop a robust and benchmarked solar forecasting model for the UK and eventually help extend it to other regions. I'm confident that my research mindset, technical background, and collaborative spirit will make me a strong contributor to this initiative. Thank you for considering my application. I look forward to the possibility of discussing how I can contribute to Open Data PVNet and support your mission at Open Climate Fix. Warm regards, |
Beta Was this translation helpful? Give feedback.
-
|
Hii @peterdudfield and the Open Climate Fix team |
Beta Was this translation helpful? Give feedback.
-
|
Thanks for the clear project description! Given that public NWP data might lack the granularity (spatial/temporal resolution, specific features) of the private sources used in Quartz Solar, what are the main anticipated challenges for the PVNet model in achieving good accuracy with only this open data? Is the initial focus purely on training the existing architecture, or does it potentially include exploring specific architectural adaptations or feature engineering strategies to compensate for the difference in input data richness? |
Beta Was this translation helpful? Give feedback.
-
|
Hi @peterdudfield, @Sukh-P, I am Gianluca Ferro. I graduated with a Master’s degree in Electronic Engineering and currently work as a research fellow in Artificial Intelligence at ciparlabs , focusing on smart grids and renewable energy communities. My Master’s thesis centered on time series forecasting, where I developed a framework of predictors—ranging from gradient boosting to statistical models and neural networks—that can forecast energy consumption and generation from renewable sources. I find your Open Data PVNet project especially compelling because it aligns with my interest in applying ML to optimize energy production and consumption. I’m excited about the opportunity to train my models on purely open-data resources and help benchmark their accuracy against existing solutions. Before I dive in, I just have one question: Are there any specific challenges or limitations encountered when using publicly available NWP data for forecasting solar generation, such as gaps in spatiotemporal resolution or reliability issues? I look forward to possibly contributing to this project and helping advance the open-source solar forecasting capabilities. Thank you for your time, and I’m eager to discuss how I might fit into your development roadmap. Best regards, |
Beta Was this translation helpful? Give feedback.
-
|
Hi @peterdudfield and @Sukh-P , I’m Mayank Jain, a second-year B.Tech student majoring in Computer Science with a focus on Data Science. I’m really excited about the Open Data PVnet project—it aligns closely with my interest in renewable energy and machine learning. During my academic journey, I explored solar forecasting models as part of my coursework and hackathon experiences. I’d love to contribute to this project and also learn more by collaborating with your amazing team. I’ve gone through the GitHub repo and documentation, and I’m eager to get started. Looking forward to your guidance and feedback! Best regards, |
Beta Was this translation helpful? Give feedback.
-
|
Hi @peterdudfield and team, I'm Harsh Dhingra, currently enrolled in Master's student in Data Science and Msc Finance Before this, I worked as an ML Engineer at Lighthouse Global, where I developed and deployed several end-to-end ML solutions — including document classification systems with DistilBERT, RAG pipelines using Azure OpenAI, and optimized inference using NVIDIA Triton. I just discovered this project and would love to contribute! Given my experience with large-scale model deployment and ML optimization, I’m confident I can help with both the model training and performance benchmarking aspects of the open data solar forecasting pipeline. Looking forward to learning more about the project and how I can get involved! Best, |
Beta Was this translation helpful? Give feedback.
-
Google Summer of Code 2025 applications are now closed.We are currently reviewing all applications. Contributors will be announced 8 May 2025. Thank you! |
Beta Was this translation helpful? Give feedback.
-
|
I am probably late to the party. But hi. I am very interested in this project and I'm considering contributing to it in GSOC'26. So, can you give me the 101s about the project and where I should start |
Beta Was this translation helpful? Give feedback.
-
|
Great to see many people working on this project, i feel like climate issues are the most critical issue at hand right now. I would love to contribute to this project and participate in GSoC'26, since I want to make a real impact from my work. Could u provide me with the starting resources about this project. Thank you, |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
This space is for you to ask any questions you have about this project. We're here to provide clarifications and help you understand the project's goals, scope, and requirements. Feel free to ask about anything that interests you!
Please note that this discussion is for questions and clarifications, not for formal applications.
Project Description
We're building an open-source solar forecasting pipeline to integrate with OCF's PVNet model, using publicly available Numerical Weather Prediction data to forecast solar generation at the national level, starting with the UK. Currently, our main forecasting tool, Quartz Solar, is trained using a mixture of public and private datasets, and we want to create an effective model that uses 100% open data.
The data is ready to go, but we need a ML engineer to train the model. The aim will be to start with a UK forecast, but then extend to different countries.
Expected Outcome
A UK ML Solar forecast
trained on free NWP data with the accuracy benchmarked.
Other Key Information
Beta Was this translation helpful? Give feedback.
All reactions