Skip to content

Latest commit

 

History

History
32 lines (20 loc) · 4.38 KB

index.md

File metadata and controls

32 lines (20 loc) · 4.38 KB
site
sandpaper::sandpaper_site

Introduction to Geospatial Raster and Vector Data with Python

In this lesson you will learn how to work with geospatial datasets and how to process these with Python. Python is one of the most popular programming languages for data science and analytics, with a large and steadily growing community in the field of Earth and Space Sciences. The lesson is meant for participants with a basic knowledge of Python and it allows them to familiarize with the world of geospatial raster and vector data. If you are unfamiliar with Python, useful resources to get started include the Software Carpentry's lesson "Programming with Python" and the book "Think Python" by Allen Downey. In the Introduction to Geospatial Raster and Vector Data with Python lesson you will be introduced to a set of tools from the Python ecosystem and learn how these can be used to carry out geospatial data analysis tasks. In particular, you will work with satellite images and open topographical geo-datasets, and learn how these spatial datasets can be accessed, explored, manipulated and visualized using Python.

Case study - Wildfires

As a case study for this lesson we will focus on wildfires. According to the IPCC assessment report, the wildfire seasons are lengthening as a result of changes in temperature and increasing drought conditions. To analyse the impact of these wildfires, we will focus on the wildfire that occurred on the Greek island of Rhodes in the summer of 2023, which had a devastating effect and led to the evacuation of 19.000 people. In this lesson we are going to analyse the effect of this disaster by estimating which built-up areas were affected by these wildfires. Furthermore, we will analyse which vegetation and land-use types have been affected the most by the wildfire in order to get an understanding of which areas are more vulnerable to wildfires. The analysis that we set up provides insights in the effect of the wildfire and generates input for wildfire mitigation strategies.

Note, that the analyses presented in this lesson are developed for educational purposes. Therefore in some occasions the analysis steps have been simplified and assumptions have been made.

The data used in this lesson includes optical satellite images from the Copernicus Sentinel-2 mission and topographical data from OpenStreetMap (OSM). These datasets are real-world open data sets that entail sufficient complexity to teach many aspects of data analysis and management. The datasets have been selected to allow participants to focus on the core ideas and skills being taught while offering the chance to encounter common challenges with geospatial data. Furthermore, we have selected datasets which are available anywhere on Earth.

During this lesson we will setup an analysis pipeline which identifies scorched areas based on bands of satellite images collected after the disaster in July 2023. Next, we will calculate the Normalized Difference Vegetation Index (NDVI) to assess the vegetation cover of the areas before and after the wildfire. To investigate the affected built-up areas and main roads, we will use OSM vector data and compare them with the previously identified scorched areas.

To most effectively use this material, make sure to download the data and follow the software setup instructions before working through the lesson (this especially accounts for learners that follow this lesson in a workshop).

Python libraries used in this lesson

The main python libraries that are used in this lesson are: