This is a Python code for organizing concert information.
Result will be stored in Pandas DataFrame.
Tested on Python 3.8.18
Currently it works for LA Phil website and can crawl only scrolling type page. It will be updated.
My goal is to make it generally working for any kind of concert halls, sports matches, etc.
If you don't have libraries below, install these libraries.
- $ pip install webdriver-manager selenium beautifulsoup4 pandas
You may need ChromeDriver for this. Check here https://chromedriver.chromium.org/downloads
- $ git clone https://github.com/belyllium/Concert-Web-Crawler
- $ python3 run.py
Or you can run with crawl.ipynb. Both have same result.