cricket-world-cup2019--k-means-clustering-on-batsmen-data

We use scraped player and match data to implement k-means clustering. For the data scraping itself, please refer to the companion repository: https://github.com/niteshjindal170988/cricket-world-cup2019--players_data_scrapping

The scraped data used for clustering can be found here: https://drive.google.com/drive/folders/1IQCUKo1zBkVKM1b9NPMl2EApVb00cpXn?usp=sharing

We cluster the players on four features: the number of balls they faced, the total runs they scored, the number of 4s they hit, and the number of 6s they hit.

This gives us groups of players who showed a similar pattern in their batting style. That is to say, players who hit a similar number of 4s and 6s and maintained similar strike rates form a cluster of their own.
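Strike rate is not one of the four features, but it follows directly from two of them: it is the number of runs scored per 100 balls faced. A minimal sketch of that relationship is shown here; the player names and numbers are invented for illustration and do not come from the scraped data.

```python
# Hypothetical per-batsman totals: (balls faced, runs scored, 4s, 6s).
# Names and numbers are made up for illustration only.
batsmen = {
    "Player A": (100, 120, 12, 4),
    "Player B": (200, 150, 10, 1),
}


def strike_rate(balls, runs):
    """Runs scored per 100 balls faced; 0.0 when no balls were faced."""
    return 100.0 * runs / balls if balls else 0.0


for name, (balls, runs, fours, sixes) in batsmen.items():
    print(name, round(strike_rate(balls, runs), 1))
```

Because balls faced and runs scored are both clustering features, two players with similar values for them will also have similar strike rates.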

Steps:

Load the scraped matches data, preprocess it, and extract the required information
Data preprocessing phase (from the scraped Matches_data.csv)
Prepare the batsman data: number of balls played, runs scored, number of 4s hit, number of 6s hit
Implement k-means clustering: randomly assign the initial centroids, then follow the expectation-maximization algorithm
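The last step can be sketched in plain Python as follows. This is an illustrative sketch, not the notebook's code: the function names, the `seed` and `max_iter` parameters, and the sample feature rows are all assumptions. The assignment step plays the role of the expectation step, and the centroid update plays the role of the maximization step.

```python
import random


def squared_distance(p, q):
    """Squared Euclidean distance between two equal-length points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def mean_point(points):
    """Component-wise mean of a non-empty list of points."""
    n = len(points)
    return tuple(sum(column) / n for column in zip(*points))


def k_means(points, k, max_iter=100, seed=0):
    """Cluster points into k groups; returns (labels, centroids)."""
    rng = random.Random(seed)
    # Random initialization: pick k data points as the starting centroids.
    centroids = rng.sample(points, k)
    labels = None
    for _ in range(max_iter):
        # Assignment step: attach each point to its nearest centroid.
        new_labels = [
            min(range(k), key=lambda c: squared_distance(p, centroids[c]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments stopped changing, so we have converged
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # an empty cluster keeps its previous centroid
                centroids[c] = mean_point(members)
    return labels, centroids


# Hypothetical feature rows: (balls faced, runs scored, 4s, 6s).
points = [
    (30, 20, 2, 0), (35, 25, 3, 0), (28, 18, 1, 0),
    (300, 320, 30, 12), (310, 335, 32, 14), (295, 310, 29, 11),
]
labels, centroids = k_means(points, k=2)
print(labels)
print(centroids)
```

The four features are on different scales (balls and runs are much larger than the counts of 4s and 6s), so the larger ones dominate the distance. Standardizing each feature before clustering is a common way to balance them; whether the notebook does so is not stated here.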
