Commit 21e0788

Removed ReadMe.md, Relocated dataset.csv, Added Explanation inside Notebook
1 parent b8a3a30 commit 21e0788

3 files changed: +31 -1035 lines changed


notebooks/data_preprocessing/DataCleaning/Data PreProcessing and Model Training.ipynb

+31 -12
@@ -4,7 +4,30 @@
 "cell_type": "markdown",
 "metadata": {},
 "source": [
-"Source : https://github.com/iSiddharth20/Predictive-Analysis-for-Machine-Faliure"
+"## Welcome to the AI-ML-Jupyter-Notebooks repository! \n",
+"### This guide will help you navigate and learn Data Pre-Processing and Cleaning using an Example Dataset.\n",
+"---"
+]
+},
+{
+"cell_type": "markdown",
+"metadata": {},
+"source": [
+"#### Make sure to install the necessary dependencies by running this command :\n",
+"\n",
+"pip install -r requirements.txt\n",
+"\n",
+"---"
+]
+},
+{
+"cell_type": "markdown",
+"metadata": {},
+"source": [
+"### Details of Dataset : \n",
+"#### Contains values obtained from certain sensors inside a machine; the task is to predict whether the machine will fail at certain readings from those sensors. Hence, it is important to clean the data and extract the most relevant information from it.\n",
+"##### Source of Dataset : https://github.com/iSiddharth20/Predictive-Analysis-for-Machine-Faliure\n",
+"---"
 ]
 },
 {
@@ -64,12 +87,15 @@
 "metadata": {},
 "outputs": [],
 "source": [
-"# Getting the Data Set in the Program\n",
-"\n",
+"# Getting the Data Set in the Program using URL\n",
 "url = 'https://raw.githubusercontent.com/iSiddharth20/Predictive-Analysis-for-Machine-Faliure/master/dataset.csv'\n",
-"\n",
 "s = requests.get(url).content\n",
-"data = pd.read_csv(io.StringIO(s.decode('utf-8')))"
+"data = pd.read_csv(io.StringIO(s.decode('utf-8')))\n",
+"\n",
+"\n",
+"# Getting the Data Set in the Program using a local CSV File\n",
+"source = '../../../data/dataset_datacleaning.csv'\n",
+"data = pd.read_csv(source)"
 ]
 },
 {
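
Note on the hunk above: as committed, the cell assigns `data` twice, so the download from the URL always runs and its result is then overwritten by the local CSV. A minimal sketch of one possible alternative follows, assuming the intent is to use the local file when it exists and fall back to the URL otherwise. The helper name `load_dataset` and the two constants are hypothetical and not part of the commit; the path and URL are copied from the diff.

```python
from pathlib import Path

import pandas as pd

# Both locations are copied from the notebook cell in this commit.
LOCAL_CSV = "../../../data/dataset_datacleaning.csv"
REMOTE_CSV = (
    "https://raw.githubusercontent.com/iSiddharth20/"
    "Predictive-Analysis-for-Machine-Faliure/master/dataset.csv"
)


def load_dataset(local_path=LOCAL_CSV, url=REMOTE_CSV):
    """Hypothetical helper: prefer the local CSV, fall back to the URL."""
    if Path(local_path).is_file():
        # Local copy found, so no network access is needed.
        return pd.read_csv(local_path)
    # pandas can read a CSV directly from an http(s) URL,
    # so requests and io.StringIO are not required for this path.
    return pd.read_csv(url)
```

In the notebook this would reduce the cell to `data = load_dataset()`. The relative path resolves against the notebook's working directory, so it only finds the local file when the notebook is run from its own folder.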
@@ -1447,13 +1473,6 @@
 "# Exporting Trained model as 'trained_model.pkl'\n",
 "joblib.dump(model,\"trained_model.pkl\")"
 ]
-},
-{
-"cell_type": "code",
-"execution_count": null,
-"metadata": {},
-"outputs": [],
-"source": []
 }
 ],
 "metadata": {

notebooks/data_preprocessing/DataCleaning/README.md

-9
This file was deleted.
