Skip to content

Commit 54d4e3f

Browse files
authored
Create Readme.md
1 parent 0d5835e commit 54d4e3f

File tree

1 file changed

+21
-0
lines changed

1 file changed

+21
-0
lines changed

Complexity_Learning_curves/Readme.md

+21
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,21 @@
1+
## Complexity and Learning curves
2+
Complexity and learning curve analyses are some of the most important tasks in a Machine Learning project.
3+
4+
**Learning curve**: Graphs that compares the performance of a model on training and testing data over a varying number of training instances.
5+
6+
We should generally see performance improve as the number of training points increases.
7+
8+
**Complexity curve**: Graphs that show the model performance over training and validation set for varying degree of model complexity
9+
(e.g. degree of polynomial for linear regression, number of layers or neurons for neural networks,
10+
number of estimator trees for a Boosting algorithm or Random Forest)
11+
12+
Complexity curve allows us to verify when a model has learned as much as it can about the data without fitting to the noise.
13+
The optimum learning (given the fixed data) occurs when,
14+
15+
* The performances on the training and testing sets reach a plateau
16+
* There is a consistent gap between the two error rates
17+
18+
The key is to find the sweet spot that minimizes bias and variance by finding the right level of model complexity.
19+
Of course with more data any model can improve, and different models may be optimal.
20+
21+
### Following is the intuitive illustration of model complexity curve from Andrew Ng's Machine Learning course

0 commit comments

Comments
 (0)