File tree 1 file changed +21
-0
lines changed
Complexity_Learning_curves
1 file changed +21
-0
lines changed Original file line number Diff line number Diff line change
1
+ ## Complexity and Learning curves
2
+ Complexity and learning curve analyses are some of the most important tasks in a Machine Learning project.
3
+
4
+ ** Learning curve** : Graphs that compares the performance of a model on training and testing data over a varying number of training instances.
5
+
6
+ We should generally see performance improve as the number of training points increases.
7
+
8
+ ** Complexity curve** : Graphs that show the model performance over training and validation set for varying degree of model complexity
9
+ (e.g. degree of polynomial for linear regression, number of layers or neurons for neural networks,
10
+ number of estimator trees for a Boosting algorithm or Random Forest)
11
+
12
+ Complexity curve allows us to verify when a model has learned as much as it can about the data without fitting to the noise.
13
+ The optimum learning (given the fixed data) occurs when,
14
+
15
+ * The performances on the training and testing sets reach a plateau
16
+ * There is a consistent gap between the two error rates
17
+
18
+ The key is to find the sweet spot that minimizes bias and variance by finding the right level of model complexity.
19
+ Of course with more data any model can improve, and different models may be optimal.
20
+
21
+ ### Following is the intuitive illustration of model complexity curve from Andrew Ng's Machine Learning course
You can’t perform that action at this time.
0 commit comments