Skip to content

Commit 7de87eb

Browse files
author
Robert Coleman
authored
Update README.md
1 parent cd194aa commit 7de87eb

File tree

1 file changed

+14
-3
lines changed

1 file changed

+14
-3
lines changed

README.md

+14-3
Original file line numberDiff line numberDiff line change
@@ -2,8 +2,9 @@
22

33
Classifies an image as containing either a dog or a cat (using Kaggle's <a href="https://www.kaggle.com/c/dogs-vs-cats-redux-kernels-edition/data">public dataset</a>), but could easily be extended to other image classification problems.
44

5-
To run this script, you must have keras, numpy, scipy, and h5py installed, and GPU acceleration with keras is <em>highly</em> recommended.
5+
To run these scripts/notebooks, you must have keras, numpy, scipy, and h5py installed, and enabling GPU acceleration is highly recommended if that's an option.
66

7+
## img_clf.py
78
The data directory structure I used was:
89

910
* project
@@ -23,7 +24,17 @@ Most of the code / strategy here was based on <a href="https://blog.keras.io/bui
2324

2425
Pre-trained VGG16 model weights can be downloaded <a href="https://gist.github.com/baraldilorenzo/07d7802847aaad0a35d3">here</a>.
2526

26-
### UPDATE:
27-
Added jupyter notebook that produced a slightly better score (.161 log loss on kaggle test set). Might run into memory errors because of the large image dimensions -- if so reducing the number of folds and saving the model weights rather than keeping the models in memory should do the trick. The notebook uses a slightly flatter directory structure, with the validation split happening after the images are loaded.
27+
## cats_n_dogs.ipynb:
28+
This produced a slightly better score (.161 log loss on kaggle test set). The better score most likely comes from having larger images and ensembling a few models, despite the fact there's no image augmentation in the notebook.
29+
30+
Might run into memory errors because of the large image dimensions -- if so reducing the number of folds and saving the model weights rather than keeping the models in memory should do the trick. The notebook uses a slightly flatter directory structure, with the validation split happening after the images are loaded:
31+
32+
* project
33+
* data
34+
* train
35+
* dogs
36+
* cats
37+
* test
38+
* test
2839

2940

0 commit comments

Comments
 (0)