EasyEnsembleGeneralization #4

Open

chkoar wants to merge 3 commits into master from easy_ensemble_generalization

Owner

chkoar commented Jul 20, 2017

@glemaitre I was thinking that I could proceed to the experiments using this implementation. Any suggestions are welcome.

chkoar force-pushed the easy_ensemble_generalization branch 2 times, most recently from efec02e to b216fe7

July 20, 2017 02:53

glemaitre reviewed

Collaborator

glemaitre left a comment

A couple of thoughts. This is a quick review; I would need more time to make sure that everything I said can actually be implemented.

imblearn/ensemble/easy_ensemble_generalization.py Outdated

+                      random_state = check_random_state(self.random_state)
+                      estimator_seeds = random_state.randint(MAX_INT, size=self.n_estimators)
+                      sampler_seeds = random_state.randint(MAX_INT, size=self.n_estimators)

Collaborator

glemaitre Jul 20, 2017

I think that we should use _set_random_states from scikit-learn's ensemble/base.py:
https://github.com/scikit-learn/scikit-learn/blob/master/sklearn/ensemble/base.py

imblearn/ensemble/easy_ensemble_generalization.py Outdated

+                      pipelines = []
+                      seeds = zip(estimator_seeds, sampler_seeds)
+                      for i, (estimator_seed, sampler_seed) in enumerate(seeds):

Collaborator

glemaitre Jul 20, 2017

If the random states are set properly beforehand, we could run this loop in parallel with joblib.
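To illustrate the point about parallelism, here is a minimal sketch that uses only the standard library. It assumes that all seeds are drawn up front from one master generator, so each ensemble member owns its own generator and the result does not depend on execution order. `fit_member` and `fit_ensemble` are hypothetical names, and `concurrent.futures` stands in for joblib here.

```python
import random
from concurrent.futures import ThreadPoolExecutor


def fit_member(seed, data):
    """Hypothetical stand-in for 'under-sample, then fit one ensemble member'."""
    rng = random.Random(seed)        # each member owns its generator
    subset = rng.sample(data, k=5)   # plays the role of random under-sampling
    return sorted(subset)


def fit_ensemble(data, n_estimators, random_state, parallel):
    # Draw every seed before any work starts, so the seeds do not depend
    # on the order in which the members are processed.
    master = random.Random(random_state)
    seeds = [master.randrange(2**31 - 1) for _ in range(n_estimators)]
    if parallel:
        with ThreadPoolExecutor(max_workers=4) as pool:
            # map() returns results in the order of the input seeds.
            return list(pool.map(lambda s: fit_member(s, data), seeds))
    return [fit_member(s, data) for s in seeds]


data = list(range(100))
serial = fit_ensemble(data, n_estimators=8, random_state=42, parallel=False)
threaded = fit_ensemble(data, n_estimators=8, random_state=42, parallel=True)
```

With joblib, the parallel branch would presumably become `Parallel(n_jobs=...)(delayed(fit_member)(seed, data) for seed in seeds)`; the seeding logic stays the same.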

imblearn/ensemble/easy_ensemble_generalization.py Outdated

+                          sampler = clone(self.base_sampler_)
+                          sampler.set_params(random_state=sampler_seed)
+                          if hasattr(self.base_estimator_, 'random_state'):

Collaborator

glemaitre Jul 20, 2017

Should we reuse make_estimator?
https://github.com/scikit-learn/scikit-learn/blob/master/sklearn/ensemble/base.py#L119

imblearn/ensemble/easy_ensemble_generalization.py Outdated

+                      for i, (estimator_seed, sampler_seed) in enumerate(seeds):
+                          sampler = clone(self.base_sampler_)
+                          sampler.set_params(random_state=sampler_seed)

Collaborator

glemaitre Jul 20, 2017

We should create a make_sampler, similar to make_estimator.
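A rough sketch of what such a helper might look like, assuming it follows the same pattern of cloning the base object, seeding the copy, and optionally recording it. Everything here is hypothetical: `ToySampler`, `ToyEnsemble`, `_make_sampler`, and the `ratio` parameter are illustration names, and `copy.deepcopy` stands in for scikit-learn's `clone`.

```python
import copy


class ToySampler:
    """Hypothetical sampler with a random_state parameter."""

    def __init__(self, ratio="auto", random_state=None):
        self.ratio = ratio
        self.random_state = random_state


class ToyEnsemble:
    """Hypothetical ensemble that builds one sampler per member."""

    def __init__(self, base_sampler):
        self.base_sampler_ = base_sampler
        self.samplers_ = []

    def _make_sampler(self, append=True, random_state=None):
        # Copy the base sampler so the original is never mutated.
        sampler = copy.deepcopy(self.base_sampler_)
        if random_state is not None:
            sampler.random_state = random_state
        if append:
            self.samplers_.append(sampler)
        return sampler


ensemble = ToyEnsemble(ToySampler())
first = ensemble._make_sampler(random_state=1)
second = ensemble._make_sampler(random_state=2)
```

Keeping the copy-and-seed logic in one helper would let the loop body shrink to a single call per member.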

imblearn/ensemble/easy_ensemble_generalization.py Outdated


		from ..pipeline import Pipeline
		from ..under_sampling import RandomUnderSampler as ROS

Collaborator

glemaitre Jul 20, 2017

I would use the full name since we only use it once :) It might be more intuitive, and it is not a burden to write it once.

Owner Author

chkoar commented Jul 21, 2017

@glemaitre Since _set_random_states recursively sets the random_state of all nested objects, it let me remove some lines.
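The recursive behaviour described here can be sketched without scikit-learn. The following is a toy re-implementation of the idea, not the library's code: `ToyEstimator`, `set_random_states`, `build`, and `MAX_INT` are hypothetical names, and the sketch assumes nested parameters are exposed with the `<name>__<param>` convention.

```python
import random

MAX_INT = 2**31 - 1  # upper bound for drawn seeds (an assumption of this sketch)


class ToyEstimator:
    """Hypothetical stand-in for an object with get_params/set_params."""

    def __init__(self, random_state=None, **children):
        self.random_state = random_state
        self.children = children  # nested ToyEstimator objects, keyed by name

    def get_params(self, deep=True):
        params = {"random_state": self.random_state}
        for name, child in self.children.items():
            params[name] = child
            if deep:
                # Expose nested parameters as "<name>__<param>".
                for key, value in child.get_params(deep=True).items():
                    params[f"{name}__{key}"] = value
        return params

    def set_params(self, **params):
        for key, value in params.items():
            name, _, rest = key.partition("__")
            if rest:
                # Delegate "<name>__<param>" to the nested object.
                self.children[name].set_params(**{rest: value})
            elif name == "random_state":
                self.random_state = value
            else:
                self.children[name] = value
        return self


def set_random_states(estimator, seed):
    """Give every (nested) random_state parameter its own integer seed."""
    rng = random.Random(seed)
    to_set = {}
    for key in sorted(estimator.get_params(deep=True)):
        if key == "random_state" or key.endswith("__random_state"):
            to_set[key] = rng.randrange(MAX_INT)
    estimator.set_params(**to_set)
    return estimator


def build():
    # A pipeline-like object: a sampler plus a classifier with an inner model.
    return ToyEstimator(
        sampler=ToyEstimator(),
        classifier=ToyEstimator(inner=ToyEstimator()),
    )


pipeline = set_random_states(build(), seed=0)
```

Because a single call seeds the sampler, the classifier, and anything nested inside them, separate per-object seed arrays and `hasattr` checks are no longer needed in this sketch.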

Collaborator

glemaitre commented Jul 21, 2017

The remaining error could be because we are creating a meta-estimator, which should not go through the same common tests as a regular estimator:
https://github.com/scikit-learn/scikit-learn/blob/master/sklearn/utils/testing.py#L508

Owner Author

chkoar commented Jul 21, 2017

@glemaitre I can think of two options right now:

1. Don't bother right now
2. Copy the ensemble code

Collaborator

glemaitre commented Jul 21, 2017

We should be able to monkey patch as well, but I would go for option 1 for the moment.

Owner Author

chkoar commented Jul 21, 2017

@glemaitre "We should be able to monkey patch as well."

I didn't mention that on purpose, because I thought you wouldn't like this approach.

Collaborator

glemaitre commented Jul 21, 2017 via email

If this is a very small patch, I am for it ;)

codecov bot commented Jul 21, 2017

Codecov Report

Merging #4 into master will decrease coverage by <.01%.
The diff coverage is 97.93%.

@@            Coverage Diff             @@
##           master       #4      +/-   ##
==========================================
- Coverage   98.32%   98.31%   -0.01%     
==========================================
  Files          68       70       +2     
  Lines        3879     3975      +96     
==========================================
+ Hits         3814     3908      +94     
- Misses         65       67       +2

Impacted Files	Coverage Δ
imblearn/ensemble/__init__.py	`100% <100%> (ø)`	⬆️
...nsemble/tests/test_easy_ensemble_generalization.py	`100% <100%> (ø)`
imblearn/ensemble/easy_ensemble_generalization.py	`96.29% <96.29%> (ø)`

Legend:
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update 2c0628f...69069fb.


EasyEnsembleGeneralization Step1

aa1f233

chkoar force-pushed the easy_ensemble_generalization branch from 409737a to aa1f233

August 7, 2017 07:23


make travis happy

cd972c5

chkoar force-pushed the easy_ensemble_generalization branch from b71d8cd to cd972c5

August 7, 2017 07:54


make travis happy2

69069fb

chkoar force-pushed the master branch from cecd68f to edd7522

November 18, 2020 14:21
