Skip to content

using dask_ml with imblearn #701

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
sephib opened this issue Mar 29, 2020 · 4 comments
Closed

using dask_ml with imblearn #701

sephib opened this issue Mar 29, 2020 · 4 comments

Comments

@sephib
Copy link

sephib commented Mar 29, 2020

Hi,
I want to try and work on implementing dask_ml when running imblearn.
I've opened an issue on dask_ml issue # 317 however I thought it would be a good idea to ping this repo.
I wanted to ask if someone is already working on such implementation, and if not - is there any additional input that can assist while working on the above issue.

@chkoar
Copy link
Member

chkoar commented Apr 15, 2020

Do you want to contribute in imbalanced-learn or in dask_ml?

@sephib
Copy link
Author

sephib commented May 8, 2020

To dask_ml, not sure how this issue can be solved in imbalanced-learn

@glemaitre
Copy link
Member

The thing is that we would need to handle dask array and dask dataframe which is not compatible with scikit-learn (at least they will be converted to NumPy array). One need to modify the sampler to handle the dask array natively.

FYI: there is an effort to port the sampler there dask/dask-ml#638 (review)

@vishalvvs
Copy link

Any success in having imblearn capabilities in Dask-ml?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants