SMOTE for regression
-
09-12-2020 - |
Domanda
I am looking into upsampling an imbalanced dataset for a regression problem (Numerical target variables) in python.
I attached paper and R package that implement SMOTE for regression, can anyone recommend a similar package in Python? Otherwise, what other methods can be use to upsample the numerical target variable?
smoteRegress: SMOTE algorithm for imbalanced regression problems
Update:
I found the following python library which implements Synthetic Minority Over-Sampling Technique for Regression with Gaussian Noise
Soluzione
I think SMOGN will work for your problem. The method is described in a paper titled: "SMOGN: a Pre-processing Approach for Imbalanced Regression". You can find it on arXiv. There is also a python implementation called "SMOGN" which can be installed through PyPI. You can find the package description at https://pypi.org/project/smogn/