Question

I am looking into upsampling an imbalanced dataset for a regression problem (Numerical target variables) in python.

I attached paper and R package that implement SMOTE for regression, can anyone recommend a similar package in Python? Otherwise, what other methods can be use to upsample the numerical target variable?

SMOTE for Regression

smoteRegress: SMOTE algorithm for imbalanced regression problems

Update:

I found the following python library which implements Synthetic Minority Over-Sampling Technique for Regression with Gaussian Noise

smogn

Was it helpful?

Solution

I think SMOGN will work for your problem. The method is described in a paper titled: "SMOGN: a Pre-processing Approach for Imbalanced Regression". You can find it on arXiv. There is also a python implementation called "SMOGN" which can be installed through PyPI. You can find the package description at https://pypi.org/project/smogn/

Licensed under: CC-BY-SA with attribution
Not affiliated with datascience.stackexchange
scroll top