Pregunta

I'm trying to build a predictive model from about 1 million rows of data. My goal is to predict a certain numerical value.

I have the intuition that I should use very few numerical binary columns so I don't get data points that are too separated, a.k.a., the curse of dimensionality. Is this true? Besides, is it the same for numeric columns?

No hay solución correcta

Licenciado bajo: CC-BY-SA con atribución
scroll top