Data Balancing in Data Mining



Hi,

What do you think of the concept of Data Balancing in Data Mining. The
idea is to replicate the data in the data set such that the response
variable's outcomes are relatively equally distributed.

I feel that this will bias the dataset, am I correct? Comments?

.