Text this: Addressing imbalance in health datasets: A new method NR-clustering SMOTE and distance metric modification