We examined the effectiveness an optimized cluster-based undersampling technique.
We used a GA-based optimization approach for selecting the appropriate instances.
A critical issue of real-world knowledge extraction is the data imbalance problem.
The proposed method is successfully applied to the bankruptcy prediction problem.