p.2528
p.2533
p.2538
p.2542
p.2547
p.2552
p.2557
p.2562
p.2567
Sample Size on the Impact of Imbalance Learning
Abstract:
Classification of imbalanced data sets is widely used in many real life applications. Most state-of-the-art classification methods which assume the data sets are relatively balanced lose their efficiency. The paper discusses the factors which influence the modeling of a capable classifier in identifying rare events, especially for the factor of sample size. Carefully designed experiments using Rotation Forest as base classifier, carried on 3 datasets from UCI Machine Learning Repository based on weak show that, in particular imbalance ratio, increases the size of training set by unsupervised resample the large error rate caused by the imbalanced class distribution decreases. The common classification algorithm can reach good effect.
Info:
Periodical:
Pages:
2547-2551
Citation:
Online since:
September 2013
Authors:
Price:
Сopyright:
© 2013 Trans Tech Publications Ltd. All Rights Reserved
Share:
Citation: