p.1089
p.1095
p.1099
p.1104
p.1108
p.1112
p.1117
p.1125
p.1129
A Fuzzy C-Means Approach for Incomplete Data Sets Based on Nearest-Neighbor Intervals
Abstract:
Partially missing data sets are a prevailing problem in pattern recognition. In this paper, the problem of clustering incomplete data sets is considered, and missing attribute values are imputed by the centers of corresponding nearest-neighbor intervals. Firstly, the algorithm estimates the nearest-neighbor intervals of missing attribute values by using the attribute distribution information of the data sets sufficiently. Secondly, the missing attribute values are imputed by the center of the intervals so as to clustering incomplete data sets. The proposed algorithm introduces the nearest neighbor information into incomplete data clustering, and the comparisons of the experimental results for two UCI data sets demonstrate the capability of the proposed algorithm.
Info:
Periodical:
Pages:
1108-1111
Citation:
Online since:
September 2013
Authors:
Keywords:
Price:
Сopyright:
© 2013 Trans Tech Publications Ltd. All Rights Reserved
Share:
Citation: