Detection for Approximately Duplicate Records Based on Fuzzy Comprehensive Evaluation

Article Preview

Abstract:

To solve the problem of attribute weight determination in the approximately duplicate records, we put forward a method based on fuzzy comprehensive evaluation to get attribute weight in data set. We first perform an analysis of the composition factors of attribute. Then we carry out an evaluation of their rank. Finally, we make a determination of the attribute weight using the fuzzy comprehensive evaluation method, on the basis of which the approximately duplicate records are detected. Theoretical analysis and experimental results show that the method can objectively determine all attributes weight, and effectively detect the approximately duplicate records in massive data set.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

2464-2468

Citation:

Online since:

September 2013

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2013 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] Rahm E, Hong H D: IEEE data Engineering Bulletin vol. 23(2005), p.3.

Google Scholar

[2] Monge A: http: /citeseer. ist. psu. edu/Monge adaptive. html (2007-09-02).

Google Scholar

[3] Duvall Scott L , Fraser Alison M , Kerber Richard A , Mineau Geraldine P: Studies in health technology and informatics Vol. 160( 2010), p.1122.

Google Scholar

[4] Xingyi L, Congjian B, Huaji S: Journal of University of Electronic Science and Technology of China vol. 36 (2007), p.1273(In Chinese).

Google Scholar

[5] Jiang C, Tianxiang Y, Enhai Z: Shanxi Architecture vol. 38(2012)p.210(In Chinese).

Google Scholar

[6] Yang L, Song L , Jing H, Jinkai L: Journal of Software Vol. 7(2012), p.2217.

Google Scholar

[7] Zhongwen Y, Jin-yu Z, Ling Z, Hailin W: Application of Fuzzy Comprehensive Evaluation in Coal Mine Construction Project Management, Proceedings of International Conference on Future Information Technology and Management Engineering (FITME 2010) Vol 1, p.446.

DOI: 10.1109/fitme.2010.5656666

Google Scholar