p.1481
p.1489
p.1493
p.1497
p.1503
p.1508
p.1512
p.1517
p.1522
System of Fuzzy Duplicates Detection
Abstract:
In the paper we discuss the problem of fuzzy duplicate detecting. There are given the basic approaches to detection of text duplicates. We review the existing methods of fuzzy duplicate detecting. There is presented algorithm of fuzzy duplicate detection. Algorithm is based on method of shingles. We describe modification of algorithm. We propose to consider not all text of document but its processed and filtered copy. There is presented the structure of system for fuzzy duplicates detection. System checks text duplications in the internal database and in Internet.
Info:
Periodical:
Pages:
1503-1507
Citation:
Online since:
January 2014
Authors:
Keywords:
Price:
Сopyright:
© 2014 Trans Tech Publications Ltd. All Rights Reserved
Share:
Citation: