Research on Entity Resolution Algorithm Based on Domain Ontology Using MapReduce
Abstract:
The DO-Swoosh algorithm maps the input data to a domain ontology and counts the amount of data at each leaf node. It then defines the distance between nodes according to the hierarchical relationships reflected by the domain ontology, and proposes a node merging algorithm based on the principles of data balance and nearest merge. Finally, entity resolution is performed within each group according to the node merging result. DO-Swoosh retains good generality and achieves better performance and scalability with the aid of MapReduce.
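The grouping steps described in the abstract can be illustrated with a small sketch. The abstract does not give the paper's distance definition or merging procedure, so everything below is an assumption made for illustration: the toy ontology `PARENT`, the edge-count distance in `node_distance`, and the greedy rule in `merge_nodes` (repeatedly merge the smallest group into its nearest neighbour) are hypothetical stand-ins for "data balance" and "nearest merge", not the authors' method.

```python
from collections import Counter

# Hypothetical toy ontology: child -> parent (the root has parent None).
PARENT = {
    "Product": None,
    "Electronics": "Product",
    "Books": "Product",
    "Phone": "Electronics",
    "Laptop": "Electronics",
    "Novel": "Books",
    "Textbook": "Books",
}

def path_to_root(node):
    """Return the list of nodes from `node` up to the root, inclusive."""
    path = []
    while node is not None:
        path.append(node)
        node = PARENT[node]
    return path

def node_distance(a, b):
    """Assumed distance: number of edges on the tree path between a and b."""
    depth_in_a = {n: i for i, n in enumerate(path_to_root(a))}
    for j, n in enumerate(path_to_root(b)):
        if n in depth_in_a:  # first common ancestor
            return depth_in_a[n] + j
    raise ValueError("nodes are not in the same tree")

def group_distance(g1, g2):
    """Distance between groups: closest pair of leaves (an assumption)."""
    return min(node_distance(a, b) for a in g1 for b in g2)

def merge_nodes(leaf_counts, num_groups):
    """Greedily merge leaf nodes until only `num_groups` groups remain.

    Each round picks the group holding the least data (data balance) and
    merges it into the closest other group (nearest merge).
    """
    groups = [frozenset([leaf]) for leaf in sorted(leaf_counts)]

    def size(g):
        return sum(leaf_counts[leaf] for leaf in g)

    while len(groups) > num_groups:
        smallest = min(groups, key=lambda g: (size(g), sorted(g)))
        others = [g for g in groups if g != smallest]
        nearest = min(
            others,
            key=lambda g: (group_distance(smallest, g), size(g), sorted(g)),
        )
        groups = [g for g in others if g != nearest] + [smallest | nearest]
    return groups

# Step 1: map records to leaf nodes and count them (labels are made up).
leaf_counts = Counter(["Phone"] * 5 + ["Laptop"] + ["Novel"] * 2 + ["Textbook"] * 2)
# Step 2: merge leaves into two groups; each group would then be resolved
# independently, for example by one reducer in a MapReduce job.
groups = merge_nodes(leaf_counts, 2)
```

In this toy run the sparse `Laptop` leaf is folded into its sibling `Phone`, and `Novel` is merged with `Textbook`, so records are only compared within the same ontology branch. How the paper weighs balance against distance may differ from this greedy rule.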
Info:
Pages:
1669-1674
Online since:
August 2013
Copyright:
© 2013 Trans Tech Publications Ltd. All Rights Reserved