Semantic Clustering and Similarity Calculation for Social Tagging Systems

Article Preview

Abstract:

Social tagging systems allow Internet users to annotate resources with tags. Internet is an open system, which permits users the freedom to explore tags. However, the freedom afforded users comes at a cost: an uncontrolled vocabulary can result in tag ambiguity hindering navigation. So, a key question is how to harvest tag semantics from these systems. We present an algorithm of hierarchical tag clustering. With this algorithm, we clustering the tags into a semantic tree, then we turn every resource item into an induced tree. We propose a new method for resource retrieval, which based on semantic similarity. We present extensive experimental results on a real world dataset. When we retrieve the resource based on semantic similarity, our algorithm shows the high recall and precision. Furthermore, our algorithm demonstrates more utility for recommendation.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

1559-1564

Citation:

Online since:

June 2014

Authors:

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2014 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] J. Bar-Ilan, S. Shoham, A. Idan, Y. Miller, and A. Shachak. Structured vs. unstructured tagging—a case study. In Proc. WWW Collaborative Web Tagging Workshop, (2006).

DOI: 10.1108/14684520810914016

Google Scholar

[2] G. Begelman, P. Keller, and F. Smadja. Automated Tag Clustering: Improving search and exploration in the tagspace. Proc. of the Collaborative Web TaggingWorkshop at WWW, 6, (2006).

Google Scholar

[3] H. Chen and S. Dumais. Bringing order to the Web: automatically categorizing search results. Proceedings of theSIGCHI conference on Human factors in computing systems, pages 145–152, (2000).

DOI: 10.1145/332040.332418

Google Scholar

[4] J. Gemmell, A. Shepitsen, B. Mobasher, and R. Burke. Personalization in Folksonomies Based on Tag Clustering. Intelligent Techniques for Web Personalization &Recommender Systems, (2008).

Google Scholar

[5] J. Gemmell, A. Shepitsen, B. Mobasher, and R. Burke. Personalizing navigation in folksonomies using hierarchical tag clustering. Proceedings of the 10th International Conference on Data Warehousing and Knowledge Discovery, (2008).

DOI: 10.1007/978-3-540-85836-2_19

Google Scholar

[6] T. Hammond, T. Hannay, B. Lund, and J. Scott. Social Bookmarking Tools (I). D-Lib Magazine, 11(4): 1082–9873, (2005).

DOI: 10.1045/april2005-hammond

Google Scholar

[7] C. Hayes and P. Avesani. Using tags and clustering to identify topic-relevant blogs. International Conference on Weblogs and SocialMedia, March (2007).

Google Scholar

[8] P. Heymann and H. Garcia-Molina. Collaborative Creation of Communal Hierarchical Taxonomies in Social Tagging Systems. Technical report, Technical Report 2006-10, Computer Science Department, April (2006).

Google Scholar

[9] S. Bao, G. Xue, X. Wu, Y. Yu, B. Fei, and Z. Su. Optimizing web search using social annotations. In Proc. 16th Intl. Conf. on World Wide Web, pages 501–510, (2007).

DOI: 10.1145/1242572.1242640

Google Scholar

[10] G. Begelman, P. Keller, and F. Smadja. Automated tag clustering: Improving search and exploration in the tag space. In Proc. WWW Collaborative Web Tagging Workshop, (2006).

Google Scholar

[11] P. Boldi, M. Santini, and S. Vigna. Do your worst to make the best: Paradoxical effects in pagerank incremental computations. Internet Mathematics, 2(3): 387–404, (2005).

DOI: 10.1080/15427951.2005.10129106

Google Scholar

[12] S. Brin and L. Page. The Anatomy of a Large-Scale Hypertextual Web Search Engine. Computer Networks and ISDN Systems, 30(1-7): 107–117, April (1998).

DOI: 10.1016/s0169-7552(98)00110-x

Google Scholar

[13] C. Brooks and N. Montanez. Improved annotation of the blogosphere via autotagging and hierarchical clustering. In Proc. 15th Intl. Conf. on World Wide Web, pages 625–632, (2006).

DOI: 10.1145/1135777.1135869

Google Scholar