A Unified Microblog Web Page Structured Information Extraction Method Based on Hierarchical Clustering

Abstract:

Rich information is contributed to microblogs by millions of users all around the world. However, little work has been done so far on extracting information from microblog web pages. We propose a unified structured information extraction method based on hierarchical clustering that is suitable for the microblog web pages of any microblog website. Experimental results on microblog web pages from several popular microblog service providers indicate the high performance of our method.

Info:

Pages:

2489-2492

Online since:

September 2013

Copyright:

© 2013 Trans Tech Publications Ltd. All Rights Reserved
