Auto Detection of Top Part for Research Paper with a Mixed Method

Article Preview

Abstract:

The detection of elements in top part of research paper is very important, because these elements are often used as the search items by user. This paper provides a mixed method for auto detection of top part from research paper. The papers feature of keyword, layout and content similarity are mixed to accurately find the area of top part and recognize the elements in top part. Experiments show the advantage of our method over existing methods, and future work is also described in the paper.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

2691-2694

Citation:

Online since:

August 2013

Keywords:

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2013 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] Lawrence, C. L. Giles, and K. Bollacker. 1999. Digital Libraries and Autonomous Citation Indexing. IEEE Computer, 32(6): 67-71.

DOI: 10.1109/2.769447

Google Scholar

[2] McCallum, K. Nigam, J. Rennie, K. Seymore. 2000. Automating the Construction of Internet Portals with Machine Learning. Information Retrieval Journal, volume 3, pages 127-163. Kluwer. (2000).

DOI: 10.1023/a:1009953814988

Google Scholar

[3] Chao Guang Li, Ming Zhang, et al. Auto Extraction of metadata from thesis. Computer Engineering and Applications. Pages 189-191, 2002(21).

Google Scholar

[4] Min Yuh Day, Richard Tzong Han Tsai, Cheng Lung Sung, et al. Reference Metadata Extraction Using a Hierarchical Knowledge Representation Framework[J]. Decision Support Systems, 2007(43): 152-167.

DOI: 10.1016/j.dss.2006.08.006

Google Scholar

[5] Hu Y H, Li H, CaoY B, et al. Automatic Extraction of Titles from General Documents Using Maching Learning[J]. Information Processing and Management, pages 1276-1293.

DOI: 10.1016/j.ipm.2005.12.001

Google Scholar

[6] Yu J D, Fan X Z. Metadata Extraction from Chinese Research Papers Based on Conditional Random Fields. (2008).

Google Scholar

[7] PyPDF: http: /knowah. github. io/PyPDF2.

Google Scholar