An Improved Method for Mathematical Formula Extraction in Printed English and Chinese Documents
Abstract:
Accurately locating mathematical formulas in scientific documents is a prerequisite for recognizing them. Most existing formula extraction methods target documents in a single language and do not adapt well to documents in other languages. This paper describes an improved method that extracts formulas from both Chinese and English documents. First, run-number features are used to identify the document's language; then, according to the differences between Chinese and English documents, corresponding features and parameters are chosen for formula extraction. Experimental results show that this method improves the robustness of formula extraction.
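The abstract does not define its run-number feature in detail. As a rough illustration only, the sketch below assumes the common definition (the number of maximal runs of foreground pixels along a scanline of a binarized image) and assumes that denser Chinese glyphs produce more runs per scanline than Latin letters. The function names, the averaging scheme, and the threshold of 2.5 are hypothetical and are not taken from the paper.

```python
from typing import Sequence


def count_runs(line: Sequence[int]) -> int:
    """Count maximal runs of consecutive foreground (1) pixels in one scanline."""
    runs = 0
    previous = 0
    for pixel in line:
        # A new run starts wherever a foreground pixel follows a background pixel.
        if pixel == 1 and previous == 0:
            runs += 1
        previous = pixel
    return runs


def mean_run_number(image: Sequence[Sequence[int]]) -> float:
    """Average run count over all non-empty horizontal and vertical scanlines."""
    rows = [list(r) for r in image]
    cols = [list(c) for c in zip(*rows)]
    counts = [count_runs(line) for line in rows + cols]
    nonempty = [c for c in counts if c > 0]
    return sum(nonempty) / len(nonempty) if nonempty else 0.0


def guess_language(image: Sequence[Sequence[int]], threshold: float = 2.5) -> str:
    """Label a binarized character block; the threshold is a hypothetical value."""
    # Assumption: more runs per scanline suggests stroke-dense Chinese glyphs.
    return "chinese" if mean_run_number(image) > threshold else "english"
```

In practice such a threshold would have to be tuned on labeled samples at a fixed character size, since run counts depend on image resolution and binarization quality.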
Info:
Pages:
1174-1179
Online since:
January 2010
Copyright:
© 2010 Trans Tech Publications Ltd. All Rights Reserved