p.1004
p.1009
p.1013
p.1018
p.1024
p.1028
p.1033
p.1037
p.1042
Identifying Abbreviations in Biomedical Literature Based on Maximum Entropy with Web Features
Abstract:
The number of biomedical literatures is growing rapidly, and biomedical literature mining is becoming essential. A learning classifier based on maximum entropy (ME) for identifying abbreviations is proposed. Two innovative Web-based features for extracting additional semantic information are developed. The study shows the Web as a knowledge source can be incorporated effectively in the machine learning framework and significantly improves its performance. The ME classifier achieves 95% precision and 89% recall on the gold standard corpus “Medstract” and 91% precision and 84% recall on the larger test data that includes 128 full text literatures.
Info:
Periodical:
Pages:
1024-1027
Citation:
Online since:
July 2014
Authors:
Price:
Сopyright:
© 2014 Trans Tech Publications Ltd. All Rights Reserved
Share:
Citation: