The Study on Enlarging Specific Extractor for Technology-Related Named Entity Extraction from Text Collections of Applied Mechanics Field

Abstract:

This paper presents additional linguistic factors that should be considered in order to extract terms from machinery industry documents more effectively. We augment general term extraction patterns with patterns tailored to machinery industry documents, with the aim of improving precision and recall. This establishes a theoretical basis for developing a system that supports information research in the machinery industry. With such a system, we expect to increase the efficiency of the new business planning process in the machinery industry.

Info:

Pages:

451-454

Online since:

December 2011

Copyright:

© 2012 Trans Tech Publications Ltd. All Rights Reserved
