Annotation System for the Construction of Synchronous Grammar Tree Alignment Relationships

Article Preview

Abstract:

The construction of syntactic tree structures is vital to different Natural Language Processing applications. In the meanwhile, the use of monolingual or bilingual structures directly affects the quality of Machine Translation systems. However, manually annotated syntactic tree structures are not only considered as a time consuming task but also a very expensive work, and automatic construction approaches cannot always guarantee the quality of the syntactic trees. In this paper, a system for annotating and constructing synchronous grammar tree structures in a semi-automatic way is proposed. The system is built in the Web environment with a graphical display for users to review and modify alignment relationships between nodes of the tree and strings or trees of the other language. The core part relies on the parsing of Constraint Synchronous Grammar, and consists of several modules in establishing alignments at different levels between the languages, including relationships between syntactic tree and strings of the other language, and bilingual tree alignments. Moreover, it provides import functions in obtaining monolingual skeletal bracketing syntactic tree and Translation Corresponding Tree structures for the creation of synchronous rules in order to have wider applicability.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

3357-3361

Citation:

Online since:

January 2013

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2013 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] F. Wong, F. Oliveira, Y. P. Li, Hybrid Machine Aided Translation System based on Constraint Synchronous Grammar and Translation Corresponding Tree, Journal of Computers, 7(2) (2012), 309-316.

DOI: 10.4304/jcp.7.2.309-316

Google Scholar

[2] J. Waszczuk, K. Glowinska, A. Savary, A. Przepiorkowski, Tools and Methodologies for Annotating Syntax and Named Entities in the National Corpus of Polish, in: Proceedings of the International Multiconference on Computer Science and Information Technology (2010), 531-539.

DOI: 10.1109/imcsit.2010.5680057

Google Scholar

[3] M. T. Kromann, The Danish dependency treebank and the DTAG treebank tool, in: Proceedings of the 2nd Workshop on Treebanks and Linguistic Theories (2003), 217-220.

Google Scholar

[4] T. Kakkonen, DepAnn - An Annotation Tool for Dependency Treebanks, in: Proceedings of the 11th ESSLLI Session at the 18th European Summer School in Logic, Language and Information (2006).

Google Scholar

[5] A. Burchardt, E. Katrin, F. Anette, K. Andrea, P. Sebastian, SALTO: A versatile multi-level annotation tool, in: Proceedings of the 5th International Conference on Language Resources and Evaluation, (2006), 517-520.

Google Scholar

[6] K. Erk, A. Kowalski, S. Pado, The SALSA Annotation Tool, in: Proceedings of the Workshop on Prospects and Advances in the Syntax/Semantics Interface (2003).

Google Scholar

[7] M. Bertran, O. Borrega, M. Recasens, B. Soriano, AnCoraPipe: A tool for multilevel annotation, Procesamiento del Lenguaje Natural, 41 (2008), 291-292.

Google Scholar

[8] S. Montemagni, F. Barsotti, M. Battista, N. Calzolari, O. Corazzari, A. Lenci, A. Zampolli, F. Fanciulli, M. Massetani, R. Raffaeli, R. Basili, M. T. Pazienza, D. Saracino, F. Zanzotto, N. Mana, F. Pianesi, R. Delmonte, Building the Italian Syntactic-Semantic Treebank, Text, Speech and Language Technology, 20(1) (2003), 189-210.

DOI: 10.1007/978-94-010-0201-1_11

Google Scholar

[9] F. Wong, M. C. Dong, D. C. Hu, Machine Translation by Using Constraint-Based Synchronous Grammar, Tsinghua Science and Technology, 11(3) (2006), 295-306.

DOI: 10.1016/s1007-0214(06)70193-6

Google Scholar

[10] F. Wong, D. C. Hu, Y. H. Mao, A Flexible Example Annotation Schema: Translation Corresponding Tree Representation, in: Proceedings of the 20th International Conference on Computational Linguistics, (2004), 25-31.

DOI: 10.3115/1220355.1220510

Google Scholar

[11] M. Tomita, An efficient augmented-context-free parsing algorithm, Computational Linguistics, 13 (1987).

Google Scholar

[12] F. Wong, S. Chao, iSentenizer: An Incremental Sentence Boundary Classifier, in: Proceedings of the IEEE International Conference on Natural Language Processing and Knowledge Engineering, (2010).

DOI: 10.1109/nlpke.2010.5587856

Google Scholar