Chinese-Uyghur Sentences Alignment Using Multiple Clues
Abstract:
This paper introduces a new method for Chinese-Uyghur sentence alignment based on a two-step procedure. In the first step, multiple clues, such as proper names, technical terms, numbers, punctuation marks, location information and length information, are used to generate anchor sentences that satisfy certain conditions. In the second step, the texts are divided into segments using the anchor sentences as boundaries, and the sentences in each segment are then aligned with a length-based approach. By segmenting the texts in this way, the method avoids complex computation and error propagation. Experimental results show that the method achieves an average accuracy of 95.2% on multi-domain texts.
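The two-step procedure described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the anchor conditions or the length model, so everything below is an assumption. Only two of the listed clues are used (numbers, and location as relative position in the text), the `max_pos_gap` threshold, the `BEADS` penalties and the `ratio` parameter are hypothetical, and the length-based step is a simplified dynamic program over character lengths rather than a probabilistic model.

```python
import re

NUM = re.compile(r"\d+(?:\.\d+)?")

def numbers(sentence):
    """Number clue: the sorted list of numeric tokens in a sentence."""
    return sorted(NUM.findall(sentence))

def find_anchors(src, tgt, max_pos_gap=0.3):
    """Step 1 (assumed conditions): pair sentences that contain the same
    numbers and sit at similar relative positions. Anchors are monotone."""
    anchors, next_j = [], 0
    for i, s in enumerate(src):
        clue = numbers(s)
        if not clue:
            continue
        for j in range(next_j, len(tgt)):
            near = abs(i / len(src) - j / len(tgt)) <= max_pos_gap
            if near and numbers(tgt[j]) == clue:
                anchors.append((i, j))
                next_j = j + 1
                break
    return anchors

# (source sentences, target sentences, penalty) for each allowed bead type.
# The penalty values are illustrative assumptions.
BEADS = [(1, 1, 0.0), (1, 0, 3.0), (0, 1, 3.0), (2, 1, 1.0), (1, 2, 1.0)]

def align_by_length(src, tgt, ratio=1.0):
    """Step 2 (simplified): dynamic programming over character lengths.
    `ratio` is the assumed target/source length ratio."""
    n, m = len(src), len(tgt)
    inf = float("inf")
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    back = [[None] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(n + 1):
        for j in range(m + 1):
            if cost[i][j] == inf:
                continue
            for di, dj, penalty in BEADS:
                if i + di > n or j + dj > m:
                    continue
                ls = sum(len(x) for x in src[i:i + di]) * ratio
                lt = sum(len(x) for x in tgt[j:j + dj])
                c = cost[i][j] + abs(ls - lt) / max(ls + lt, 1) + penalty
                if c < cost[i + di][j + dj]:
                    cost[i + di][j + dj] = c
                    back[i + di][j + dj] = (di, dj)
    # Walk the back-pointers from the end to recover the bead sequence.
    beads, i, j = [], n, m
    while i or j:
        di, dj = back[i][j]
        beads.append((list(range(i - di, i)), list(range(j - dj, j))))
        i, j = i - di, j - dj
    return beads[::-1]

def align(src, tgt):
    """Split both texts at the anchors, then align each segment by length."""
    result, si, tj = [], 0, 0
    # A sentinel "anchor" past the end closes the final segment.
    for ai, aj in find_anchors(src, tgt) + [(len(src), len(tgt))]:
        for s_idx, t_idx in align_by_length(src[si:ai], tgt[tj:aj]):
            result.append(([si + k for k in s_idx], [tj + k for k in t_idx]))
        if ai < len(src):
            result.append(([ai], [aj]))  # the anchor pair itself
        si, tj = ai + 1, aj + 1
    return result
```

With placeholder input such as `src = ["aaaa", "bb 2014 bb", "cccccccc", "dd"]` and `tgt = ["AAAA", "BB 2014 BB", "CCCC", "CCCC", "DD"]`, the sentence containing `2014` becomes the only anchor, and each of the two segments around it is aligned independently. A mistake inside one segment therefore cannot shift the alignment of another, and each dynamic-programming table only covers one segment instead of the whole text.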
Info:
Pages:
4990-4995
Online since:
July 2014
Copyright:
© 2014 Trans Tech Publications Ltd. All Rights Reserved