p.386
p.390
p.394
p.399
p.403
p.409
p.416
p.421
p.426
Improved Backtracking-Forward Algorithm for Maximum Matching Chinese Word Segmentation
Abstract:
In order to improve the accuracy of segmentation, analysis of backtracking-forward maximum matching algorithm exists two defects when dealing with crossing ambiguity, and on this basis, an improved-backtracking forward algorithm for maximum matching algorithm is presented. The improved algorithm is based on the backtracking-forward maximum matching algorithm and adds a module, a chain length of one and 3-words, that can detect and process crossing ambiguity, and taking advantage of counting method, we can merely sort out the defragmenter fields that occurred crossing ambiguity. A number of selected language corpus tests prove that under the premise of the segmentation speed, the improved algorithm can enhance the segmentation accuracy.
Info:
Periodical:
Pages:
403-406
Citation:
Online since:
April 2014
Authors:
Price:
Сopyright:
© 2014 Trans Tech Publications Ltd. All Rights Reserved
Share:
Citation: