A Pattern Matching Algorithm for Double-Type Characters

Abstract:

A fast and efficient algorithm is proposed for multi-pattern matching of double-byte strings, such as Chinese text, a problem that differs substantially from single-byte string matching. The algorithm uses a double cross-link data list and two finite prefix automata to match each double-byte character, which avoids the storage expansion caused by a double-byte cross data link table. The method requires less storage than a double-byte cross data link table, and its efficiency is of the same order of magnitude as that of the single-byte cross-link table approach.
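
The storage argument in the abstract can be illustrated with a small sketch. It is not the authors' cross-link list or their two prefix automata, which this preview does not describe in detail. It swaps in a standard Aho-Corasick automaton and only shows the general idea of resolving each double-byte character through two single-byte lookups (high byte, then low byte), so that no state needs a 65,536-entry transition table. The names `split_char` and `DoubleByteMatcher` are hypothetical, and the sketch assumes every character fits in 16 bits.

```python
from collections import deque


def split_char(ch):
    """Split a 16-bit character into (high byte, low byte)."""
    code = ord(ch)
    if code > 0xFFFF:
        raise ValueError("this sketch assumes 16-bit characters")
    return code >> 8, code & 0xFF


class DoubleByteMatcher:
    """Aho-Corasick automaton with a two-level (high byte, low byte) table."""

    def __init__(self, patterns):
        # goto[state][high][low] -> next state; rows exist only where needed.
        self.goto = [{}]
        self.fail = [0]
        self.out = [[]]
        for pattern in patterns:
            self._add(pattern)
        self._build_failure_links()

    def _step(self, state, hi, lo):
        # Two single-byte lookups instead of one double-byte lookup.
        return self.goto[state].get(hi, {}).get(lo)

    def _add(self, pattern):
        state = 0
        for ch in pattern:
            hi, lo = split_char(ch)
            nxt = self._step(state, hi, lo)
            if nxt is None:
                nxt = len(self.goto)
                self.goto.append({})
                self.fail.append(0)
                self.out.append([])
                self.goto[state].setdefault(hi, {})[lo] = nxt
            state = nxt
        self.out[state].append(pattern)

    def _build_failure_links(self):
        # Breadth-first, so shallower states are finished before deeper ones.
        queue = deque()
        for row in self.goto[0].values():
            queue.extend(row.values())  # depth-1 states fail to the root
        while queue:
            state = queue.popleft()
            for hi, row in self.goto[state].items():
                for lo, child in row.items():
                    f = self.fail[state]
                    while f and self._step(f, hi, lo) is None:
                        f = self.fail[f]
                    nxt = self._step(f, hi, lo)
                    self.fail[child] = nxt if nxt is not None else 0
                    # Inherit patterns that end at the failure target.
                    self.out[child] = self.out[child] + self.out[self.fail[child]]
                    queue.append(child)

    def search(self, text):
        """Return (start index, pattern) for every match in text."""
        hits = []
        state = 0
        for i, ch in enumerate(text):
            hi, lo = split_char(ch)
            while state and self._step(state, hi, lo) is None:
                state = self.fail[state]
            nxt = self._step(state, hi, lo)
            state = nxt if nxt is not None else 0
            for pattern in self.out[state]:
                hits.append((i - len(pattern) + 1, pattern))
        return hits


matcher = DoubleByteMatcher(["中国", "国人", "人"])
print(matcher.search("中国人民"))
# [(0, '中国'), (1, '国人'), (2, '人')]
```

In this example the characters 中 (U+4E2D) and 人 (U+4EBA) share the high byte 0x4E, so the root state stores them under a single first-level entry with two low-byte entries beneath it. Whether this layout saves as much storage as the structure proposed in the paper is not something the sketch establishes.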

Info:

Pages:

461-464

Online since:

June 2014

Copyright:

© 2014 Trans Tech Publications Ltd. All Rights Reserved
