The Research of Self-Organizing Maps Based on Document Collections

Article Preview

Abstract:

Web text mining is a new issue in the knowledge discovery research field. It is aimed to help people discover knowledge from large quantities of semi-structured or unstructured text in the web. Several approaches, including some pure and hybrid information retrieval (IR) methods, have been proposed to tackle such an issue. Among these approaches, combining the Self-Organizing Map (SOM) method with the principles of the vector-space model, appears to be a promising alternative for the traditional purely IR-based methods in this problem domain. The encoded documents are organized on another self-organizing map, a document map, on which nearby locations contain similar documents. Special consideration is given to the computation of very large document maps which is possible with general-purpose computers if the dimensionality of the word category histograms is first reduced with a random mapping method and if computationally efficient algorithms are used in computing the SOMs.

You might also be interested in these eBooks

Info:

Periodical:

Advanced Materials Research (Volumes 430-432)

Pages:

1232-1235

Citation:

Online since:

January 2012

Authors:

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2012 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] T. Honkela, S. Kaski, T. Kohonen, K. Lagus, Self-organizing maps of very large document collections: Justification for the WEBSOM method, in: I. Balderjahn, R. Mathar, M. Schader (Eds. ), Classification, Data Analysis and Data Highways, Springer, Berlin(2008).

DOI: 10.1007/978-3-642-72087-1_27

Google Scholar

[2] T. Honkela, S. Kaski, K. Lagus, T. Kohonen, WEBSOM - self-organizing maps of document collections, Proc. WSOM2007, Workshop on Self-Organizing Maps, Helsinki University of Technology, Neural Networks Research Centre, Espoo, Finland(2007).

DOI: 10.1016/s0925-2312(98)00039-3

Google Scholar

[3] T. Kohonen, Speedy SOM, Report A33, Laboratory of Computer and Information Science, (Helsinki University of Technology, 2006).

Google Scholar

[4] T. Kohonen, Self-Organizing Maps, 2nd extended ed., (Springer, Berlin, 2007).

Google Scholar

[5] T. Kohonen, The self-organizing map (SOM), Neuro-computing, 21 (2002) 171-179.

Google Scholar

[6] H. Ritter, T. Kohonen, Self-organizing semantic maps, Biol. Cybernet. 61 (2007) 241-254.

DOI: 10.1007/bf00203171

Google Scholar