Paper Title:
Semi-Supervised Classification with Co-Training for Deep Web
  Abstract

The main problems in Web page classification are the scarcity of labeled data and the cost of labeling unlabeled data. In this paper we discuss the application of co-training, a semi-supervised machine learning method, to the classification of Deep Web query interfaces in order to boost classifier performance. Bayes and Maximum Entropy classifiers cooperate to incorporate unlabeled data alongside labeled data incrementally during training. Our experimental results show that the proposed approach achieves promising performance.
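
The abstract does not spell out the training procedure, so the following is only a minimal sketch of how a Bayes classifier and a Maximum Entropy classifier might cooperate in a co-training loop. Everything in it is an assumption made for illustration: the names `NaiveBayes`, `MaxEnt`, `co_train`, and `predict`, the rule that each classifier labels its single most confident unlabeled example per round, the use of one shared token view, and the toy query-interface data. It should not be read as the authors' implementation.

```python
import math
from collections import Counter, defaultdict

def softmax(scores):
    """Turn a dict of class -> score into class -> probability."""
    m = max(scores.values())
    exps = {c: math.exp(s - m) for c, s in scores.items()}
    z = sum(exps.values())
    return {c: e / z for c, e in exps.items()}

class NaiveBayes:
    """Multinomial Naive Bayes with add-one smoothing over token lists."""
    def fit(self, docs, labels):
        self.classes = sorted(set(labels))
        self.prior = Counter(labels)
        self.counts = {c: Counter() for c in self.classes}
        for doc, y in zip(docs, labels):
            self.counts[y].update(doc)
        self.vocab = {t for doc in docs for t in doc}
        return self

    def predict_proba(self, doc):
        n = sum(self.prior.values())
        logp = {}
        for c in self.classes:
            total = sum(self.counts[c].values()) + len(self.vocab)
            logp[c] = math.log(self.prior[c] / n) + sum(
                math.log((self.counts[c][t] + 1) / total)
                for t in doc if t in self.vocab)  # unseen tokens are skipped
        return softmax(logp)

class MaxEnt:
    """Multinomial logistic regression (maximum entropy), plain SGD."""
    def __init__(self, epochs=100, lr=0.1):
        self.epochs, self.lr = epochs, lr

    def fit(self, docs, labels):
        self.classes = sorted(set(labels))
        self.w = defaultdict(float)  # one weight per (class, token)
        for _ in range(self.epochs):
            for doc, y in zip(docs, labels):
                p = self.predict_proba(doc)
                for c in self.classes:
                    grad = (1.0 if c == y else 0.0) - p[c]
                    for t in doc:
                        self.w[(c, t)] += self.lr * grad
        return self

    def predict_proba(self, doc):
        return softmax({c: sum(self.w.get((c, t), 0.0) for t in doc)
                        for c in self.classes})

def predict(model, doc):
    probs = model.predict_proba(doc)
    return max(probs, key=probs.get)

def co_train(labeled, unlabeled, rounds=10):
    """labeled: list of (tokens, label); unlabeled: list of token lists."""
    labeled, pool = list(labeled), list(unlabeled)
    for _ in range(rounds):
        if not pool:
            break
        docs, ys = zip(*labeled)
        for model in (NaiveBayes().fit(docs, ys), MaxEnt().fit(docs, ys)):
            if not pool:
                break
            # Each classifier moves its most confident pool example,
            # with its predicted label, into the shared labeled set.
            best = max(pool, key=lambda d: max(model.predict_proba(d).values()))
            labeled.append((best, predict(model, best)))
            pool.remove(best)
    docs, ys = zip(*labeled)
    return NaiveBayes().fit(docs, ys), MaxEnt().fit(docs, ys)

# Hypothetical toy data: field names from query interfaces in two domains.
labeled = [(["title", "author", "isbn"], "books"),
           (["from", "to", "depart"], "flights")]
unlabeled = [["author", "publisher", "title"],
             ["depart", "return", "from"],
             ["isbn", "publisher"],
             ["return", "to", "passengers"]]
nb, me = co_train(labeled, unlabeled)
print(predict(nb, ["publisher", "author"]), predict(me, ["passengers", "return"]))
```

In this toy run, tokens such as `publisher` and `passengers` never appear in the two labeled examples; the classifiers can only use them after the unlabeled interfaces have been pseudo-labeled and folded into the training set, which is the incremental effect the abstract describes.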

  Info
Periodical
Key Engineering Materials (Volumes 439-440)
Edited by
Yanwen Wu
Pages
183-188
DOI
10.4028/www.scientific.net/KEM.439-440.183
Citation
W. Fang, Z. M. Cui, "Semi-Supervised Classification with Co-Training for Deep Web", Key Engineering Materials, Vols. 439-440, pp. 183-188, 2010
Online since
June 2010

Authors: Li Min Wang, Xiong Fei Li, Xue Cheng Wang
Abstract:Dimensionality reduction is useful for improving the performance of Bayesian networks. In this paper we suggest an effective method of...
240
Authors: Dech Thammasiri, Phayung Meesad
Chapter 17: Metrology and Measurement
Abstract:In this research we propose an ensemble classification technique based on creating classification from a variety of techniques such as...
6572
Authors: Wei Mei Zhi, Hua Ping Guo, Ming Fan
Chapter 4: Data, Image and Signal Processing
Abstract:Most classifiers lose efficiency with the problem of imbalanced class distribution, which, however, often shows statistical significant in...
622
Authors: Jia Qiang Dong
Chapter 18: Computer Applications in Industry and Engineering
Abstract:The Web database's classification is the key step which integrates with the Web database classification and retrieves. The traditional search...
2920
Authors: Jun Zheng Shi, Lei Guo, Shi Min Wei
Chapter 3: Hardware, Information Technology and System
Abstract:A great demand of sentiment classification comes with the rapid development of the internet. At present, the methods about sentiment...
553