Study of Active-Learning-Based Theme Crawling System


Article Preview

This paper aims to design the active-learning-based theme crawling system and to realize the capacity of active learning of theme crawling by the use of semantic functions of ontology. The most conspicuous feature of this crawling system is the introduction of iterative process of ontology-incremental active learning. With the above circulating iterative process, the system can capture large quantities of theme-related web pages and the relatedness of the obtained web pages and the capturing rate are obviously superior to the crawling system without the ontology.



Edited by:

Zhengyi Jiang, Yugui Li, Xiaoping Zhang, Jianmei Wang and Wenquan Sun




B. Ren et al., "Study of Active-Learning-Based Theme Crawling System", Applied Mechanics and Materials, Vols. 220-223, pp. 2852-2856, 2012

Online since:

November 2012




[1] Sure Y, Angele J, Erdmann M, Staab S, Studer R, Wenke D. OntoEdit: Collaborative ontology engineering for the semantic Web. In: Horrocks I, Hendler JA, eds. Proc. of the ISWC 2002. Heidelberg: Springer-Verlag, 2002. p.221−235.


[2] Bechhofer S, Horrocks I, Goble C, Stevens R. OILed: A reason-able ontology editor for the semantic Web. In: Baader F, Brewka G, Eiter T, eds. Proc. of the KI 2001, Joint German/Austrian Conf. on AI. Heidelberg: Springer-Verlag, 2001. p.396−408.


[3] Noy NF, Fergerson RW, Musen MA. The knowledge model of protégé-2000: Combining interoperability and flexibility. In: Dieng R, Corby O, eds. Proc. of the EKAW 2000. Heidelberg: Springer-Verlag, 2000. p.17−32.


[4] Missikoff M, Navigli R, Velardi P. Integrated approach for web ontology learning and engineering. IEEE Computer, 2002, 35(11). pp.60-63.