Semi-Supervised Word Sense Disambiguation via Context Weighting

Article Preview

Abstract:

Word sense disambiguation as a central research topic in natural language processing can promote the development of many applications such as information retrieval, speech synthesis, machine translation, summarization and question answering. Previous approaches can be grouped into three categories: supervised, unsupervised and knowledge-based. The accuracy of supervised methods is the highest, but they suffer from knowledge acquisition bottleneck. Unsupervised method can avoid knowledge acquisition bottleneck, but its effect is not satisfactory. With the built-up of large-scale knowledge, knowledge-based approach has attracted more and more attention. This paper introduces a new context weighting method, and based on which proposes a novel semi-supervised approach for word sense disambiguation. The significant contribution of our method is that thesaurus and machine learning techniques are integrated in word sense disambiguation. Compared with the state of the art on the test data of the English all words disambiguation task in Sensaval-3, our method yields obvious improvements over existing methods in nouns, adjectives and verbs disambiguation.

You might also be interested in these eBooks

Info:

Periodical:

Advanced Materials Research (Volumes 1049-1050)

Pages:

1327-1338

Citation:

Online since:

October 2014

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2014 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

* - Corresponding Author

[1] C. Stokoe. Differentiating Homonymy and Polysemy in Information Retrieval. Proc. Conf. Human Language Technology and Empirical Methods in Natural Language Processing. 2005, pp: 403-410.

DOI: 10.3115/1220575.1220626

Google Scholar

[2] R. Sproat, J. Hirschberg, D. Yarowsky. A corpus-based synthesizer. Proceedings of the International Conference on Spoken Language Processing. (1992).

DOI: 10.21437/icslp.1992-130

Google Scholar

[3] D. Vickrey, L. Biewald, M. Teyssier, D. Koller. Word-Sense Disambiguation for Machine Translation. Proc. Conf. Human Language Technology and Empirical Methods in Natural Language Processing. 2005, pp: 771-778.

DOI: 10.3115/1220575.1220672

Google Scholar

[4] R. Barzilay, M. Elhadad. Using Lexical Chains for Text Summarization. Proc. ACL Workshop Intelligent Scalable Text Summarization. 1997, pp: 10-17.

Google Scholar

[5] G. Ramakrishnan, A. Jadhav, A. Joshi, S. Chakrabarti, P. Bhattacharyya. Question Answering via Bayesian Inference on Lexical Relations. Proc. ACL Workshop Multilingual Summarization and Question Answering. 2003, pp: 1-10, (2003).

DOI: 10.3115/1119312.1119313

Google Scholar

[6] Van-Nam Huynh, Tri Thanh Nguyen, Cuong Anh Le. Adaptively entropy-based weighting classifiers in combination using Dempster-Shafer theory for word sense disambiguation. Computer Speech and Language. 2010. 24: 461-473.

DOI: 10.1016/j.csl.2009.06.003

Google Scholar

[7] Roberto Navigli. Word Sense Disambiguation: A Survey. ACM Computing Surveys. 2009. 41(2): 1011-1069.

DOI: 10.1145/1459352.1459355

Google Scholar

[8] C. Fellbaum. WordNet: An electronic lexical database. MIT press Cambridge, MA, (1998).

Google Scholar

[9] R. L. Rivest. Learning decision lists. Machine learning. 1987. 2(3): 229-246.

Google Scholar

[10] D. Yarowsky. Hierarchical decision lists for word sense disambiguation. Computers and the Humanities. 2000. 34(1): 179-186.

Google Scholar

[11] Geoffrey Towell, Ellen M. Voorhees. Disambiguating highly ambiguous words. Computational Linguistics. 1998. 24(l): 125-145.

Google Scholar

[12] A. Azzini, C. da Costa Pereira, M. Dragoni, A. G. B. Tettamanzi. Evolving Neural Word Sense Disambiguation Classifiers with a Letter-Count Distributed Encoding. ARTIFICIAL LIFE AND EVOLUTIONARY COMPUTATION. 2010. 111-120.

DOI: 10.1142/9789814287456_0010

Google Scholar

[13] Dan Klein, Kristina Toutanova, H. Tolga Ilhan, Sepandar D. Kamvar, Christopher D. Manning. Combining heteregeneous classifiers for Word-Sense disambiguation. the SIGLEX/SENSEVAL Workshop on Word Sense Disambiguation: Recent Successes and Future Directions. 2002. pp: 74-80.

DOI: 10.3115/1118675.1118686

Google Scholar

[14] RADU FLORIAN, SILVIU CUCERZAN, CHARLES SCHAFER, DAVID YAROWSKY. Combining classifiers for word sense disambiguation. Natural Language Engineering. 2002. 8(04): 327-341.

DOI: 10.1017/s1351324902002978

Google Scholar

[15] Hinrich Schütze. Automatic word sense discrimination. Computational Linguistics. 1998. 24(1): 97-123.

Google Scholar

[16] Ted. Pedersen, Rebecca. Bruce. Distinguishing word senses in untagged text. the 1997 Conference on Empirical Methods in Natural Language Processing. 1997, pp: 197-207.

Google Scholar

[17] Amruta Purandare and Ted Pedersen. Word sense discrimination by clustering contexts in vector and Similarity spaces. the Conference on Computational Natural Language Learning. 2004, pp: 41-48.

Google Scholar

[18] Navigli Roberto, Lapata Mirella. An Experimental Study of Graph Connectivity for Unsupervised Word Sense Disambiguation. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE. 2010. 32(4): 678-692.

DOI: 10.1109/tpami.2009.36

Google Scholar

[19] Ehsan Hessami, Faribourz Mahmoudi, Amir Hossien Jadidinejad. Unsupervised Weighted Graph for Word Sense Disambiguation. WICT. 2011, pp: 733-737.

DOI: 10.1109/wict.2011.6141337

Google Scholar

[20] Yuret Deniz, Yatbaz Mehmet Ali. The Noisy Channel Model for Unsupervised Word Sense Disambiguation. COMPUTATIONAL LINGUISTICS. 2010. 36(1): 111-127.

DOI: 10.1162/coli.2010.36.1.36103

Google Scholar

[21] Ping Chen, Wei Ding, Max Choly, Chris Bowes. Word Sense Disambiguation with Automatically Acquired Knowledge. Intelligent Systems. 2010. PP(99): 1-7.

DOI: 10.1109/mis.2010.134

Google Scholar

[22] Satanjeev Banerjee, Ted Pedersen. Extended Gloss Overlaps as A Measure of Semantic Relatedness. the 18th International Joint Conference on Artificial Intelligence. 2003, pp.805-810.

Google Scholar

[23] Eneko Agirre, Aitor Soroa, Mark Stevenson. Graph-based Word Sense Disambiguation of Biomedical Documents. Computer and Information Science. 2010. 26(22): 2889-2896.

DOI: 10.1093/bioinformatics/btq555

Google Scholar

[24] Tae-Gil Noh, Seong-Bae Park, Sang-Jo Lee. Unsupervised Word Sense Disambiguation in Biomedical Texts with Co-occurrence Network and Graph Kernel. the ACM Fourth International Workshop on Data and Text Mining in Biomedical Informatics. 2010, pp: 61-64.

DOI: 10.1145/1871871.1871883

Google Scholar

[25] Federica Mandreoli, Riccardo Martoglia. Knowledge-based sense disambiguation (almost) for all structures. Information Systems. 2010. 36(2): 406-430.

DOI: 10.1016/j.is.2010.08.004

Google Scholar

[26] Timothy Baldwin, Su Nam Kim, Francis Bond, Sanae Fujita, David Martinez and Takaaki Tanaka. A Reexamination of MRD-Based Word Sense Disambiguation. ACM Transactions on Asian Language Information Processing. 2010. 9(1).

DOI: 10.1145/1731035.1731039

Google Scholar

[27] Hwang Myunggwon, Choi Chang, Kim Pankoo. Automatic Enrichment of Semantic Relation Network and Its Application to Word Sense Disambiguation. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING. 2011. 23(6): 845-858.

DOI: 10.1109/tkde.2010.163

Google Scholar

[28] Wei Jan Lee, Edwin Mit. Word Sense Disambiguation by Using Domain Knowledge. STAIR. 2011, pp: 237-242.

Google Scholar

[29] Heyan Huang, Wenpeng Lu. Knowledge-based Word Sense Disambiguation with Feature Words Based on Dependency Relation and Syntax Tree. International Journal of Advancements in Computing Technology. 2011. 3(8): 73-81.

DOI: 10.4156/ijact.vol3.issue8.9

Google Scholar

[30] S. Patwardhan, S. Banerjee, T. Pedersen. UMND1: Unsupervised Word Sense Disambiguation Using Contextual Semantic Relatedness. the 4th International Workshop on Semantic Evaluations. 2007, pp: 390-393.

DOI: 10.3115/1621474.1621560

Google Scholar

[31] T. Pedersen, V. Kolhatkar. WordNet: SenseRelate: AllWords -A Broad Coverage Word Sense Tagger that Maximizes Semantic Relatedness. NAACL HLT 2009. 2009, pp: 17-20.

DOI: 10.3115/1620959.1620964

Google Scholar

[32] Diana McCarthy, Rob Koeling, Julie Weeds, John Carroll. Unsupervised Acquisition of Predominant Word Senses. Computational Linguistics. 2007. 33(4): 553-590.

DOI: 10.1162/coli.2007.33.4.553

Google Scholar

[33] Eneko Agirre, Oier Lopez de Lacalle, Aitor Soroa. Knowledge-Based WSD on Specific Domains: Performing Better than Generic Supervised WSD. the Twenty-First International Joint Conference on Artificial Intelligence. 2009, pp: 1501-1506.

Google Scholar

[34] Zhimao Lu, Ting Liu, Gang Zhang, Sheng Li. Word Sense Disambiguation Based on Dependency Relationship Analysis and Bayes Model. High Technology Letters. 2003. 13(5): 1-7.

Google Scholar

[35] P. Chen, W. Ding, C. Bowes and D. Brown. A Fully Unsupervised Word Sense Disambiguation Method Using Dependency Knowledge. Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the ACL. 2009, pp: 28-36.

DOI: 10.3115/1620754.1620759

Google Scholar

[36] Lu Wenpeng, Huang Heyan, Zhu Chaoyong. Feature Words Selection for Knowledge-based Word Sense Disambiguation with Syntactic Parsing. PRZEGLAD ELEKTROTECHNICZNY. 2012. 88: 82-87.

Google Scholar

[37] Heyan Huang, Wenpeng Lu. Knowledge-based Word Sense Disambiguation with Feature Words Based on Dependency Relation and Syntax Tree. IJACT. 2011. 3(8): 73-81.

DOI: 10.4156/ijact.vol3.issue8.9

Google Scholar

[38] Tesnière L. Elements of Structural Syntax. Klincksieck. (1959).

Google Scholar

[39] N. Chomsky. Three models for the description of language. Institute of Radio Engineers Transactions on Information Theory 2. 1956, pp: 113-124.

Google Scholar

[40] P. Jin. Researches on Some Key Issues of Word Sense Disambiguation. PhD dissertation. Peking University. (2009).

Google Scholar

[41] Claudia Leacock, Martin Chodorow. Combining local context and WordNet similarity for word sense identification. In Fellbaum. 1998, pp: 265-283.

DOI: 10.7551/mitpress/7287.003.0018

Google Scholar