Product Name Recognition for Informal Text: Exploring Features

Article Preview

Abstract:

Product name recognition refers to locate the name of product in text automatically. We built a conditional random field to recognize product name from forum posts and explored various features to compare their impact on the performance. These features include not only traditional features used for NER but also distributed word representations which are novel features obtained from new area of machine learning.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

617-620

Citation:

Online since:

May 2014

Authors:

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2014 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

* - Corresponding Author

[1] R. Grishman and B. Sundheim, Message Understanding Conference-6: A Brief History, in Proceedings of the 16th Conference on Computational Linguistics, 1996, pp.466-471.

DOI: 10.3115/992628.992709

Google Scholar

[2] A. McCallum and W. Li, Early results for named entity recognition with conditional random fields, feature induction and web-enhanced lexicons, in Proceedings of the seventh conference on Natural language learning at HLT-NAACL, 2003, pp.188-191.

DOI: 10.3115/1119176.1119206

Google Scholar

[3] M. Craven and J. Kumlien, Constructing biological knowledge bases by extracting information from text sources, in Intelligent Systems in Molecular Biology. vol. 1999, 1999, pp.77-86.

Google Scholar

[4] R. Bunescu and R. J. Mooney, Relational markov networks for collective information extraction, in Proceedings of the ICML-2004 Workshop on Statistical Relational Learning and its Connections to Other Fields, (2004).

DOI: 10.3115/1218955.1219011

Google Scholar

[5] E. Minkov, et al., Extracting personal names from email: applying named entity recognition to informal text, in Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing, 2005, pp.443-450.

DOI: 10.3115/1220575.1220631

Google Scholar

[6] A. Ritter, et al., Named entity recognition in tweets: an experimental study, in Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2011, pp.1524-1534.

Google Scholar

[7] M. Tkachenko and A. Simanovsky, Named entity recognition: Exploring features, in Proceedings of KONVENS, 2012, pp.118-127.

Google Scholar

[8] P. F. Brown, et al., Class-based n-gram models of natural language, Computational linguistics, vol. 18, pp.467-479, (1992).

Google Scholar

[9] J. Turian, et al., Word representations: a simple and general method for semi-supervised learning, in Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, 2010, pp.384-394.

Google Scholar

[10] J. Lafferty, et al., Conditional random fields: Probabilistic models for segmenting and labeling sequence data, in Proceedings of the Eighteenth International Conference on Machine Learning, (2001).

DOI: 10.1145/1015330.1015422

Google Scholar

[11] Y. Bengio, et al., Representation learning: A review and new perspectives, Pattern Analysis and Machine Intelligence, vol. 35, pp.1798-1828, (2013).

DOI: 10.1109/tpami.2013.50

Google Scholar

[12] G. Melli and C. Romming, An Overview of the CPROD1 Contest on Consumer Product Recognition within User Generated Postings and Normalization against a Large Product Catalog, in Proceedings of the ICDM-2012 Workshop on Consumer Product Contest 2012, pp.861-864.

DOI: 10.1109/icdmw.2012.104

Google Scholar

[13] S. Miller, et al., Name Tagging with Word Clusters and Discriminative Training, in Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2004, pp.337-342.

Google Scholar

[14] P. Liang, Semi-supervised learning for natural language, Massachusetts Institute of Technology, (2005).

Google Scholar

[15] L. Ratinov and D. Roth, Design challenges and misconceptions in named entity recognition, in Proceedings of the Thirteenth Conference on Computational Natural Language Learning, 2009, pp.147-155.

DOI: 10.3115/1596374.1596399

Google Scholar

[16] T. Mikolov, et al., Linguistic regularities in continuous space word representations, in Proceedings of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2013, pp.746-751.

Google Scholar

[17] R. Collobert and J. Weston, A unified architecture for natural language processing: Deep neural networks with multitask learning, in Proceedings of the 25th international conference on Machine learning, 2008, pp.160-167.

DOI: 10.1145/1390156.1390177

Google Scholar

[18] A. Mnih and G. E. Hinton, A Scalable Hierarchical Distributed Language Model, in Proceedings of the Conference on Neural Information Processing Systems, 2008, pp.1081-1088.

Google Scholar

[19] H. M. Wallach, Conditional random fields: An introduction, Technical Reports (CIS), p.22, (2004).

Google Scholar

[20] N. Okazaki. (2007). CRFsuite: a fast implementation of conditional random fields (CRFs). Available: http: /www. chokkan. org/software/crfsuite.

Google Scholar