An Extractive Question Answering System for the Tamil Language

Aravind Krishnan; Srinivasa Ramanujan Sriram; Balaji Vishnu Raj Ganesan; S. Sridhar

doi:10.4028/p-tsrdcl

Abstract:

Question Answering is a central task in Natural Language Processing and has received considerable attention. With the development of multiple language models, question answering systems have been built and deployed to improve information retrieval. These systems, however, have largely been implemented only for English. Our objective was to create such a question answering system for the Tamil language. We chose XLM-RoBERTa as our language model, which has been trained on a variety of datasets, and we used a hand-annotated dataset for validation. We trained the model on two types of datasets: the first contained only Tamil, and the second was a mixture of Indian languages that included Tamil. The results were satisfactory in both cases. Because training the model required a large amount of computational power, we used the Colab Pro Plus cloud GPU from Google. We will also publish our dataset on Hugging Face so that fellow researchers can use it for further analysis.
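
The abstract does not describe how an answer is decoded from the model output. As a minimal sketch, assuming the usual extractive setup in which the model emits one start logit and one end logit per context token, the answer span can be chosen as the pair of positions with the highest combined score. The function name `best_answer_span`, the `max_answer_len` parameter, and the example tokens and logit values are illustrative assumptions, not taken from the paper.

```python
from typing import List, Tuple


def best_answer_span(
    start_logits: List[float],
    end_logits: List[float],
    max_answer_len: int = 30,
) -> Tuple[int, int, float]:
    """Return (start, end, score) for the highest-scoring valid span.

    A span is valid when start <= end and it covers at most
    max_answer_len tokens. Ties keep the earliest span found.
    """
    if len(start_logits) != len(end_logits) or not start_logits:
        raise ValueError("logit lists must be non-empty and of equal length")
    best = (0, 0, float("-inf"))
    for start, s_score in enumerate(start_logits):
        # Only consider end positions at or after the start position.
        last = min(start + max_answer_len, len(end_logits))
        for end in range(start, last):
            score = s_score + end_logits[end]
            if score > best[2]:
                best = (start, end, score)
    return best


# Hypothetical context tokens and made-up logits, for illustration only.
tokens = ["சென்னை", "தமிழ்நாட்டின்", "தலைநகரம்", "ஆகும்"]
start_logits = [4.0, 0.5, 0.2, -1.0]
end_logits = [3.5, 0.1, 0.3, -2.0]

start, end, score = best_answer_span(start_logits, end_logits)
answer = " ".join(tokens[start : end + 1])
```

In practice the logits would come from a fine-tuned question answering head rather than being written by hand, and subword tokens would be mapped back to character offsets in the original context.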

Info:

Periodical:

Advances in Science and Technology (Volume 124)

Pages:

312-319

DOI:

https://doi.org/10.4028/p-tsrdcl

Online since:

February 2023

Authors:

Aravind Krishnan, Srinivasa Ramanujan Sriram*, Balaji Vishnu Raj Ganesan, S. Sridhar

Keywords:

Deep Learning, Natural Language Processing (NLP), Question Answering, Question Answering Dataset, Tamil Question Answering, Xlm Roberta

* - Corresponding Author
