Word Sense Disambiguation of Regional Language using Deep Learning

Kanthirekha Miriyala*
Periodicity: April - June 2025

Abstract

India is renowned for its linguistic diversity, in which Indians can take pride. Across these many languages there is a large number of meaningful words, and every language has terms with synonyms. When learning a new language becomes necessary, natural language processing (NLP) can help. NLP has been studied for more than 50 years and has its roots in the study of language. It is used in a variety of fields, including corporate intelligence, search engines, and medical research. Ambiguity is the quality of being open to multiple interpretations, and is frequently described as imprecision in a word's meaning. The task of resolving it is called disambiguation. This study uses a large-scale dataset of words with multiple meanings and senses. The dataset is in Telugu, a regional language of Andhra Pradesh, and some of its words are ambiguous when used in different contexts. Deep neural networks are used to resolve this ambiguity. Two models, namely Bidirectional Long Short-Term Memory (BiLSTM) and Bidirectional Gated Recurrent Unit (RB-GRU), are used to predict the appropriate sense of a given ambiguous word. Word sense prediction achieved an accuracy of 86.30% on this regional-language word sense disambiguation task. This result is notable when compared with other approaches to word sense prediction for various regional languages.
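The abstract names bidirectional recurrent models but does not give their configuration, so the following is only an illustrative sketch of the general idea, not the authors' implementation. It assumes a bias-free GRU cell, random (untrained) weights, random stand-in word vectors, and hypothetical names such as `GRUCell`, `bidirectional_states`, and `predict_sense`. It shows how a forward pass and a backward pass over a sentence can be concatenated at the position of the ambiguous word and mapped to a probability distribution over candidate senses.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def matvec(W, v):
    """Multiply matrix W (list of rows) by vector v."""
    return [sum(w * x for w, x in zip(row, v)) for row in W]


def rand_matrix(rows, cols, rng):
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


class GRUCell:
    """Bias-free GRU cell with update (z), reset (r) and candidate (n) gates."""

    def __init__(self, input_size, hidden_size, rng):
        self.hidden_size = hidden_size
        self.W = {g: rand_matrix(hidden_size, input_size, rng) for g in "zrn"}
        self.U = {g: rand_matrix(hidden_size, hidden_size, rng) for g in "zrn"}

    def step(self, x, h):
        z = [sigmoid(a + b) for a, b in zip(matvec(self.W["z"], x), matvec(self.U["z"], h))]
        r = [sigmoid(a + b) for a, b in zip(matvec(self.W["r"], x), matvec(self.U["r"], h))]
        reset_h = [ri * hi for ri, hi in zip(r, h)]
        n = [math.tanh(a + b) for a, b in zip(matvec(self.W["n"], x), matvec(self.U["n"], reset_h))]
        # Interpolate between the previous state and the candidate state.
        return [(1 - zi) * ni + zi * hi for zi, ni, hi in zip(z, n, h)]


def encode(cell, vectors):
    """Run the cell over the sequence and return the hidden state at each step."""
    h = [0.0] * cell.hidden_size
    states = []
    for x in vectors:
        h = cell.step(x, h)
        states.append(h)
    return states


def bidirectional_states(fwd, bwd, vectors):
    """Concatenate left-to-right and right-to-left states for every position."""
    forward = encode(fwd, vectors)
    backward = encode(bwd, vectors[::-1])[::-1]
    return [f + b for f, b in zip(forward, backward)]


def softmax(scores):
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def predict_sense(sentence_vectors, target_index, fwd, bwd, W_out):
    """Return a probability for each candidate sense of the word at target_index."""
    states = bidirectional_states(fwd, bwd, sentence_vectors)
    return softmax(matvec(W_out, states[target_index]))


# Hypothetical sizes: 8-dim word vectors, 6 hidden units per direction, 3 senses.
rng = random.Random(0)
EMBED, HIDDEN, SENSES = 8, 6, 3
fwd, bwd = GRUCell(EMBED, HIDDEN, rng), GRUCell(EMBED, HIDDEN, rng)
W_out = rand_matrix(SENSES, 2 * HIDDEN, rng)

# A five-word "sentence" of random stand-in vectors; the ambiguous word is at index 2.
sentence = [[rng.uniform(-1, 1) for _ in range(EMBED)] for _ in range(5)]
probs = predict_sense(sentence, 2, fwd, bwd, W_out)
print(probs)
```

Because the weights here are random, the printed probabilities carry no meaning. In practice such a model would be trained on sense-labelled sentences, typically with a deep learning framework, and a BiLSTM variant would replace the GRU cell with an LSTM cell while keeping the same two-direction structure.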

Keywords

Word Sense Disambiguation (WSD), Sense prediction method, Bidirectional Long Short-Term Memory (BiLSTM), Bidirectional Gated Recurrent Unit (RB-GRU), Deep Neural Networks.
