DSpace Repository

HindiMD: A Multi-domain Corpora for Low-resource Sentiment Analysis

dc.contributor.author MAMTA
dc.contributor.author EKBAL A.
dc.contributor.author BHATTACHARYYA P.
dc.contributor.author SAHA T.
dc.contributor.author KUMAR A.
dc.contributor.author SRIVASTAVA S.
dc.date.accessioned 2023-03-17T06:16:54Z
dc.date.available 2023-03-17T06:16:54Z
dc.date.issued 2022
dc.identifier.citation 2022 Language Resources and Evaluation Conference, LREC 2022, 7061-7070 en_US
dc.identifier.isbn 9791095546726
dc.identifier.uri http://localhost:8080/xmlui/handle/100/43080
dc.description.abstract Social media platforms such as Twitter have evolved into vast information-sharing platforms, allowing people from a variety of backgrounds and expertise to share their opinions on numerous events and topics such as terrorism, narcotics and many other social issues. People sometimes misuse the power of social media for their own agendas, such as illegal trade and negatively influencing others. Because of this, sentiment analysis has attracted the interest of many researchers as a way to analyze public opinion widely for social media monitoring. Several benchmark datasets for sentiment analysis across a range of domains have been made available, especially for high-resource languages. A few datasets are available for low-resource Indian languages like Hindi, such as movie reviews and product reviews, but these do not address the current need for social media monitoring. In this paper, we address the challenges of sentiment analysis in Hindi and socially relevant domains by introducing a balanced corpus annotated with the sentiment classes, viz. positive, negative and neutral. To show the effective usage of the dataset, we build several deep learning based models and establish them as the baselines for further research in this direction. © European Language Resources Association (ELRA), licensed under CC-BY-NC-4.0. en_US
dc.language.iso English en_US
dc.publisher European Language Resources Association (ELRA) en_US
dc.subject BERT en_US
dc.subject DEEP LEARNING en_US
dc.subject INDIAN LANGUAGE en_US
dc.subject LOW-RESOURCE LANGUAGE en_US
dc.subject MULTI-DOMAIN en_US
dc.subject SENTIMENT en_US
dc.subject.other Deep learning en_US
dc.subject.other Social aspects en_US
dc.subject.other Social networking (online) en_US
dc.subject.other BERT en_US
dc.subject.other Indian languages en_US
dc.subject.other Information sharing platforms en_US
dc.subject.other Low resource languages en_US
dc.subject.other Multi-domains en_US
dc.subject.other Sentiment en_US
dc.subject.other Sentiment analysis en_US
dc.subject.other Social media monitoring en_US
dc.subject.other Social media platforms en_US
dc.title HindiMD: A Multi-domain Corpora for Low-resource Sentiment Analysis en_US
dc.type Conference Paper en_US

