Introduction

Emotions are highly useful to model human behavior being at the core of what makes us human. Today, people abundantly express and share emotions through social media. Technological advancements in such platforms enable sharing opinions or expressing any specific emotions towards what others have shared, mainly in the form of textual data. This entails an interesting arena for analysis; as to whether there is a disconnect between the writer’s intended emotion and the reader’s perception of textual content.In this context we procure a Readers’ Emotion News datasets by using the social news network, Rappler and its award-winning Mood Meter widget. Mood Meter enables readers to cast their emotion votes towards several categories of emotions (Afraid, Amused, Angry, Annoyed, Don’t care, Happy, Inspired, and Sad) and records the total percentage of votes obtained for each emotion. Unlike other sources, we choose Rappler due to its simplicity, popularity, and ease of organizing several news articles under multiple genres and associated emotion profiles. We manually collect only the popular news articles by checking for high emotion votings represented in the Rappler Mood Meter, to ensure that the selected news articles have a high social reach. REN-20k is our newly curated Readers’ Emotion News dataset procured in a similar fashion of RENh-4k from the popular online news network Rappler, where we collect news articles manually, from the year span 2014 to 2019, by checking articles with high emotion votings in the Mood Meter widget of Rappler indicating high popularity and social reach of these articles. But it is an advanced version containing 20474 numbers of documents with corresponding readers' emotion profiles collected for diverse classes of emotions Afraid, Amused, Angry, Annoyed, Don't care, Happy, Inspired, and Sad. Also, each document consists of the whole news content including headlines, abstract and full-length news story excluding images and videos, making it a long-text dataset with average words per document as 527.84. With the help of genre information available in the portal and by manual annotations, we assign each document to a diverse set of genres, Business, Entertainment, Lifestyle, Sports, Technology and Others, unlike RENh-4k that considers only three genres, Health \& well-being, Social issues and Others.



Dataset Sample

News Headline: Countries ban China arrivals as virus death toll hits 213
News Abstract: Nearly 10,000 people have been infected in China by the new coronavirus and new cases are found abroad, with more than ...
News Content: BEIJING, China – Countries stepped up travel restrictions on arrivals from China on Friday, January 31, after a global health emergency was declared over a viral epidemic that has killed 213 people. Nearly 10,000 people have been infected in China by the new coronavirus and ...
News Category: Health & well-being
Readers' Emotion:

Anger = 5%
Fear = 75%
Joy = 0%
Sadness = 20%
Surprise = 0%


People

  1. Anoop K, University of Calicut, Kerala, India. (anoopk_dcs@uoc.ac.in)
  2. Deepak P, Queen’s University Belfast, Northern Ireland, UK. (deepaksp@acm.org)
  3. Manjary P Gangan, University of Calicut, Kerala, India.
  4. Savitha Sam Abraham , School of Science and Technology, Örebro University, Örebro, Sweden.
  5. Lajish V L, University of Calicut, Kerala, India.


Publication

Anoop Kadan, Deepak P., Manjary P. Gangan, Savitha Sam Abraham, Lajish V. L. REDAffectiveLM: Leveraging Affect Enriched Embedding and Transformer-based Neural Language Model for Readers' Emotion Detection. arXiv preprint arXiv:2301.08995 (2023) DOI: https://doi.org/10.48550/arXiv.2301.08995


logo

Abstract: Technological advancements in web platforms allow people to express and share emotions towards textual write-ups written and shared by others. This brings about different interesting domains for analysis; emotion expressed by the writer and emotion elicited from the readers. In this paper, we propose a novel approach for Readers' Emotion Detection from short-text documents using a deep learning model called REDAffectiveLM. Within state-of-the-art NLP tasks, it is well understood that utilizing context-specific representations from transformer-based pre-trained language models helps achieve improved performance. Within this affective computing task, we explore how incorporating affective information can further enhance performance. Towards this, we leverage context-specific and affect enriched representations by using a transformer-based pre-trained language model in tandem with affect enriched Bi-LSTM+Attention. For empirical evaluation, we procure a new dataset REN-20k, besides using RENh-4k and SemEval-2007. We evaluate the performance of our REDAffectiveLM rigorously across these datasets, against a vast set of state-of-the-art baselines, where our model consistently outperforms baselines and obtains statistically significant results. Our results establish that utilizing affect enriched representation along with context-specific representation within a neural architecture can considerably enhance readers' emotion detection. Since the impact of affect enrichment specifically in readers' emotion detection isn't well explored, we conduct a detailed analysis over affect enriched Bi-LSTM+Attention using qualitative and quantitative model behavior evaluation techniques. We observe that compared to conventional semantic embedding, affect enriched embedding increases the ability of the network to effectively identify and assign weightage to the key terms responsible for readers' emotion detection to improve prediction.

REN-20k Datasets Download

Please follow the steps below to download the REN-20k Dataset.

Step 1: Please fill the request form to submit.
Step 2: You will be given a download link within few days of submitting the request.
Step 3: If you use this dataset in your research, please acknowledge the Readers' Emotion News (REN-20k) Dataset and its authors as the citation below:

Cite the work: BibTeX
Anoop Kadan, Deepak P., Manjary P. Gangan, Savitha Sam Abraham, Lajish V. L. REDAffectiveLM: Leveraging Affect Enriched Embedding and Transformer-based Neural Language Model for Readers' Emotion Detection. arXiv preprint arXiv:2301.08995 (2023) DOI: https://doi.org/10.48550/arXiv.2301.08995



Other Related Publication

1. Anoop K., Deepak P., Savitha Sam Abraham, Lajish V. L., Manjary P. Gangan. Readers’ affect: predicting and understanding readers’ emotions with deep learning. J Big Data June 2022, 9:82, Springer Nature, ISSN: 2196-1115, DOI: https://doi.org/10.1186/s40537-022-00614-2


Acknowledgements

The authors thankfully acknowledge the popular leading digital media company RAPPLER for the data source of news data along with associated emotions from their online portal that very relevantly helped to conduct this research. The authors thankfully acknowledge Arjun K. Sreedhar, Dheeraj K., Sarath Kumar P. S., and Vishnu S., the postgraduate students of the Department of Computer Science, University of Calicut, who have been involved in dataset procurement. Manjary P. Gangan was supported by the Women Scientist Scheme-A (WOS-A), Department of Science and Technology (DST) of the Government of India for Research in Basic/Applied Science under the Grant SR/WOS-A/PM-62/2018.

Dataset Download Request Form

Please enter a working email address.
We will use this email to send the confirmation.

REN-20k Terms of Use

Copyright © 2022 by the Computational Intelligence and Data Analytics Lab, Department of Computer Science, University of Calicut, Kerala, India.
If it is your intent to use this dataset for non-commercial purposes, such as in academic research, this dataset is free.
If you use this dataset in your research, please acknowledge the REN-20k Dataset and its authors as the citation below :-
Anoop Kadan, Deepak P., Manjary P. Gangan, Savitha Sam Abraham, Lajish V. L. REDAffectiveLM: Leveraging Affect Enriched Embedding and Transformer-based Neural Language Model for Readers' Emotion Detection. arXiv preprint arXiv:2301.08995 (2023) DOI: https://doi.org/10.48550/arXiv.2301.08995

I have read and agree to these terms of use