Introduction

Emotions are highly useful to model human behavior being at the core of what makes us human. Today, people abundantly express and share emotions through social media. Technological advancements in such platforms enable sharing opinions or expressing any specific emotions towards what others have shared, mainly in the form of textual data. This entails an interesting arena for analysis; as to whether there is a disconnect between the writer’s intended emotion and the reader’s perception of textual content.In this context we procure a Readers’ Emotion News datasets by using the social news network, Rappler and its award-winning Mood Meter widget. Mood Meter enables readers to cast their emotion votes towards several categories of emotions (Afraid, Amused, Angry, Annoyed, Don’t care, Happy, Inspired, and Sad) and records the total percentage of votes obtained for each emotion. Unlike other sources, we choose Rappler due to its simplicity, popularity, and ease of organizing several news articles under multiple genres and associated emotion profiles. We manually collect only the popular news articles by checking for high emotion votings represented in the Rappler Mood Meter, to ensure that the selected news articles have a high social reach. RENh-4k is a short-text dataset with 4000 news documents and associated readers’ emotion profiles. News headlines and associated abstract/snippet are combined to form the documents, and corresponding readers’ emotion profiles are obtained from readers’ votings on Mood Meter for emotion classes: Afraid, Angry, Happy, Inspired, and Sad. We also assign documents into either of the categories, Health & well-being, Social issues or Others, after manually verifying news genres.

Dataset Sample

News Headline: Countries ban China arrivals as virus death toll hits 213
News Abstract: Nearly 10,000 people have been infected in China by the new coronavirus and new cases are found abroad, with more than ...
News Content: BEIJING, China – Countries stepped up travel restrictions on arrivals from China on Friday, January 31, after a global health emergency was declared over a viral epidemic that has killed 213 people. Nearly 10,000 people have been infected in China by the new coronavirus and ...
News Category: Health & well-being
Readers' Emotion:

Anger = 5%
Fear = 75%
Joy = 0%
Sadness = 20%
Surprise = 0%

People

  1. Anoop K, University of Calicut, Kerala, India. (anoopk_dcs@uoc.ac.in)
  2. Deepak P, Queen’s University Belfast, Northern Ireland, UK. (deepaksp@acm.org)
  3. Savitha Sam Abraham , School of Science and Technology, Örebro University, Örebro, Sweden.
  4. Lajish V L, University of Calicut, Kerala, India.
  5. Manjary P Gangan, University of Calicut, Kerala, India.

Related Publication

Anoop K., Deepak P., Savitha Sam Abraham, Lajish V. L., Manjary P. Gangan. Readers’ affect: predicting and understanding readers’ emotions with deep learning. J Big Data June 2022, 9:82, Springer Nature, ISSN: 2196-1115, DOI: https://doi.org/10.1186/s40537-022-00614-2


logo

Abstract: Emotions are highly useful to model human behavior being at the core of what makes us human. Today, people abundantly express and share emotions through social media. Technological advancements in such platforms enable sharing opinions or expressing any specific emotions towards what others have shared, mainly in the form of textual data. This entails an interesting arena for analysis; as to whether there is a disconnect between the writer’s intended emotion and the reader’s perception of textual content. In this paper, we present experiments for Readers’ Emotion Detection through multi-target regression settings by exploring a Bi-LSTM-based Attention model, where our major intention is to analyze the interpretability and effectiveness of the deep learning model for the task. To conduct experiments, we procure two extensive datasets REN-10k and RENh-4k, apart from using a popular benchmark dataset from SemEval-2007. We perform a two-phase experimental evaluation, first being various coarse-grained and fine-grained evaluations of our model performance in comparison with several baselines belonging to different categories of emotion detection, viz., deep learning, lexicon based, and classical machine learning. Secondly, we evaluate model behavior towards readers’ emotion detection assessing attention maps generated by the model through devising a novel set of qualitative and quantitative metrics. The first phase of experiments shows that our Bi-LSTM+Attention model significantly outperforms all baselines. The second analysis reveals that emotions may be correlated to specific words as well as named entities.

RENh-4k Datasets Download

Please follow the steps below to download the RENh-4k Dataset.

Step 1: Please fill the request form to submit.
Step 2: You will be given a download link within few days of submitting the request.
Step 3: If you use this dataset in your research, please acknowledge the Readers Emotion News headlines (RENh-4k) Dataset and its authors as the citation below:

Cite the work: BibTeX
Anoop K., Deepak P., Savitha Sam Abraham, Lajish V. L., Manjary P. Gangan. Readers’ affect: predicting and understanding readers’ emotions with deep learning. J Big Data June 2022, 9:82, Springer Nature, ISSN: 2196-1115, DOI: https://doi.org/10.1186/s40537-022-00614-2



Acknowledgements

The authors thankfully acknowledge the popular leading digital media company RAPPLER for allowing to procure news data along with associated emotions from their online portal that very relevantly helped to conduct this research; and the authors also acknowledge project interns Renjitha Rajendran and Shonima Sanil, Department of Information Technology Kannur University, Athira Biju and Sruthi S Kumar, Department of Computer Science Mahatma Gandhi University, and Amrutha Praseeth, Diya Rajan and Rahul Das H, the postgraduate students of Department of Computer Science University of Calicut, who have been involved in dataset procurement.

Dataset Download Request Form

Please enter a working email address.
We will use this email to send the confirmation.

RENH-4k Terms of Use

Copyright © 2022 by the Computational Intelligence and Data Analytics Lab, Department of Computer Science, University of Calicut, Kerala, India.
If it is your intent to use this dataset for non-commercial purposes, such as in academic research, this dataset is free.
If you use this dataset in your research, please acknowledge the RENh-4k Dataset and its authors as the citation below :-
Anoop K., Deepak P., Savitha Sam Abraham, Lajish V. L., Manjary P. Gangan.Readers’ affect: predicting and understanding readers’ emotions with deep learning. J Big Data June 2022, 9:82, Springer Nature, ISSN: 2196-1115, DOI: https://doi.org/10.1186/s40537-022-00614-2

I have read and agree to these terms of use